Computer Vision
CSE471Prof. Makarand Tapaswi + Prof. Charu Sharma•Spring 2025-26•4 credits
Unit 4 — Convolutional Neural Networks (CNNs)
The CNN era — and the first 'real' CV topic of the course. Convolutional layers (params, RF, padding, stride, dilation); 1×1 convs; pooling; batch normalisation in CNNs; equivariance vs invariance. Architectures in chronological order: LeNet → AlexNet → VGG → Inception → ResNet → DenseNet → SENet → MobileNet → EfficientNet. Closes with backprop through convs/pools and CNNs for video / audio.