Ramakrishna Mission Vivekananda Educational and Research Institute (RKMVERI)

ACVDL Course Archives

CS411: Applications of Computer Vision and Deep Learning

Department of Computer Science & Big Data Analytics

This master archive consolidates the comprehensive curriculum, lecture notes, coding tutorials, student projects showcases, and end-semester examinations across academic iterations of CS411 for PMRF teaching work. Commencing with foundational mathematical optimization and PyTorch programming, the course progresses toward practical implementation of cutting-edge deep learning architectures, including Variational Autoencoders (VAEs), Convolutional Neural Networks (CNNs), Vision Transformers, and Large Language Models (LLMs).

Instructor Jimut Bahan Pal
Academic Level Postgraduate / Advanced Elective
Course Credits 3 Credits (50+ Lecture Hours per Offering)

Academic Year Archives

Course Editions
Spring 2026 Current Curriculum Archive

Generative Vision, Transformers & LLMs

Access 2026 Archive →

Comprehensive archives covering PyTorch pipelines, Softmax & Cross-Entropy derivations, VAEs, Deep NLP, Vision Transformers, LLM Training & Tuning, LLM workflows, and specialized guest research seminars on Diffusion Frameworks (GENIE) and Domain Generalization (HiDISC).

61 Hours Covered 17 Lecture Modules 4 Assignments End-Semester Question Papers
Spring 2025 Archival Repository

Advanced Sampling, VAEs & Image/Video Processing

Extensive coverage of sampling techniques (Rejection, Gibbs, Inverse Transform, Langevin, MALA, Metropolis-Hastings), parameter calculations, VAE theory via EM, and computer vision segmentation methods. Features guest talks on Open-Vocabulary Segmentation (Saikat Dutta), Adaptive Task-Arithmetic (Vaibhav Rathore), and Multimodal Learning (Aniket Thomas).

Highlighted Student Work:
Assignment 1 (ASCII Art & Video Generation): Best submissions by Pranab Kumar Mondal & Sushovan Pan (Colored Video).
Assignment 2 (Classification Report): Best submission by Sanmitra Sur.
57.5 Hours Covered 17 Lecture Modules 3 Core Assignments End-Semester Exam
Spring 2024 Inaugural Edition

Neural Network Foundations, NLP & Segmentation Pipelines

Access 2024 Archive →

The foundational online course offering covering PyTorch autograd, classification pipelines, U-Net semantic segmentation (Skin Lesion), VAE theory, and Transformers/Attention. Includes specialized lectures on NLP, Word2Vec, and Neural Machine Translation by guest speaker Seshadri Mazumder.

Student Recitation Presentations:
Privacy in Age Recognition From Images — Anirban & Sourish
Uncertainty Sets using Conformal Prediction — Bidit & Srijan
Normalizing Flows — Shreyas
46.5 Hours Covered 14 Lecture Modules 2 Multi-Class Assignments Recitation & Viva

Project Showcase

Highlighted Cohort Portal • Spring 2025

ACVDL Student Projects & Applications Gallery

Explore end-to-end computer vision applications, empirical evaluations, and custom deep learning pipelines engineered and deployed by graduate and undergraduate students.

Open Showcase Gallery ↗

Special Interest Groups (SIG)

Algorithmic Problem Solving • Biweekly Sessions

Competitive Coding & Problem Solving with Python (CCP-SIG)

Organized to maintain accountability and consistency in algorithmic mastery. The group meets biweekly to analyze optimal time complexities and implement solutions in rapid-prototyping Python. Focuses heavily on LeetCode Data Structures & Algorithms (Arrays, Two-Pointers, Subarrays, Linked Lists) before transitioning to competitive programming challenges on Codeforces.

Open CCP-SIG Archive →