About

I am a PhD student in Computer Science at IIIT-Delhi. My research focuses on video generation and understanding, with a particular interest in model efficiency and temporal modeling. Currently, I am studying how video-language models represent and reason about temporal dynamics.

Papers

Temporal Fidelity: Probing the Temporal Resolution of Video-Language Models

Sai Varun Jamalpoor, Mukesh Mohania, Vikram Goyal
Preprint, Under Review

TL;DR: We introduce a psychophysics-inspired framework to probe the temporal resolution limits of Video-LLMs, identifying a representation-behavior gap in temporal sensitivity.

Education

  • IIIT-Delhi 2025 – Present
    PhD Candidate in Computer Science

    I research generative video models, specifically focusing on temporal consistency, frame interpolation, and distributed training setups.

  • IIST Trivandrum 2023 – 2025
    M.Tech in Machine Learning and Computing

    Studied optimization, neural representation models, and statistical learning. My master's thesis was supervised by Prof. Deepak Mishra.

  • KNR College of Engineering 2018 – 2022
    B.Tech in Computer Science and Engineering

    Learned basic computer science theory, systems programming, and algorithms.

Teaching & Mentorship

Teaching Assistant
Database Management Systems (CSE202), IIIT-Delhi, Winter 2026
Teaching Assistant
Information Integration and Applications (CSE656), IIIT-Delhi, Monsoon 2025
Mentor
Helping undergraduate students on deep learning and vision projects at IIIT-Delhi

Writing

Writing on AI, generative models, and the machinery underneath.

May 6, 2026 · interactive · 25 min

Understanding Diffusion Models

Building up diffusion models from scratch — the math, the intuition, and interactive visualizations.