Home

Hello! I am Arkaprava Sinha, a Graduate Research Assistant pursuing a Ph.D. in Computer Science at the University of North Carolina, Charlotte. I am advised by Prof. Srijan Das. My research focuses on Video Understanding, Multimodal Learning, and Temporal Action Detection. I have prior experience as a Data Scientist, contributing to innovative projects in Computer Vision, and Machine Learning.

Research

My research is centered on video representation learning, with a focus on utilizing spatio-temporal attention mechanisms and multiple modalities. I am also deeply interested in Vision-Language Models and Self-Supervised Learning techniques. The primary applications of my research encompass Action Classification in Trimmed Videos, Temporal Action Detection in Untrimmed Videos, Robotic Vision, and the development of Video Conversational Agents.

News

Feb 2025 - LLAVIDAL accepted to CVPR 2025.
Dec 2024 - SKI Models accepted to AAAI 2025.
Oct 2024 - 2 papers accepted to NeurIPS 2024 workshops. Early version of LLAVIDAL is presented in NeurIPS 2024 workshop on Video-Language Models and Multimodal Algorithmic Reasoning.

Selected Publications

  • LLAVIDAL: A Large Language Vision Model for Daily Activities of Living
    The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
    Dominick Reilly, Rajatsubhra Chakraborty, Arkaprava Sinha, Manish Kumar Govind, Pu Wang, Francois Bremond, Le Xue, Srijan Das
    Paper | Code

  • SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
    The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025
    Arkaprava Sinha, Dominick Reilly, Francois Bremond, Pu Wang, Srijan Das
    Paper | Code

  • MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection
    Preprint
    Arkaprava Sinha, Monish Soundar Raj, Pu Wang, Ahmed Helmy, Srijan Das
    Paper | Code

  • Quo Vadis, Video Understanding with Vision-Language Foundation Models?
    NeurIPS Workshop on Video-Language Models, 2024
    Mahmoud Ali, Di Yang, Arkaprava Sinha, Dominick Reilly, Srijan Das, Gianpiero Francesca, Francois Bremond
    Paper