LLAVIDAL: A Large Language Vision Model for Daily Activities of Living
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
Dominick Reilly, Rajatsubhra Chakraborty, Arkaprava Sinha, Manish Kumar Govind, Pu Wang, Francois Bremond, Le Xue, Srijan Das
Paper | Code
SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025
Arkaprava Sinha, Dominick Reilly, Francois Bremond, Pu Wang, Srijan Das
Paper | Code
MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection
Preprint
Arkaprava Sinha, Monish Soundar Raj, Pu Wang, Ahmed Helmy, Srijan Das
Paper | Code
Quo Vadis, Video Understanding with Vision-Language Foundation Models?
NeurIPS Workshop on Video-Language Models, 2024
Mahmoud Ali, Di Yang, Arkaprava Sinha, Dominick Reilly, Srijan Das, Gianpiero Francesca, Francois Bremond
Paper