Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2503.13646
Cited By
Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos
Computer Vision and Pattern Recognition (CVPR), 2025
17 March 2025
Chiara Plizzari
A. Tonioni
Yongqin Xian
Achin Kulshrestha
F. Tombari
EgoV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos"
11 / 11 papers shown
EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning
Yogesh Kulkarni
Pooyan Fazli
EgoV
LRM
383
0
0
23 Nov 2025
Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges
Junlong Li
Huaiyuan Xu
Sijie Cheng
Kejun Wu
Kim-Hui Yap
Lap-Pui Chau
Yi Wang
EgoV
230
0
0
17 Nov 2025
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
Xu Zheng
Zihao Dongfang
Lutao Jiang
Boyuan Zheng
Yulong Guo
...
L. Zhang
Danda Pani Paudel
Nicu Sebe
Luc Van Gool
Xuming Hu
LRM
VLM
708
4
0
29 Oct 2025
Training-free Online Video Step Grounding
Luca Zanella
Massimiliano Mancini
Yiming Wang
Alessio Tonioni
Elisa Ricci
128
0
0
19 Oct 2025
EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark
Deheng Zhang
Yuqian Fu
Runyi Yang
Yang Miao
Tianwen Qian
...
Ajad Chhatkuli
Xuanjing Huang
Yu-Gang Jiang
Luc Van Gool
D. Paudel
EgoV
257
2
0
07 Oct 2025
Reasoning under Vision: Understanding Visual-Spatial Cognition in Vision-Language Models for CAPTCHA
Python Song
Luke Tenyi Chang
Yun-Yun Tsai
Penghui Li
Junfeng Yang
LRM
96
0
0
07 Oct 2025
HumanPCR: Probing MLLM Capabilities in Diverse Human-Centric Scenes
Keliang Li
Hongze Shen
Hao Shi
Ruibing Hou
Hong Chang
...
Wen Wang
Yiling Wu
Shihong Deng
Shiguang Shan
Xilin Chen
LRM
180
1
0
19 Aug 2025
Causality Matters: How Temporal Information Emerges in Video Language Models
Yumeng Shi
Quanyu Long
Yin Wu
Wenya Wang
111
1
0
15 Aug 2025
EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering
Yanjun Li
Yuqian Fu
Tianwen Qian
Qiáo Xu
Silong Dai
Danda Pani Paudel
Luc Van Gool
Xiaoling Wang
EgoV
VLM
278
4
0
14 Aug 2025
EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2025
Akshay Paruchuri
Sinan Hersek
Lavisha Aggarwal
Qiao Yang
Xin Liu
Achin Kulshrestha
Andrea Colaco
Henry Fuchs
Ishan Chatterjee
EgoV
181
1
0
03 Aug 2025
EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs
Ivan Rodin
Tz-Ying Wu
Kyle Min
S. N. Sridhar
Antonino Furnari
Subarna Tripathi
G. Farinella
203
0
0
06 Jun 2025
1