arXiv: 2503.13646

Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos

Computer Vision and Pattern Recognition (CVPR), 2025

17 March 2025

Chiara Plizzari

Achin Kulshrestha

Papers citing "Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos"

11 / 11 papers shown

EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning

Yogesh Kulkarni

383

0

0

23 Nov 2025

Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges

230

0

0

17 Nov 2025

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

...

Danda Pani Paudel

708

4

0

29 Oct 2025

Training-free Online Video Step Grounding

Massimiliano Mancini

Alessio Tonioni

128

0

0

19 Oct 2025

EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark

...

257

2

0

07 Oct 2025

Reasoning under Vision: Understanding Visual-Spatial Cognition in Vision-Language Models for CAPTCHA

Luke Tenyi Chang

96

0

0

07 Oct 2025

HumanPCR: Probing MLLM Capabilities in Diverse Human-Centric Scenes

...

180

1

0

19 Aug 2025

Causality Matters: How Temporal Information Emerges in Video Language Models

111

1

0

15 Aug 2025

EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering

Danda Pani Paudel

278

4

0

14 Aug 2025

EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses

IEEE Transactions on Visualization and Computer Graphics (TVCG), 2025

Akshay Paruchuri

Lavisha Aggarwal

Achin Kulshrestha

Ishan Chatterjee

181

1

0

03 Aug 2025

EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs

Antonino Furnari

Subarna Tripathi

203

0

0

06 Jun 2025