2410.07177
MM-Ego: Towards Building Egocentric Multimodal LLMs for Video QA
9 October 2024
Hanrong Ye
Haotian Zhang
Erik Daxberger
Lin Chen
Zongyu Lin
Yanghao Li
Bowen Zhang
Haoxuan You
Dan Xu
Zhe Gan
Jiasen Lu
Yinfei Yang
EgoV
MLLM
Papers citing
"MM-Ego: Towards Building Egocentric Multimodal LLMs for Video QA"
8 / 8 papers shown
Title
Advancing Egocentric Video Question Answering with Multimodal Large Language Models
Alkesh Patel
Vibhav Chitalia
Yinfei Yang
18
0
0
06 Apr 2025
ProbRes: Probabilistic Jump Diffusion for Open-World Egocentric Activity Recognition
Sanjoy Kundu
Shanmukha Vellamchetti
Sathyanarayanan N. Aakur
EgoV
47
0
0
04 Apr 2025
LLaVAction: evaluating and training multi-modal large language models for action recognition
Shaokai Ye
Haozhe Qi
Alexander Mathis
Mackenzie W. Mathis
57
1
0
24 Mar 2025
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
Erik Daxberger
Nina Wenzel
David Griffiths
Haiming Gang
Justin Lazarow
...
Kai Kang
Marcin Eichner
Y. Yang
Afshin Dehghan
Peter Grasch
72
2
0
17 Mar 2025
EgoBlind: Towards Egocentric Visual Assistance for the Blind People
Junbin Xiao
Nanxin Huang
Hao Qiu
Zhulin Tao
Xun Yang
Richang Hong
M. Wang
Angela Yao
EgoV
VLM
58
0
0
11 Mar 2025
EgoLife: Towards Egocentric Life Assistant
Jingkang Yang
Shuai Liu
Hongming Guo
Yuhao Dong
X. Zhang
...
Joerg Widmer
Francesco Gringoli
Lei Yang
Bo Li
Z. Liu
EgoV
43
2
0
05 Mar 2025
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
Jihan Yang
Shusheng Yang
Anjali W. Gupta
Rilyn Han
Li Fei-Fei
Saining Xie
LRM
119
50
0
18 Dec 2024
Slot State Space Models
Jindong Jiang
Fei Deng
Gautam Singh
Minseung Lee
Sungjin Ahn
28
4
0
18 Jun 2024