arXiv:1905.09035, v2 (latest)
What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention
IEEE International Conference on Computer Vision (ICCV), 2019
22 May 2019
Antonino Furnari
G. Farinella
EgoV
Github (132★)
Papers citing
"What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention"
50 / 112 papers shown
Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span
Heeseung Yun
Joonil Na
Jaeyeon Kim
Calvin Murdock
Gunhee Kim
92
0
0
23 Nov 2025
Countering Multi-modal Representation Collapse through Rank-targeted Fusion
Seulgi Kim
Kiran Kokilepersaud
Mohit Prabhushankar
Ghassan AlRegib
116
0
0
09 Nov 2025
Gaze-VLM: Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding
Anupam Pani
Yanchao Yang
120
0
0
24 Oct 2025
Action-Dynamics Modeling and Cross-Temporal Interaction for Online Action Understanding
Xinyu Yang
Zheheng Jiang
Feixiang Zhou
Yihang Zhu
Na Lv
Nan Xing
Huiyu Zhou
137
0
0
12 Oct 2025
Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
Computer Vision and Pattern Recognition (CVPR), 2025
Tomoya Yoshida
Shuhei Kurita
Taichi Nishimura
Shinsuke Mori
292
2
0
04 Jun 2025
Efficient Egocentric Action Recognition with Multimodal Data
Marco Calzavara
Ard Kastrati
Matteo Macchini
Dushan Vasilevski
Roger Wattenhofer
EgoV
252
0
0
02 Jun 2025
The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation
Masashi Hatano
Zhifan Zhu
Hideo Saito
Dima Damen
EgoV
345
6
0
11 Apr 2025
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Dibyadip Chatterjee
Edoardo Remelli
Yale Song
Bugra Tekin
Abhay Mittal
...
Shreyas Hampali
Eric Sauser
Shugao Ma
Angela Yao
Fadime Sener
VLM
260
3
0
10 Apr 2025
Context-Enhanced Memory-Refined Transformer for Online Action Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Zhanzhong Pang
Fadime Sener
Angela Yao
OffRL
352
5
0
24 Mar 2025
DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos
Computer Vision and Pattern Recognition (CVPR), 2025
Lorenzo Mur-Labadia
Josechu Guerrero
Ruben Martinez-Cantin
VGen
304
0
0
11 Mar 2025
Optimizing Multitask Industrial Processes with Predictive Action Guidance
IEEE Transactions on Automation Science and Engineering (T-ASE), 2025
Naval Kishore Mehta
Arvind
Shyam Sunder Prasad
Sumeet Saurav
Sanjay Singh
156
0
0
10 Jan 2025
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Yuanmin Huang
Jilan Xu
Baoqi Pei
Yuping He
Guo Chen
...
Kunpeng Li
C. Yuan
Yidan Wang
Yu Qiao
L. Wang
453
13
0
31 Dec 2024
HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Zirui Wang
Xinran Zhao
Simon Stepputtis
Woojun Kim
Tongshuang Wu
Katia Sycara
Yaqi Xie
OffRL
331
1
0
03 Nov 2024
Human Action Anticipation: A Survey
Bolin Lai
Sam Toyer
Tushar Nagarajan
Rohit Girdhar
S. Zha
James M. Rehg
Kris Kitani
Kristen Grauman
Ruta Desai
Miao Liu
AI4TS
303
7
0
17 Oct 2024
CathAction: A Benchmark for Endovascular Intervention Understanding
Baoru Huang
Tuan Vo
Chayun Kongtongvattana
G. Dagnino
Dennis Kundrat
...
Francisco Vasconcelos
Danail Stoyanov
Daniel Elson
Ferdinando Rodriguez y Baena
Anh Nguyen
195
6
0
23 Aug 2024
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation
Xin Liu
Chao Hao
Zitong Yu
Huanjing Yue
Jingyu Yang
257
2
0
05 Aug 2024
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
Ming Hu
Peng Xia
Lin Wang
Siyuan Yan
Feilong Tang
...
Xuelian Cheng
Jun Cheng
Chi Liu
Kaijing Zhou
Zongyuan Ge
334
27
0
11 Jun 2024
Bidirectional Progressive Transformer for Interaction Intention Anticipation
European Conference on Computer Vision (ECCV), 2024
Zichen Zhang
Hongcheng Luo
Wei Zhai
Yang Cao
Yu Kang
325
8
0
09 May 2024
Uncertainty-boosted Robust Video Activity Anticipation
Zhaobo Qi
Shuhui Wang
Weigang Zhang
Qingming Huang
292
10
0
29 Apr 2024
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
Computer Vision and Pattern Recognition (CVPR), 2024
Yifei Huang
Guo Chen
Jilan Xu
Mingfang Zhang
Lijin Yang
...
Hongjie Zhang
Yi Liu
Yali Wang
Limin Wang
Yu Qiao
EgoV
418
82
0
24 Mar 2024
Intention Action Anticipation Model with Guide-Feedback Loop Mechanism
Zongnan Ma
Fuchun Zhang
Zhixiong Nan
Yao Ge
222
5
0
19 Mar 2024
On the Efficacy of Text-Based Input Modalities for Action Anticipation
Apoorva Beedu
Karan Samel
Irfan Essa
403
4
0
23 Jan 2024
Instance Tracking in 3D Scenes from Egocentric Videos
Yunhan Zhao
Haoyu Ma
Shu Kong
Charless C. Fowlkes
3DPC
317
11
0
07 Dec 2023
DiffAnt: Diffusion Models for Action Anticipation
Zeyun Zhong
Chengzhi Wu
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
DiffM
VGen
198
9
0
27 Nov 2023
GePSAn: Generative Procedure Step Anticipation in Cooking Videos
IEEE International Conference on Computer Vision (ICCV), 2023
M. A. Abdelsalam
Samrudhdhi B. Rangrej
Isma Hadji
Nikita Dvornik
Konstantinos G. Derpanis
Afsaneh Fazly
AI4TS
217
8
0
12 Oct 2023
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
304
15
0
29 Sep 2023
Knowledge-Guided Short-Context Action Anticipation in Human-Centric Videos
Sarthak Bhagat
Simon Stepputtis
Joseph Campbell
Katia Sycara
224
4
0
12 Sep 2023
Multi-label affordance mapping from egocentric vision
IEEE International Conference on Computer Vision (ICCV), 2023
Lorenzo Mur-Labadia
Jose J. Guerrero
Ruben Martinez-Cantin
EgoV
223
23
0
05 Sep 2023
Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Qitong Wang
Long Zhao
Liangzhe Yuan
Ting Liu
Xi Peng
370
21
0
22 Aug 2023
Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023
Sanket Thakur
Cigdem Beyan
Pietro Morerio
Vittorio Murino
Alessio Del Bue
276
20
0
16 Aug 2023
Memory-and-Anticipation Transformer for Online Action Understanding
IEEE International Conference on Computer Vision (ICCV), 2023
Jiahao Wang
Guo Chen
Yifei Huang
Limin Wang
Tong Lu
OffRL
303
60
0
15 Aug 2023
An Outlook into the Future of Egocentric Vision
International Journal of Computer Vision (IJCV), 2023
Chiara Plizzari
Gabriele Goletto
Antonino Furnari
Siddhant Bansal
Francesco Ragusa
G. Farinella
Dima Damen
Tatiana Tommasi
EgoV
294
72
0
14 Aug 2023
Multimodal Distillation for Egocentric Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Gorjan Radevski
Dusan Grujicic
Marie-Francine Moens
Matthew Blaschko
Tinne Tuytelaars
EgoV
331
35
0
14 Jul 2023
EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023
Cristhian Forigua
María Escobar
Jordi Pont-Tuset
Kevis-Kokitsi Maninis
Pablo Arbelaez
EgoV
238
2
0
29 Jun 2023
Guided Attention for Next Active Object @ EGO4D STA Challenge
Sanket Thakur
Cigdem Beyan
Pietro Morerio
Vittorio Murino
Alessio Del Bue
254
0
0
25 May 2023
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Neurocomputing, 2023
Thanh-Dat Truong
Khoa Luu
EgoV
389
15
0
25 May 2023
VideoLLM: Modeling Video Sequence with Large Language Models
Guo Chen
Yin-Dong Zheng
Jiahao Wang
Jilan Xu
Yifei Huang
...
Yi Wang
Yali Wang
Yu Qiao
Tong Lu
Limin Wang
MLLM
261
112
0
22 May 2023
Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention
IEEE International Conference on Image Processing (ICIP), 2023
Sanket Thakur
Cigdem Beyan
Pietro Morerio
Vittorio Murino
Alessio Del Bue
157
8
0
22 May 2023
Pretrained Language Models as Visual Planners for Human Assistance
IEEE International Conference on Computer Vision (ICCV), 2023
Dhruvesh Patel
H. Eghbalzadeh
Nitin Kamra
Michael L. Iuzzolino
Unnat Jain
Ruta Desai
LM&Ro
328
35
0
17 Apr 2023
Affordances from Human Videos as a Versatile Representation for Robotics
Computer Vision and Pattern Recognition (CVPR), 2023
Shikhar Bahl
Russell Mendonca
Lili Chen
Unnat Jain
Deepak Pathak
369
257
0
17 Apr 2023
StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation
Francesco Ragusa
G. Farinella
Antonino Furnari
206
22
0
08 Apr 2023
Anticipating Next Active Objects for Egocentric Videos
IEEE Access, 2023
Sanket Thakur
Cigdem Beyan
Pietro Morerio
Vittorio Murino
Alessio Del Bue
EgoV
299
9
0
13 Feb 2023
Zero-Shot Robot Manipulation from Passive Human Videos
Homanga Bharadhwaj
Abhi Gupta
Shubham Tulsiani
Vikash Kumar
309
51
0
03 Feb 2023
FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations
Computer Vision and Pattern Recognition (CVPR), 2022
Christian Diller
Thomas Funkhouser
Angela Dai
281
5
0
25 Nov 2022
TAMFormer: Multi-Modal Transformer with Learned Attention Mask for Early Intent Prediction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Nada Osman
Guglielmo Camporese
Lamberto Ballan
126
15
0
26 Oct 2022
Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022
Zeyun Zhong
David Schneider
Michael Voit
Rainer Stiefelhagen
Jürgen Beyerer
178
60
0
23 Oct 2022
Rethinking Learning Approaches for Long-Term Action Anticipation
European Conference on Computer Vision (ECCV), 2022
Megha Nawhal
Akash Abdu Jyothi
Greg Mori
AI4TS
174
39
0
20 Oct 2022
Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation
Findings (Findings), 2022
Sayontan Ghosh
Tanvi Aggarwal
Minh Hoai
Niranjan Balasubramanian
VLM
222
4
0
12 Oct 2022
ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval
Asian Conference on Computer Vision (ACCV), 2022
A. Fragomeni
Michael Wray
Dima Damen
CLIP
ViT
145
4
0
09 Oct 2022
Visual Object Tracking in First Person Vision
International Journal of Computer Vision (IJCV), 2022
Matteo Dunnhofer
Antonino Furnari
G. Farinella
C. Micheloni
235
41
0
27 Sep 2022