v1v2 (latest)

What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention

IEEE International Conference on Computer Vision (ICCV), 2019

22 May 2019

Antonino Furnari

G. Farinella

EgoV

ArXiv (abs)PDF HTML Github (132★)

Papers citing "What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention"

50 / 112 papers shown

Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span

23 Nov 2025

Countering Multi-modal Representation Collapse through Rank-targeted Fusion

116

09 Nov 2025

Gaze-VLM:Bridging Gaze and VLMs through Attention Regularization for Egocentric Understanding

Anupam Pani

Yanchao Yang

120

24 Oct 2025

Action-Dynamics Modeling and Cross-Temporal Interaction for Online Action Understanding

137

12 Oct 2025

Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric VisionComputer Vision and Pattern Recognition (CVPR), 2025

292

04 Jun 2025

Efficient Egocentric Action Recognition with Multimodal Data

252

02 Jun 2025

The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation

345

11 Apr 2025

Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding

...

260

10 Apr 2025

Context-Enhanced Memory-Refined Transformer for Online Action DetectionComputer Vision and Pattern Recognition (CVPR), 2025

352

24 Mar 2025

DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric VideosComputer Vision and Pattern Recognition (CVPR), 2025

Lorenzo Mur-Labadia

Josechu Guerrero

Ruben Martinez-Cantin

VGen

304

11 Mar 2025

Optimizing Multitask Industrial Processes with Predictive Action GuidanceIEEE Transactions on Automation Science and Engineering (T-ASE), 2025

156

10 Jan 2025

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

...

453

31 Dec 2024

HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation

331

03 Nov 2024

Human Action Anticipation: A Survey

303

17 Oct 2024

CathAction: A Benchmark for Endovascular Intervention Understanding

Baoru Huang

Tuan Vo

Chayun Kongtongvattana

G. Dagnino

Dennis Kundrat

...

Francisco Vasconcelos

Danail Stoyanov

Daniel Elson

Ferdinando Rodriguez y Baena

Anh Nguyen

195

23 Aug 2024

From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation

257

05 Aug 2024

OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding

Ming Hu

...

Zongyuan Ge

334

11 Jun 2024

Bidirectional Progressive Transformer for Interaction Intention AnticipationEuropean Conference on Computer Vision (ECCV), 2024

Yang Cao

325

09 May 2024

Uncertainty-boosted Robust Video Activity Anticipation

292

29 Apr 2024

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real WorldComputer Vision and Pattern Recognition (CVPR), 2024

Yifei Huang

...

Yu Qiao

418

24 Mar 2024

Intention Action Anticipation Model with Guide-Feedback Loop Mechanism

222

19 Mar 2024

On the Efficacy of Text-Based Input Modalities for Action Anticipation

Apoorva Beedu

Karan Samel

Irfan Essa

403

23 Jan 2024

Instance Tracking in 3D Scenes from Egocentric Videos

317

07 Dec 2023

DiffAnt: Diffusion Models for Action Anticipation

Juergen Gall

198

27 Nov 2023

GePSAn: Generative Procedure Step Anticipation in Cooking VideosIEEE International Conference on Computer Vision (ICCV), 2023

M. A. Abdelsalam

Samrudhdhi B. Rangrej

Isma Hadji

Nikita Dvornik

Konstantinos G. Derpanis

Afsaneh Fazly

AI4TS

217

12 Oct 2023

A Survey on Deep Learning Techniques for Action Anticipation

304

29 Sep 2023

Knowledge-Guided Short-Context Action Anticipation in Human-Centric Videos

224

12 Sep 2023

Multi-label affordance mapping from egocentric visionIEEE International Conference on Computer Vision (ICCV), 2023

Lorenzo Mur-Labadia

Jose J. Guerrero

Ruben Martinez-Cantin

EgoV

223

05 Sep 2023

Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video RecognitionIEEE International Conference on Computer Vision (ICCV), 2023

370

22 Aug 2023

Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric VideosIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

276

16 Aug 2023

Memory-and-Anticipation Transformer for Online Action UnderstandingIEEE International Conference on Computer Vision (ICCV), 2023

Yifei Huang

303

15 Aug 2023

An Outlook into the Future of Egocentric VisionInternational Journal of Computer Vision (IJCV), 2023

Dima Damen

294

14 Aug 2023

Multimodal Distillation for Egocentric Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2023

Gorjan Radevski

Dusan Grujicic

Marie-Francine Moens

Matthew Blaschko

Tinne Tuytelaars

EgoV

331

14 Jul 2023

EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023

Cristhian Forigua

María Escobar

Jordi Pont-Tuset

Kevis-Kokitsi Maninis

Pablo Arbelaez

EgoV

238

29 Jun 2023

Guided Attention for Next Active Object @ EGO4D STA Challenge

254

25 May 2023

Cross-view Action Recognition Understanding From Exocentric to Egocentric PerspectiveNeurocomputing (Neurocomputing), 2023

Thanh-Dat Truong

Khoa Luu

EgoV

389

25 May 2023

VideoLLM: Modeling Video Sequence with Large Language Models

Yifei Huang

...

Yi Wang

Yu Qiao

261

112

22 May 2023

Enhancing Next Active Object-based Egocentric Action Anticipation with Guided AttentionInternational Conference on Information Photonics (ICIP), 2023

157

22 May 2023

Pretrained Language Models as Visual Planners for Human AssistanceIEEE International Conference on Computer Vision (ICCV), 2023

Ruta Desai

328

17 Apr 2023

Affordances from Human Videos as a Versatile Representation for RoboticsComputer Vision and Pattern Recognition (CVPR), 2023

369

257

17 Apr 2023

StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation

Francesco Ragusa

G. Farinella

Antonino Furnari

206

08 Apr 2023

Anticipating Next Active Objects for Egocentric VideosIEEE Access (IEEE Access), 2023

299

13 Feb 2023

Zero-Shot Robot Manipulation from Passive Human Videos

Homanga Bharadhwaj

Abhi Gupta

Shubham Tulsiani

Vikash Kumar

309

03 Feb 2023

FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video ObservationsComputer Vision and Pattern Recognition (CVPR), 2022

Christian Diller

Thomas Funkhouser

Angela Dai

281

25 Nov 2022

TAMFormer: Multi-Modal Transformer with Learned Attention Mask for Early Intent PredictionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Nada Osman

Guglielmo Camporese

Lamberto Ballan

126

26 Oct 2022

Anticipative Feature Fusion Transformer for Multi-Modal Action AnticipationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

178

23 Oct 2022

Rethinking Learning Approaches for Long-Term Action AnticipationEuropean Conference on Computer Vision (ECCV), 2022

174

20 Oct 2022

Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action AnticipationFindings (Findings), 2022

Sayontan Ghosh

Tanvi Aggarwal

Minh Hoai

Niranjan Balasubramanian

VLM

222

12 Oct 2022

ConTra: (Con)text (Tra)nsformer for Cross-Modal Video RetrievalAsian Conference on Computer Vision (ACCV), 2022

A. Fragomeni

Michael Wray

Dima Damen

CLIP ViT

145

09 Oct 2022

Visual Object Tracking in First Person VisionInternational Journal of Computer Vision (IJCV), 2022

235

27 Sep 2022