Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2406.09272
Cited By

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric
Videos

v1v2v3 (latest)

Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos

13 June 2024

Zihui Xue

Kristen Grauman

ArXiv (abs)PDF HTML Github (313★)

Papers citing "Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos"

13 / 13 papers shown

Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound

Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound

220

3

0

30 Nov 2025

Segmenting Collision Sound Sources in Egocentric Videos

Segmenting Collision Sound Sources in Egocentric Videos

335

0

0

17 Nov 2025

CAVER: Curious Audiovisual Exploring Robot

CAVER: Curious Audiovisual Exploring Robot

Boueny Folefack

Ben Abbatematteo

187

0

0

10 Nov 2025

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

229

0

0

07 Oct 2025

Clink! Chop! Thud! -- Learning Object Sounds from Real-World Interactions

Clink! Chop! Thud! -- Learning Object Sounds from Real-World Interactions

Siddhant Agarwal

Arun Balajee Vasudevan

158

0

0

02 Oct 2025

EGOILLUSION: Benchmarking Hallucinations in Egocentric Video Understanding

EGOILLUSION: Benchmarking Hallucinations in Egocentric Video Understanding

Ramaneswaran Selvakumar

283

7

0

18 Aug 2025

Sonify Anything: Towards Context-Aware Sonic Interactions in AR

Sonify Anything: Towards Context-Aware Sonic Interactions in AR

165

1

0

03 Aug 2025

EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart Glasses

EgoTrigger: Toward Audio-Driven Image Capture for Human Memory Enhancement in All-Day Energy-Efficient Smart GlassesIEEE Transactions on Visualization and Computer Graphics (TVCG), 2025

Akshay Paruchuri

Lavisha Aggarwal

Achin Kulshrestha

Ishan Chatterjee

279

3

0

03 Aug 2025

Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Takashi Shibuya

419

1

0

26 Jun 2025

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Bryan Catanzaro

422

121

0

06 Mar 2025

Generative AI for Cel-Animation: A Survey

Generative AI for Cel-Animation: A Survey

...

875

23

0

08 Jan 2025

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation

MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound GenerationInternational Conference on Learning Representations (ICLR), 2024

Chang D. Yoo

389

8

0

03 Oct 2024

Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound

Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley SoundIEEE Transactions on Audio, Speech, and Language Processing (IEEE TASLP), 2024

547

21

0

21 Aug 2024

Page 1 of 1