ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09707
  4. Cited By
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion
  Models

Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models

17 November 2022
Simon Alexanderson
Rajmund Nagy
Jonas Beskow
G. Henter
    DiffM
    VGen
ArXivPDFHTML

Papers citing "Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models"

50 / 130 papers shown
Title
ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation
ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation
Jingzhong Lin
Yuanyuan Qi
Xinru Li
Wenxuan Huang
Xiangfeng Xu
Bangyan Li
Xuejiao Wang
Gaoqi He
21
0
0
08 May 2025
ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition
Zhiping Qiu
Yitong Jin
Y. Wang
Yi Shi
C. Wang
Chao Tan
Xiaobing Li
Feng Yu
Tao Yu
Qionghai Dai
19
0
0
07 May 2025
StableMotion: Training Motion Cleanup Models with Unpaired Corrupted Data
StableMotion: Training Motion Cleanup Models with Unpaired Corrupted Data
Yuxuan Mu
Hung Yu Ling
Yi Shi
Ismael Baira Ojeda
Pengcheng Xi
Chang Shu
F. Zinno
Xue Bin Peng
36
0
0
06 May 2025
GENMO: A GENeralist Model for Human MOtion
GENMO: A GENeralist Model for Human MOtion
Jiefeng Li
Jinkun Cao
Haotian Zhang
Davis Rempe
Jan Kautz
Umar Iqbal
Ye Yuan
DiffM
VGen
42
1
0
02 May 2025
UniPhys: Unified Planner and Controller with Diffusion for Flexible Physics-Based Character Control
UniPhys: Unified Planner and Controller with Diffusion for Flexible Physics-Based Character Control
Y. Wu
Korrawe Karunratanakul
Zhengyi Luo
Siyu Tang
DiffM
VGen
AI4CE
41
0
0
17 Apr 2025
Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis
Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis
Zihao Liu
Mingwen Ou
Zunnan Xu
Jiaqi Huang
Haonan Han
Ronghui Li
X. Li
DiffM
28
0
0
14 Apr 2025
EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation
EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation
Xiangyue Zhang
Jianfang Li
Jiaxu Zhang
Jianqiang Ren
Liefeng Bo
Zhigang Tu
20
0
0
12 Apr 2025
BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis
BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis
Moinak Bhattacharya
Saumya Gupta
Annie Singh
C. L. P. Chen
Gagandeep Singh
Prateek Prasanna
MedIm
21
0
0
06 Apr 2025
ReCoM: Realistic Co-Speech Motion Generation with Recurrent Embedded Transformer
ReCoM: Realistic Co-Speech Motion Generation with Recurrent Embedded Transformer
Yong Xie
Yunlian Sun
Hongwen Zhang
Y. Liu
Jinhui Tang
VGen
85
0
0
27 Mar 2025
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain
Nan Gao
Yihua Bao
Dongdong Weng
Jiayi Zhao
Jia Li
Yan Zhou
Pengfei Wan
Di Zhang
SLR
93
0
0
26 Mar 2025
Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion
Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion
Haim Sawdayee
Chuan Guo
Guy Tevet
Bing Zhou
Jian Wang
Amit H. Bermano
DiffM
VGen
46
0
0
25 Mar 2025
Motion Synthesis with Sparse and Flexible Keyjoint Control
Motion Synthesis with Sparse and Flexible Keyjoint Control
I. Hwang
Jinseok Bae
Donggeun Lim
Y. Kim
56
0
0
18 Mar 2025
MusicInfuser: Making Video Diffusion Listen and Dance
MusicInfuser: Making Video Diffusion Listen and Dance
Susung Hong
Ira Kemelmacher-Shlizerman
Brian L. Curless
Steven M. Seitz
VGen
43
0
0
18 Mar 2025
MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization
MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization
Binjie Liu
Lina Liu
Sanyi Zhang
Songen Gu
Yihao Zhi
Tianyi Zhu
Lei Yang
Long Ye
SLR
66
0
0
18 Mar 2025
ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation
ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation
Ling-an Zeng
Guohong Huang
Yi-Lin Wei
Shengbo Gu
Yu-Ming Tang
Jingke Meng
Wei-Shi Zheng
51
2
0
17 Mar 2025
Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion
Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion
Evgeniia Vu
Andrei Boiarov
Dmitry Vetrov
VGen
48
0
0
13 Mar 2025
MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm
MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm
Ziyan Guo
Zeyu Hu
Na Zhao
De Wen Soh
VGen
80
2
0
13 Mar 2025
ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis
ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis
Xukun Zhou
Fengxin Li
Ming Chen
Yan Zhou
Pengfei Wan
Di Zhang
Yeying Jin
Zhaoxin Fan
Hongyan Liu
Jun He
DiffM
VGen
43
0
0
09 Mar 2025
SPG: Improving Motion Diffusion by Smooth Perturbation Guidance
Boseong Jeon
DiffM
40
0
0
04 Mar 2025
ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model
ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model
Xuangeng Chu
Nabarun Goswami
Ziteng Cui
Hanqin Wang
Tatsuya Harada
DiffM
65
0
0
27 Feb 2025
Fatigue-PINN: Physics-Informed Fatigue-Driven Motion Modulation and Synthesis
Fatigue-PINN: Physics-Informed Fatigue-Driven Motion Modulation and Synthesis
Iliana Loi
Konstantinos Moustakas
45
0
0
26 Feb 2025
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music
Xinran Liu
Xu Dong
Diptesh Kanojia
Wenwu Wang
Zhenhua Feng
DiffM
60
0
0
25 Feb 2025
X-Dancer: Expressive Music to Human Dance Video Generation
X-Dancer: Expressive Music to Human Dance Video Generation
Zeyuan Chen
Hongyi Xu
Guoxian Song
You Xie
Chenxu Zhang
X. Chen
Chao Wang
Di Chang
Linjie Luo
VGen
31
0
0
24 Feb 2025
CASIM: Composite Aware Semantic Injection for Text to Motion Generation
CASIM: Composite Aware Semantic Injection for Text to Motion Generation
Che-Jui Chang
Qingze Tony Liu
H. Zhou
Vladimir Pavlovic
Mubbasir Kapadia
99
0
0
04 Feb 2025
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent
  Diffusion Transformer
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer
B. Li
Xihua Wang
Ruihua Song
Wenbing Huang
DiffM
VGen
68
1
0
21 Dec 2024
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction
  with 3D Autonomous Characters
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters
Jianping Jiang
Weiye Xiao
Zhengyu Lin
H. Zhang
Tianxiang Ren
Yang Gao
Zhiqian Lin
Zhongang Cai
Lei Yang
Ziwei Liu
79
3
0
29 Nov 2024
MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension
MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension
Zeyu Ling
Bo Han
Shiyang Li
H. Shen
Jikang Cheng
Changqing Zou
79
1
0
26 Nov 2024
Multi-Resolution Generative Modeling of Human Motion from Limited Data
Multi-Resolution Generative Modeling of Human Motion from Limited Data
David Eduardo Moreno-Villamarín
A. Hilsmann
Peter Eisert
DiffM
3DH
78
0
0
25 Nov 2024
SMGDiff: Soccer Motion Generation using diffusion probabilistic models
SMGDiff: Soccer Motion Generation using diffusion probabilistic models
Hongdi Yang
Chengyang Li
Zhenxuan Wu
Gaozheng Li
Jingya Wang
Jingyi Yu
Zhuo Su
Lan Xu
DiffM
VGen
65
1
0
25 Nov 2024
Constrained Diffusion with Trust Sampling
William Huang
Yifeng Jiang
Tom Van Wouwe
C. Karen Liu
27
3
0
17 Nov 2024
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and
  Correspondence
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence
Fuming You
Minghui Fang
Li Tang
Rongjie Huang
Yongqi Wang
Zhou Zhao
18
0
0
04 Nov 2024
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided
  Mixture-of-Experts
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
Xiang Deng
Youxin Pang
Xiaochen Zhao
Chao Xu
Lizhen Wang
Hongjiang Xiao
Shi Yan
Hongwen Zhang
Yebin Liu
DiffM
VGen
30
1
0
31 Oct 2024
ReinDiffuse: Crafting Physically Plausible Motions with Reinforced
  Diffusion Model
ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model
Gaoge Han
Mingjiang Liang
Jinglei Tang
Yongkang Cheng
Wei Liu
Shaoli Huang
VGen
28
5
0
09 Oct 2024
Towards a GENEA Leaderboard -- an Extended, Living Benchmark for
  Evaluating and Advancing Conversational Motion Synthesis
Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
Rajmund Nagy
Hendric Voss
Youngwoo Yoon
Taras Kucherenko
Teodor Nikolov
Thanh Hoang-Minh
R. Mcdonnell
Stefan Kopp
Michael Neff
G. Henter
16
1
0
08 Oct 2024
FürElise: Capturing and Physically Synthesizing Hand Motions of Piano
  Performance
FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance
Ruocheng Wang
Pei Xu
Haochen Shi
Elizabeth Schumann
C. Karen Liu
17
3
0
08 Oct 2024
Estimating Body and Hand Motion in an Ego-sensed World
Estimating Body and Hand Motion in an Ego-sensed World
Brent Yi
Vickie Ye
Maya Zheng
Lea Müller
Georgios Pavlakos
Yi Ma
Jitendra Malik
Angjoo Kanazawa
DiffM
39
6
0
04 Oct 2024
Real-time Diverse Motion In-betweening with Space-time Control
Real-time Diverse Motion In-betweening with Space-time Control
Yuchen Chu
Zeshi Yang
DiffM
16
1
0
30 Sep 2024
HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device
HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device
Vladimir Guzov
Yifeng Jiang
Fangzhou Hong
Gerard Pons-Moll
Richard A. Newcombe
C. Karen Liu
Yuting Ye
Lingni Ma
33
6
0
20 Sep 2024
Generation of Complex 3D Human Motion by Temporal and Spatial
  Composition of Diffusion Models
Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models
Lorenzo Mandelli
Stefano Berretti
DiffM
24
2
0
18 Sep 2024
2D or not 2D: How Does the Dimensionality of Gesture Representation
  Affect 3D Co-Speech Gesture Generation?
2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?
Teo Guichoux
Laure Soulier
Nicolas Obin
Catherine Pelachaud
SLR
22
0
0
16 Sep 2024
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE
ProbTalk3D: Non-Deterministic Emotion Controllable Speech-Driven 3D Facial Animation Synthesis Using VQ-VAE
Sichun Wu
Kazi Injamamul Haque
Zerrin Yumak
VGen
28
2
0
12 Sep 2024
DiffTED: One-shot Audio-driven TED Talk Video Generation with
  Diffusion-based Co-speech Gestures
DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures
S. Hogue
Chenxu Zhang
Hamza Daruger
Yapeng Tian
Xiaohu Guo
VGen
25
10
0
11 Sep 2024
Lagrangian Motion Fields for Long-term Motion Generation
Lagrangian Motion Fields for Long-term Motion Generation
Yifei Yang
Zikai Huang
C. Xu
Shengfeng He
18
0
0
03 Sep 2024
ViMo: Generating Motions from Casual Videos
ViMo: Generating Motions from Casual Videos
Liangdong Qiu
Chengxing Yu
Yanran Li
Zhao Wang
Haibin Huang
Chongyang Ma
Di Zhang
Pengfei Wan
Xiaoguang Han
VGen
24
0
0
13 Aug 2024
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture
  Generation
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Xiaofeng Mao
Zhengkai Jiang
Qilin Wang
Chencan Fu
Jiangning Zhang
Jiafu Wu
Yabiao Wang
Chengjie Wang
Wei Li
Mingmin Chi
70
4
0
06 Aug 2024
DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer
  Normalization Mamba-2 framework
DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework
Fan Zhang
Naye Ji
Fuxing Gao
Bozuo Zhao
Jingmei Wu
...
Zhenqing Ye
Jiayang Zhu
WeiFan Zhong
Leyao Yan
Xiaomeng Ma
27
0
0
01 Aug 2024
Enhancing Anti-spoofing Countermeasures Robustness through Joint
  Optimization and Transfer Learning
Enhancing Anti-spoofing Countermeasures Robustness through Joint Optimization and Transfer Learning
Yikang Wang
Xingming Wang
Hiromitsu Nishizaki
Ming Li
AAML
24
0
0
29 Jul 2024
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and
  Disentangled Multi-Modality Fusion
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion
Chencan Fu
Yabiao Wang
Jiangning Zhang
Zhengkai Jiang
Xiaofeng Mao
Jiafu Wu
Weijian Cao
Chengjie Wang
Yanhao Ge
Yong Liu
Mamba
35
2
0
29 Jul 2024
M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
Seung-geun Chi
Hyung-Gun Chi
Hengbo Ma
Nakul Agarwal
Faizan Siddiqui
Karthik Ramani
Kwonjoon Lee
DiffM
34
10
0
19 Jul 2024
SMooDi: Stylized Motion Diffusion Model
SMooDi: Stylized Motion Diffusion Model
Lei Zhong
Yiming Xie
Varun Jampani
Deqing Sun
Huaizu Jiang
DiffM
42
15
0
17 Jul 2024
123
Next