ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.09326
  4. Cited By
Gesticulator: A framework for semantically-aware speech-driven gesture
  generation
v1v2v3v4v5 (latest)

Gesticulator: A framework for semantically-aware speech-driven gesture generation

International Conference on Multimodal Interaction (ICMI), 2020
25 January 2020
Taras Kucherenko
Patrik Jonell
S. V. Waveren
G. Henter
Simon Alexanderson
Iolanda Leite
Hedvig Kjellström
    SLR
ArXiv (abs)PDFHTML

Papers citing "Gesticulator: A framework for semantically-aware speech-driven gesture generation"

50 / 92 papers shown
Towards Reliable Human Evaluations in Gesture Generation: Insights from a Community-Driven State-of-the-Art Benchmark
Towards Reliable Human Evaluations in Gesture Generation: Insights from a Community-Driven State-of-the-Art Benchmark
Rajmund Nagy
Hendric Voss
Thanh Hoang-Minh
Mihail Tsakov
Teodor Nikolov
...
R. Mcdonnell
Michael Neff
Taras Kucherenko
Youngwoo Yoon
G. Henter
EGVMVGen
456
0
0
03 Nov 2025
ImaGGen: Zero-Shot Generation of Co-Speech Semantic Gestures Grounded in Language and Image Input
ImaGGen: Zero-Shot Generation of Co-Speech Semantic Gestures Grounded in Language and Image Input
Hendric Voss
Stefan Kopp
SLR
328
0
0
20 Oct 2025
Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents
Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents
Zeyi Zhang
Yanju Zhou
Heyuan Yao
Tenglong Ao
Xiaohang Zhan
Libin Liu
LLMAG
210
6
0
06 Oct 2025
Learning to Generate Pointing Gestures in Situated Embodied Conversational Agents
Learning to Generate Pointing Gestures in Situated Embodied Conversational AgentsFrontiers in Robotics and AI (Front. Robot. AI), 2023
Anna Deichler
Siyang Wang
Simon Alexanderson
Jonas Beskow
266
15
0
15 Sep 2025
Multimodal Quantitative Measures for Multiparty Behaviour Evaluation
Multimodal Quantitative Measures for Multiparty Behaviour EvaluationInternational Conference on Multimodal Interaction (ICMI), 2025
Ojas Shirekar
Wim Pouw
Chenxu Hao
Vrushank Phadnis
Thabo Beeler
Chirag Raman
131
0
0
01 Aug 2025
Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models
Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models
Bohong Chen
Yumeng Li
Youyi Zheng
Yao-Xiang Ding
Kun Zhou
313
2
0
27 Jul 2025
SemGes: Semantics-aware Co-Speech Gesture Generation using Semantic Coherence and Relevance Learning
SemGes: Semantics-aware Co-Speech Gesture Generation using Semantic Coherence and Relevance Learning
Lanmiao Liu
E. Ghaleb
Aslı Özyürek
Zerrin Yumak
SLR
315
5
0
25 Jul 2025
AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars
AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars
T. Zhang
Jian Zhao
Yuer Li
Zheng Zhu
Ping Hu
Zhaoxin Fan
Wenjun Wu
Xuelong Li
282
0
0
21 May 2025
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain
Nan Gao
Yihua Bao
Dongdong Weng
Jiayi Zhao
Jia Li
Yan Zhou
Pengfei Wan
Di Zhang
SLR
391
1
0
26 Mar 2025
Global Position Aware Group Choreography using Large Language Model
Global Position Aware Group Choreography using Large Language Model
Haozhou Pang
Tianwei Ding
Lanshan He
Qi Gan
SLR
257
0
0
12 Mar 2025
Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing
Synchronize Dual Hands for Physics-Based Dexterous Guitar PlayingACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2024
Pei Xu
Ruocheng Wang
361
9
0
20 Feb 2025
Multi-Resolution Generative Modeling of Human Motion from Limited Data
Multi-Resolution Generative Modeling of Human Motion from Limited Data
David Eduardo Moreno-Villamarín
Anna Hilsmann
Peter Eisert
DiffM3DH
284
0
0
25 Nov 2024
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided
  Mixture-of-Experts
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-ExpertsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Xiang Deng
Youxin Pang
Xiaochen Zhao
Chao Xu
Lizhen Wang
Hongjiang Xiao
Shi Yan
Hongwen Zhang
Yebin Liu
DiffMVGen
356
4
0
31 Oct 2024
Allo-AVA: A Large-Scale Multimodal Conversational AI Dataset for
  Allocentric Avatar Gesture Animation
Allo-AVA: A Large-Scale Multimodal Conversational AI Dataset for Allocentric Avatar Gesture Animation
Saif Punjwani
Larry Heck
SLRVGen
220
3
0
21 Oct 2024
Towards a GENEA Leaderboard -- an Extended, Living Benchmark for
  Evaluating and Advancing Conversational Motion Synthesis
Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
Rajmund Nagy
Hendric Voss
Youngwoo Yoon
Taras Kucherenko
Teodor Nikolov
Thanh Hoang-Minh
R. Mcdonnell
Stefan Kopp
Michael Neff
G. Henter
305
5
0
08 Oct 2024
LLM Gesticulator: Leveraging Large Language Models for Scalable and
  Controllable Co-Speech Gesture Synthesis
LLM Gesticulator: Leveraging Large Language Models for Scalable and Controllable Co-Speech Gesture Synthesis
Haozhou Pang
Tianwei Ding
Lanshan He
Ming Tao
Lu Zhang
Qi Gan
283
7
0
06 Oct 2024
2D or not 2D: How Does the Dimensionality of Gesture Representation
  Affect 3D Co-Speech Gesture Generation?
2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?International Conference on Intelligent Virtual Agents (IVA), 2024
Teo Guichoux
Laure Soulier
Nicolas Obin
Catherine Pelachaud
SLR
235
1
0
16 Sep 2024
DiffTED: One-shot Audio-driven TED Talk Video Generation with
  Diffusion-based Co-speech Gestures
DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures
S. Hogue
Chenxu Zhang
Hamza Daruger
Yapeng Tian
Xiaohu Guo
VGen
288
23
0
11 Sep 2024
Incorporating Spatial Awareness in Data-Driven Gesture Generation for
  Virtual Agents
Incorporating Spatial Awareness in Data-Driven Gesture Generation for Virtual AgentsInternational Conference on Intelligent Virtual Agents (IVA), 2024
Anna Deichler
Simon Alexanderson
Jonas Beskow
178
2
0
07 Aug 2024
Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective
  Face and Body Expressions from Affordable Inputs
Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs
Uttaran Bhattacharya
Aniket Bera
Dinesh Manocha
CVBM
355
4
0
26 Jun 2024
Investigating the impact of 2D gesture representation on co-speech
  gesture generation
Investigating the impact of 2D gesture representation on co-speech gesture generation
Teo Guichoux
Laure Soulier
Nicolas Obin
Catherine Pelachaud
SLR
307
0
0
21 Jun 2024
SIGGesture: Generalized Co-Speech Gesture Synthesis via Semantic
  Injection with Large-Scale Pre-Training Diffusion Models
SIGGesture: Generalized Co-Speech Gesture Synthesis via Semantic Injection with Large-Scale Pre-Training Diffusion Models
Qingrong Cheng
Xu Li
Xinghui Fu
DiffM
259
16
0
22 May 2024
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture SynthesisACM Transactions on Graphics (TOG), 2024
Zeyi Zhang
Tenglong Ao
Yuyao Zhang
Qingzhe Gao
Chuan Lin
Baoquan Chen
Libin Liu
SLR
350
30
0
16 May 2024
Fake it to make it: Using synthetic data to remedy the data shortage in
  joint multimodal speech-and-gesture synthesis
Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis
Shivam Mehta
Anna Deichler
Jim O'Regan
Birger Moëll
Jonas Beskow
G. Henter
Simon Alexanderson
275
8
0
30 Apr 2024
Beyond Talking -- Generating Holistic 3D Human Dyadic Motion for
  Communication
Beyond Talking -- Generating Holistic 3D Human Dyadic Motion for Communication
Mingze Sun
Chao Xu
Xinyu Jiang
Yang Liu
Baigui Sun
Ruqi Huang
260
16
0
28 Mar 2024
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture
  Synthesis
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
Muhammad Hamza Mughal
Rishabh Dabral
I. Habibie
Lucia Donatelli
Marc Habermann
Christian Theobalt
SLR
174
41
0
26 Mar 2024
Speech-driven Personalized Gesture Synthetics: Harnessing Automatic
  Fuzzy Feature Inference
Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference
Fan Zhang
Zhaohan Wang
Xin Lyu
Siyuan Zhao
Mengjian Li
...
Naye Ji
Hui Du
Fuxing Gao
Hao Wu
Shunman Li
VGen
334
8
0
16 Mar 2024
ReNeLiB: Real-time Neural Listening Behavior Generation for Socially
  Interactive Agents
ReNeLiB: Real-time Neural Listening Behavior Generation for Socially Interactive Agents
Daksitha Senel Withanage Don
Philipp Müller
Fabrizio Nunnari
Elisabeth André
Patrick Gebhard
364
4
0
12 Feb 2024
DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven
  Holistic 3D Expression and Gesture Generation
DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Junming Chen
Yunfei Liu
Jianan Wang
Ailing Zeng
Yu Li
Qifeng Chen
VGen
319
69
0
09 Jan 2024
AgentAvatar: Disentangling Planning, Driving and Rendering for
  Photorealistic Avatar Agents
AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents
Duomin Wang
Bin Dai
Yu Deng
Baoyuan Wang
VGen
575
12
0
29 Nov 2023
SpeechAct: Towards Generating Whole-body Motion from SpeechIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Jinsong Zhang
Minjie Zhu
Yuxiang Zhang
Yebin Liu
Kun Li
376
5
0
29 Nov 2023
META4: Semantically-Aligned Generation of Metaphoric Gestures Using
  Self-Supervised Text and Speech Representation
META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation
Mireille Fares
Catherine Pelachaud
Nicolas Obin
193
1
0
09 Nov 2023
Large language models in textual analysis for gesture selection
Large language models in textual analysis for gesture selectionInternational Conference on Multimodal Interaction (ICMI), 2023
Laura Birka Hensel
Nutchanon Yongsatianchot
P. Torshizi
E. Minucci
Stacy Marsella
SLR
337
12
0
04 Oct 2023
ACT2G: Attention-based Contrastive Learning for Text-to-Gesture
  Generation
ACT2G: Attention-based Contrastive Learning for Text-to-Gesture GenerationProceedings of the ACM on Computer Graphics and Interactive Techniques (PACMCGIT), 2023
Hitoshi Teshima
Naoki Wake
Diego Thomas
Yuta Nakashima
Hiroshi Kawasaki
Katsushi Ikeuchi
262
0
0
28 Sep 2023
Speech-Gesture GAN: Gesture Generation for Robots and Embodied Agents
Speech-Gesture GAN: Gesture Generation for Robots and Embodied AgentsIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Carson Yu Liu
Gelareh Mohammadi
Yang Song
W. Johal
197
5
0
17 Sep 2023
Towards the generation of synchronized and believable non-verbal facial
  behaviors of a talking virtual agent
Towards the generation of synchronized and believable non-verbal facial behaviors of a talking virtual agent
Alice Delbosc
M. Ochs
Nicolas Sabouret
Brian Ravenet
Stéphane Ayache
383
14
0
15 Sep 2023
UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons
UnifiedGesture: A Unified Gesture Synthesis Model for Multiple SkeletonsACM Multimedia (ACM MM), 2023
Sicheng Yang
Zehao Wang
Zhiyong Wu
Minglei Li
Zhensong Zhang
...
Lei Hao
Songcen Xu
Xiaofei Wu
Changpeng Yang
Zonghong Dai
DiffM
340
18
0
13 Sep 2023
Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio
  Representation
Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio RepresentationInternational Conference on Multimodal Interaction (ICMI), 2023
Anna Deichler
Shivam Mehta
Simon Alexanderson
Jonas Beskow
DiffM
272
32
0
11 Sep 2023
BodyFormer: Semantics-guided 3D Body Gesture Synthesis with Transformer
BodyFormer: Semantics-guided 3D Body Gesture Synthesis with TransformerACM Transactions on Graphics (TOG), 2023
Kunkun Pang
Dafei Qin
Yingruo Fan
Julian Habekost
Takaaki Shiratori
Junichi Yamagishi
Taku Komura
SLRViT
155
28
0
07 Sep 2023
The GENEA Challenge 2023: A large scale evaluation of gesture generation
  models in monadic and dyadic settings
The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settingsInternational Conference on Multimodal Interaction (ICMI), 2023
Taras Kucherenko
Rajmund Nagy
Youngwoo Yoon
Jieyeon Woo
Teodor Nikolov
Mihail Tsakov
G. Henter
VGen
235
62
0
24 Aug 2023
Can Language Models Learn to Listen?
Can Language Models Learn to Listen?IEEE International Conference on Computer Vision (ICCV), 2023
Evonne Ng
Sanjay Subramanian
Dan Klein
Angjoo Kanazawa
Trevor Darrell
Shiry Ginosar
364
42
0
21 Aug 2023
Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Fan Zhang
Naye Ji
Fuxing Gao
Siyuan Zhao
Zhaohan Wang
Shunman Li
291
0
0
11 Aug 2023
Human Motion Generation: A Survey
Human Motion Generation: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Wentao Zhu
Xiaoxuan Ma
Dongwoo Ro
Hai Ci
Jinlu Zhang
Jiaxin Shi
Feng Gao
Qi Tian
Yizhou Wang
VGen
596
126
0
20 Jul 2023
MRecGen: Multimodal Appropriate Reaction Generator
MRecGen: Multimodal Appropriate Reaction Generator
Jiaqi Xu
Cheng Luo
Weicheng Xie
Linlin Shen
Xiaofeng Liu
Lu Liu
Hatice Gunes
Siyang Song
VGen
166
3
0
05 Jul 2023
EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
Li-Ping Yin
Yijun Wang
Tianyu He
Jinming Liu
Wei Zhao
Bohan Li
Xin Jin
Jianxin Lin
DiffM
220
22
0
20 Jun 2023
Diff-TTSG: Denoising probabilistic integrated speech and gesture
  synthesis
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesisSpeech Synthesis Workshop (SSW), 2023
Shivam Mehta
Siyang Wang
Simon Alexanderson
Jonas Beskow
Éva Székely
G. Henter
DiffM
373
18
0
15 Jun 2023
ZS-MSTM: Zero-Shot Style Transfer for Gesture Animation driven by Text
  and Speech using Adversarial Disentanglement of Multimodal Style Encoding
ZS-MSTM: Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding
Mireille Fares
Catherine Pelachaud
Nicolas Obin
214
0
0
22 May 2023
QPGesture: Quantization-Based and Phase-Guided Motion Matching for
  Natural Speech-Driven Gesture Generation
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Sicheng Yang
Zhiyong Wu
Minglei Li
Zhensong Zhang
Lei Hao
Weihong Bao
Hao-Wen Zhuang
SLR
261
58
0
18 May 2023
GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents
GestureDiffuCLIP: Gesture Diffusion Model with CLIP LatentsACM Transactions on Graphics (TOG), 2023
Tenglong Ao
Zeyi Zhang
Libin Liu
DiffMVGen
369
207
0
26 Mar 2023
Evaluating gesture generation in a large-scale open challenge: The GENEA
  Challenge 2022
Evaluating gesture generation in a large-scale open challenge: The GENEA Challenge 2022ACM Transactions on Graphics (TOG), 2023
Taras Kucherenko
Pieter Wolfert
Youngwoo Yoon
Carla Viegas
Teodor Nikolov
Mihail Tsakov
G. Henter
225
37
0
15 Mar 2023
12
Next
Page 1 of 2