Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.08849
Cited By
A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
17 August 2023
Li Liu
Lufei Gao
Wen-Ling Lei
Fengji Ma
Xiaotian Lin
Jin-Tao Wang
CVBM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation"
12 / 12 papers shown
Title
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Ping Luo
Jiebo Luo
Chenliang Xu
VLM
47
81
0
29 Dec 2023
Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator
Chao Xu
Shaoting Zhu
Junwei Zhu
Alexander I. Rudnicky
Jiangning Zhang
Ying Tai
Yong Liu
DiffM
45
14
0
04 May 2023
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions
Geumbyeol Hwang
Sunwon Hong
Seunghyun Lee
Sungwoo Park
Gyeongsu Chae
VGen
19
5
0
14 Mar 2023
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model
Xinya Ji
Hang Zhou
Kaisiyuan Wang
Qianyi Wu
Wayne Wu
Feng Xu
Xun Cao
CVBM
50
157
0
30 May 2022
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning
Suzhe Wang
Lincheng Li
Yueqing Ding
Xin Yu
CVBM
59
116
0
06 Dec 2021
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
325
1,570
0
10 Nov 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
79
221
0
12 Feb 2021
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
M. Pantic
165
237
0
23 Jan 2020
ARBEE: Towards Automated Recognition of Bodily Expression of Emotion In the Wild
Yu Luo
Jianbo Ye
Reginald B. Adams
Jia Li
M. Newman
J. Z. Wang
46
85
0
28 Aug 2018
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
214
2,224
0
14 Jun 2018
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
162
782
0
16 Nov 2016
1