Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements

7 December 2020

Papers citing "Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements"

21 / 21 papers shown

VGGSounder: Audio-Visual Evaluations for Foundation Models

Daniil Zverev

Thaddäus Wiedemer

Christian Schroeder de Witt

259

11 Aug 2025

Controllable Video-to-Music Generation with Multiple Time-Varying Conditions

151

28 Jul 2025

A Survey on Cross-Modal Interaction Between Music and Multimodal Data

405

17 Apr 2025

A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives

609

01 Apr 2025

Vision-to-Music Generation: A Survey

Victor Shea-Jay Huang

Yue Liao

EGVM VGen

395

27 Mar 2025

MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization

Zhou Zhao

333

16 Oct 2024

From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation

International Conference on Machine Learning (ICML), 2024

435

27 Sep 2024

Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models

A. Sophia Koepke

216

09 Apr 2024

The NES Video-Music Database: A Dataset of Symbolic Video Game Music Paired with Gameplay Videos

Igor Cardoso

Rubens O. Moraes

Lucas N. Ferreira

309

05 Apr 2024

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Expert Systems with Applications (ESWA), 2023

422

02 Nov 2023

Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Xiulong Liu

Zhikang Dong

Peng Zhang

239

10 Oct 2023

Text-to-feature diffusion for audio-visual few-shot learning

A. Sophia Koepke

248

07 Sep 2023

V2Meow: Meowing to the Visual Beat via Video-to-Music Generation

AAAI Conference on Artificial Intelligence (AAAI), 2023

Kun Su

Judith Yue Li

Qingqing Huang

Dima Kuzmin

Joonseok Lee

...

215

11 May 2023

Long-Term Rhythmic Video Soundtracker

International Conference on Machine Learning (ICML), 2023

Yu Qiao

368

02 May 2023

Conditional Generation of Audio from Video via Foley Analogies

Computer Vision and Pattern Recognition (CVPR), 2023

Ziyang Chen

226

17 Apr 2023

Co-Speech Gesture Synthesis using Discrete Gesture Token Learning

IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

173

04 Mar 2023

Video Background Music Generation: Dataset, Method and Evaluation

IEEE International Conference on Computer Vision (ICCV), 2022

384

21 Nov 2022

Learning in Audio-visual Context: A Review, Analysis, and New Perspective

311

20 Aug 2022

Temporal and cross-modal attention for audio-visual zero-shot learning

European Conference on Computer Vision (ECCV), 2022

Otniel-Bogdan Mercea

Thomas Hummel

A. Sophia Koepke

Zeynep Akata

211

20 Jul 2022

Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language

Computer Vision and Pattern Recognition (CVPR), 2022

Otniel-Bogdan Mercea

Lukas Riesch

A. Sophia Koepke

Zeynep Akata

196

07 Mar 2022

Video Background Music Generation with Controllable Music Transformer

256

124

16 Nov 2021