2410.12266
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
16 October 2024
Huadai Liu
Jialei Wang
Rongjie Huang
Yang Liu
H. Lu
Zhou Zhao
Wei Xue
Papers citing "FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation"
10 / 10 papers shown
Title
AudioEval: Automatic Dual-Perspective and Multi-Dimensional Evaluation of Text-to-Audio-Generation
Hui Wang
J. Zhao
Cheng Liu
Yuhang Jia
Haoqin Sun
Jiaming Zhou
Yong Qin
68
0
0
16 Oct 2025
Flow Straight and Fast in Hilbert Space: Functional Rectified Flow
Jianxin Zhang
Clayton Scott
88
0
0
12 Sep 2025
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
Yuxin Guo
Teng Wang
Yuying Ge
Shijie Ma
Yixiao Ge
Wei Zou
Mingyu Ding
DiffM
AuLLM
94
1
0
27 Aug 2025
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
Xiquan Li
Junxi Liu
Yuzhe Liang
Zhikang Niu
Wenxi Chen
Xie Chen
129
2
0
08 Aug 2025
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing
Huadai Liu
Kaicheng Luo
Jialei Wang
Wen Wang
Qian Chen
Zhou Zhao
Wei Xue
VGen
LRM
277
13
0
26 Jun 2025
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Yatong Bai
Jonah Casebeer
Somayeh Sojoudi
Nicholas J. Bryan
DiffM
VLM
371
2
0
21 Apr 2025
OmniAudio: Generating Spatial Audio from 360-Degree Video
Huadai Liu
Tianyi Luo
Qikai Jiang
Kaicheng Luo
Peiwen Sun
...
Xin Li
Shiliang Zhang
Zhijie Yan
Zhou Zhao
Wei Xue
VGen
372
10
0
21 Apr 2025
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
Chia-Yu Hung
Navonil Majumder
Zhifeng Kong
Ambuj Mehrish
Rafael Valle
Bryan Catanzaro
Soujanya Poria
308
34
0
30 Dec 2024
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Saksham Singh Kushwaha
Yapeng Tian
DiffM
VGen
201
10
0
14 Dec 2024
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
Huadai Liu
Jialei Wang
X. Li
Wen Wang
Qian Chen
Rongjie Huang
Yang Liu
Jiayang Xu
Zhou Zhao
182
9
0
18 Jul 2024