ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.12266
  4. Cited By
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation
v1v2 (latest)

FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation

Annual Meeting of the Association for Computational Linguistics (ACL), 2024
16 October 2024
Huadai Liu
Jialei Wang
Rongjie Huang
Yang Liu
H. Lu
Zhou Zhao
Wei Xue
ArXiv (abs)PDFHTML

Papers citing "FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation"

10 / 10 papers shown
Title
AudioEval: Automatic Dual-Perspective and Multi-Dimensional Evaluation of Text-to-Audio-Generation
AudioEval: Automatic Dual-Perspective and Multi-Dimensional Evaluation of Text-to-Audio-Generation
Hui Wang
J. Zhao
Cheng Liu
Yuhang Jia
Haoqin Sun
Jiaming Zhou
Yong Qin
68
0
0
16 Oct 2025
Flow Straight and Fast in Hilbert Space: Functional Rectified Flow
Flow Straight and Fast in Hilbert Space: Functional Rectified Flow
Jianxin Zhang
Clayton Scott
88
0
0
12 Sep 2025
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
Yuxin Guo
Teng Wang
Yuying Ge
Shijie Ma
Yixiao Ge
Wei Zou
Mingyu Ding
DiffMAuLLM
94
1
0
27 Aug 2025
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
Xiquan Li
Junxi Liu
Yuzhe Liang
Zhikang Niu
Wenxi Chen
Xie Chen
129
2
0
08 Aug 2025
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing
Huadai Liu
Kaicheng Luo
Jialei Wang
Wen Wang
Qian Chen
Zhou Zhao
Wei Xue
VGenLRM
277
13
0
26 Jun 2025
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Yatong Bai
Jonah Casebeer
Somayeh Sojoudi
Nicholas J. Bryan
DiffMVLM
371
2
0
21 Apr 2025
OmniAudio: Generating Spatial Audio from 360-Degree Video
OmniAudio: Generating Spatial Audio from 360-Degree Video
Huadai Liu
Tianyi Luo
Qikai Jiang
Kaicheng Luo
Peiwen Sun
...
Xin Li
Shiliang Zhang
Zhijie Yan
Zhou Zhao
Wei Xue
VGen
372
10
0
21 Apr 2025
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
Chia-Yu Hung
Navonil Majumder
Zhifeng Kong
Ambuj Mehrish
Rafael Valle
Bryan Catanzaro
Soujanya Poria
Bryan Catanzaro
Soujanya Poria
308
34
0
30 Dec 2024
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
VinTAGe: Joint Video and Text Conditioning for Holistic Audio GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Saksham Singh Kushwaha
Yapeng Tian
DiffMVGen
201
10
0
14 Dec 2024
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control
Huadai Liu
Jialei Wang
X. Li
Wen Wang
Qian Chen
Rongjie Huang
Yang Liu
Jiayang Xu
Zhou Zhao
182
9
0
18 Jul 2024
1