arXiv: 2107.01554 · v1, v2 (latest)

EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion

4 July 2021
Daxin Tan
Liqun Deng
Y. Yeung
Xin Jiang
Xiao Chen
Tan Lee

Papers citing "EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion"

31 / 31 papers shown
Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba
Baher Mohammad
Magauiya Zhussip
Stamatios Lefkimmiatis
Mamba
148
0
0
06 Oct 2025
Instance-Specific Test-Time Training for Speech Editing in the Wild
Taewoo Kim
Uijong Lee
H. Park
Choongsang Cho
Nam In Park
Young Han Lee
202
0
0
16 Jun 2025
PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing
You Zhang
Baotong Tian
Lin Zhang
Z. Duan
139
3
0
03 Jun 2025
SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement
Kuan-Yu Chen
Jeng-Lin Li
Jian-Jiun Ding
300
0
0
20 May 2025
SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors
Yang Chen
Hui Wang
Shiyao Wang
Jianfei Chen
Jiabei He
Jiaming Zhou
Xi Yang
Longji Xu
Yonghua Lin
Yong Qin
211
3
0
20 Mar 2025
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Helin Wang
Meng Yu
Jiarui Hai
Chen Chen
Yuchen Hu
Rilin Chen
Najim Dehak
Dong Yu
376
10
0
03 Jan 2025
DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency
Yang Chen
Yuhang Jia
Shiwan Zhao
Ziyue Jiang
Haoran Li
Jiarong Kang
Yong Qin
141
3
0
19 Sep 2024
SongCreator: Lyrics-based Universal Song Generation
Neural Information Processing Systems (NeurIPS), 2024
Shun Lei
Yixuan Zhou
Boshi Tang
Max W. Y. Lam
Feng Liu
Hangyu Liu
Jingcheng Wu
Shiyin Kang
Zhiyong Wu
Helen Meng
285
17
0
09 Sep 2024
Automatic Voice Identification after Speech Resynthesis using PPG
The Speaker and Language Recognition Workshop (Odyssey), 2024
Thibault Gaudier
Marie Tahon
Anthony Larcher
Yannick Esteve
186
0
0
05 Aug 2024
Speech Editing -- a Summary
Tobias Kässmann
Yining Liu
Danni Liu
149
1
0
24 Jul 2024
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis
Zhijun Liu
Shuai Wang
Sho Inoue
Qibing Bai
Haizhou Li
DiffM
184
31
0
08 Jun 2024
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
Puyuan Peng
Po-Yao (Bernie) Huang
Daniel Li
Abdelrahman Mohamed
David Harwath
439
146
0
25 Mar 2024
AttentionStitch: How Attention Solves the Speech Editing Problem
Antonios Alexos
Pierre Baldi
206
3
0
05 Mar 2024
Fine-Grained Quantitative Emotion Editing for Speech Generation
Sho Inoue
Kun Zhou
Shuai Wang
Haizhou Li
214
5
0
04 Mar 2024
uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muqiao Yang
Chunlei Zhang
Yong-mei Xu
Zhongweiyang Xu
Heming Wang
Bhiksha Raj
Dong Yu
DiffM
159
11
0
02 Oct 2023
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Interspeech, 2023
Rui Liu
Jiatian Xi
Ziyue Jiang
Haizhou Li
369
7
0
21 Sep 2023
Cross-Utterance Conditioned VAE for Speech Generation
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2023
Yongqian Li
Cheng Yu
Guangzhi Sun
Weiqin Zu
Zheng Tian
...
Wei Pan
Chao Zhang
Jun Wang
Yang Yang
Fanglei Sun
192
3
0
08 Sep 2023
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Zheng Liang
Zheshu Song
Ziyang Ma
Chenpeng Du
K. Yu
Xie Chen
140
5
0
14 Jun 2023
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias
Ziyue Jiang
Yi Ren
Zhe Ye
Jinglin Liu
Chen Zhang
...
Rongjie Huang
Chunfeng Wang
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
256
95
0
06 Jun 2023
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ziyue Jiang
Qiang Yang
Jia-li Zuo
Zhe Ye
Rongjie Huang
Yixiang Ren
Zhou Zhao
DiffM
153
28
0
23 May 2023
DiffVoice: Text-to-Speech with Latent Diffusion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Zhijun Liu
Yiwei Guo
K. Yu
DiffM
171
27
0
23 Apr 2023
Emotion Selectable End-to-End Text-based Speech Editing
Artificial Intelligence (AI), 2022
Tao Wang
Jiangyan Yi
Ruibo Fu
Jianhua Tao
Zhengqi Wen
Chu Yuan Zhang
169
4
0
20 Dec 2022
MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy
Interspeech, 2022
Ya-Jie Zhang
Wei Song
Ya Yue
Zhengchen Zhang
Youzheng Wu
Xiaodong He
140
7
0
11 Nov 2022
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech
Xiaoran Fan
Chao Pang
Tian Yuan
Richard He Bai
Renjie Zheng
...
Junkun Chen
Zeyu Chen
Liang Huang
Yu Sun
Hua Wu
238
1
0
07 Nov 2022
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders
Jason Fong
Yun Wang
Prabhav Agrawal
Vimal Manohar
Jilong Wu
Thilo Kohler
Qing He
142
0
0
28 Oct 2022
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2022
Lin Zhang
Xin Wang
Erica Cooper
Nicholas W. D. Evans
Junichi Yamagishi
304
86
0
11 Apr 2022
A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
International Conference on Machine Learning (ICML), 2022
Richard He Bai
Renjie Zheng
Junkun Chen
Xintong Li
Mingbo Ma
Liang Huang
249
60
0
18 Mar 2022
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2022
Tao Wang
Jiangyan Yi
Ruibo Fu
Jianhua Tao
Zhengqi Wen
KELM
123
25
0
21 Feb 2022
SpeechPainter: Text-conditioned Speech Inpainting
Interspeech, 2022
Zalan Borsos
Matthew Sharifi
Marco Tagliasacchi
174
35
0
15 Feb 2022
Environment Aware Text-to-Speech Synthesis
Interspeech, 2021
Daxin Tan
Guangyan Zhang
Tan Lee
209
8
0
08 Oct 2021
EdiTTS: Score-based Editing for Controllable Text-to-Speech
Jaesung Tae
Hyeongju Kim
Taesu Kim
DiffM
395
47
0
06 Oct 2021