Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.14635
Cited By
CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation
24 May 2023
Yan Zhou
Qingkai Fang
Yang Feng
OT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"
17 / 17 papers shown
Title
DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation
Xinglin Lyu
Wei Tang
Y. Li
X. Zhao
Ming Zhu
...
Y. Lu
Min Zhang
Daimeng Wei
Hao Yang
Min Zhang
70
0
0
07 Apr 2025
Adaptive Inner Speech-Text Alignment for LLM-based Speech Translation
Henglyu Liu
Andong Chen
Kehai Chen
X. Bai
M. Zhong
Yuan Qiu
Min Zhang
40
0
0
13 Mar 2025
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
83
0
0
28 Jan 2025
CTC-based Non-autoregressive Textless Speech-to-Speech Translation
Qingkai Fang
Zhengrui Ma
Yan Zhou
Min Zhang
Yang Feng
50
0
0
11 Jun 2024
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?
Qingkai Fang
Shaolei Zhang
Zhengrui Ma
Min Zhang
Yang Feng
VLM
35
1
0
11 Jun 2024
COTET: Cross-view Optimal Transport for Knowledge Graph Entity Typing
Zhiwei Hu
Víctor Gutiérrez-Basulto
Zhiliang Xiang
Ru Li
Jeff Z. Pan
OT
31
0
0
22 May 2024
Rethinking and Improving Multi-task Learning for End-to-end Speech Translation
Yuhao Zhang
Chen Xu
Bei Li
Hao Chen
Tong Xiao
Chunliang Zhang
Jingbo Zhu
18
5
0
07 Nov 2023
Unified Segment-to-Segment Framework for Simultaneous Sequence Generation
Shaolei Zhang
Yang Feng
15
7
0
27 Oct 2023
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
Wenyu Guo
Qingkai Fang
Dong Yu
Yang Feng
13
6
0
20 Oct 2023
DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation
Qingkai Fang
Yan Zhou
Yangzhou Feng
27
6
0
11 Oct 2023
Cross-modal Alignment with Optimal Transport for CTC-based ASR
Xugang Lu
Peng Shen
Yu Tsao
Hisashi Kawai
17
4
0
24 Sep 2023
Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition
Chen Xu
Xiaoqian Liu
Erfeng He
Yuhao Zhang
Qianqian Dong
Tong Xiao
Jingbo Zhu
Dapeng Man
Wu Yang
19
0
0
21 Sep 2023
Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval
Yabing Wang
Shuhui Wang
Hao Luo
Jianfeng Dong
F. Wang
Meng Han
Xun Wang
Meng Wang
4
8
0
11 Sep 2023
An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
Pengzhi Gao
Ruiqing Zhang
Zhongjun He
Hua-Hong Wu
Haifeng Wang
12
4
0
28 Aug 2023
Back Translation for Speech-to-text Translation Without Transcripts
Qingkai Fang
Yang Feng
17
13
0
15 May 2023
Understanding and Bridging the Modality Gap for Speech Translation
Qingkai Fang
Yang Feng
13
25
0
15 May 2023
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Zi-Hua Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
61
57
0
07 Oct 2022
1