1910.11559
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering
25 October 2019
Yung-Sung Chuang
Chi-Liang Liu
Hung-yi Lee
Lin-shan Lee
AuLLM
Papers citing "SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering"
24 / 24 papers shown
MAPSS: Manifold-based Assessment of Perceptual Source Separation
Amir Ivry
Samuele Cornell
Shinji Watanabe
81
0
0
11 Sep 2025
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
Applied Soft Computing (Appl. Soft Comput.), 2024
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
203
3
0
24 Oct 2024
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chan-Jan Hsu
Ho-Lam Chung
Hung-yi Lee
Yu Tsao
283
6
0
01 Nov 2022
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Neural Information Processing Systems (NeurIPS), 2022
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
491
166
0
14 Jun 2022
End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Chenyu You
Polydoros Giannouris
Fenglin Liu
Shen Ge
Xian Wu
Yuexian Zou
AuLLM
145
53
0
29 Apr 2022
ERNIE-GeoL: A Geography-and-Language Pre-trained Model and its Applications in Baidu Maps
Knowledge Discovery and Data Mining (KDD), 2022
Jizhou Huang
Haifeng Wang
Yibo Sun
Yunsheng Shi
Zhengjie Huang
An Zhuo
Shikun Feng
181
56
0
17 Mar 2022
Using Pause Information for More Accurate Entity Recognition
Sahas Dendukuri
Pooja Chitkara
Joel Ruben Antony Moniz
Xiao Yang
M. Tsagkias
S. Pulman
116
5
0
27 Sep 2021
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Chenyu You
Polydoros Giannouris
Yuexian Zou
SSL
187
64
0
08 Sep 2021
Detecting Extraneous Content in Podcasts
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
S. Reddy
Yongze Yu
Aasish Pappu
Aswin Sivaraman
R. Rezapour
R. Jones
84
16
0
03 Mar 2021
A Survey on Visual Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
941
2,988
0
23 Dec 2020
Towards Semi-Supervised Semantics Understanding from Speech
Cheng-I Jeff Lai
Jin Cao
S. Bodapati
Shang-Wen Li
SSL
169
7
0
11 Nov 2020
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies
Interspeech (Interspeech), 2020
Alexander H. Liu
Yu-An Chung
James R. Glass
SSL
207
93
0
01 Nov 2020
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Cheng-I Jeff Lai
Yung-Sung Chuang
Hung-yi Lee
Shang-Wen Li
James R. Glass
VLM
SSL
213
62
0
26 Oct 2020
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Minjeong Kim
Gyuwan Kim
Sang-Woo Lee
Jung-Woo Ha
VLM
170
37
0
23 Oct 2020
Similarity Analysis of Self-Supervised Speech Representations
Yu-An Chung
Yonatan Belinkov
James R. Glass
SSL
338
44
0
22 Oct 2020
A Framework for Generative and Contrastive Learning of Audio Representations
Prateek Verma
J. Smith
SSL
183
21
0
22 Oct 2020
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering
Chenyu You
Polydoros Giannouris
Yuexian Zou
427
40
0
21 Oct 2020
Towards Data Distillation for End-to-end Spoken Conversational Question Answering
Chenyu You
Polydoros Giannouris
Fenglin Liu
Dongchao Yang
Yuexian Zou
225
50
0
18 Oct 2020
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding
Yu-An Chung
Chenguang Zhu
Michael Zeng
VLM
275
8
0
05 Oct 2020
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Neural Information Processing Systems (NeurIPS), 2020
Zihang Jiang
Weihao Yu
Daquan Zhou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
318
195
0
06 Aug 2020
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Pavel Denisov
Ngoc Thang Vu
183
31
0
03 Jul 2020
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Interspeech (Interspeech), 2020
Chia-Chih Kuo
Shang-Bao Luo
Kuan-Yu Chen
122
18
0
25 May 2020
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Won Ik Cho
Donghyun Kwak
J. Yoon
N. Kim
229
27
0
17 May 2020
Pre-trained Models for Natural Language Processing: A Survey
Science China Technological Sciences (Sci China Technol Sci), 2020
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
929
1,607
0
18 Mar 2020