Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.00320
Cited By
Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension
1 April 2018
Chia-Hsuan Lee
Szu-Lin Wu
Chi-Liang Liu
Hung-yi Lee
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension"
50 / 61 papers shown
Title
Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning
Giuseppe Attanasio
Sonal Sannigrahi
Ben Peters
André F. T. Martins
AuLLM
28
0
0
20 Jun 2025
Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs
Wenyu Zhang
Yingxu He
Geyu Lin
Zhuohan Liu
Shuo Sun
...
Jeremy H.M Wong
Qiongqiong Wang
Hardik B. Sailor
Nancy F. Chen
Ai Ti Aw
AuLLM
32
0
0
07 Jun 2025
SOVA-Bench: Benchmarking the Speech Conversation Ability for LLM-based Voice Assistant
Yixuan Hou
Heyang Liu
Yuhao Wang
Ziyang Cheng
Ronghua Wu
Qunshan Gu
Yanfeng Wang
Yu Wang
AuLLM
42
0
0
03 Jun 2025
Spoken question answering for visual queries
Nimrod Shabtay
Zvi Kons
Avihu Dekel
Hagai Aronowitz
R. Hoory
Assaf Arbelle
63
0
0
29 May 2025
SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs
Firoj Alam
Md. Arid Hasan
Shammur A. Chowdhury
69
0
0
25 May 2025
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models
Yiming Gao
Bin Wang
Chengwei Wei
Shuo Sun
AiTi Aw
MLLM
AuLLM
46
0
0
22 May 2025
KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025
Sai Koneru
Maike Züfle
Thai-Binh Nguyen
Seymanur Akti
Jan Niehues
Alexander Waibel
100
0
0
19 May 2025
Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models
Heeseung Kim
Che Hyun Lee
Sangkwon Park
Jiheum Yeom
Nohil Park
Sangwon Yu
Sungroh Yoon
130
1
0
27 Feb 2025
NUTSHELL: A Dataset for Abstract Generation from Scientific Talks
Maike Züfle
Sara Papi
Beatrice Savoldi
Marco Gaido
L. Bentivogli
Jan Niehues
86
2
0
24 Feb 2025
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
Mohammad Mahdi Abootorabi
Amirhosein Zobeiri
Mahdi Dehghani
Mohammadali Mohammadkhani
Bardia Mohammadi
Omid Ghahroodi
M. Baghshah
Ehsaneddin Asgari
RALM
351
7
0
12 Feb 2025
The Role of Prosody in Spoken Question Answering
Jie Chi
Maureen de Seyssel
Natalie Schluter
102
0
0
08 Feb 2025
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
Junyi Ao
Yuancheng Wang
Xiaohai Tian
Dekun Chen
Jing Zhang
Lu Lu
Yansen Wang
Haizhou Li
Zhikai Wu
AuLLM
177
25
0
17 Jan 2025
A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering
Georgios Sidiropoulos
Evangelos Kanoulas
RALM
68
0
0
20 Sep 2024
Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models
Potsawee Manakul
Guangzhi Sun
Warit Sirichotedumrong
Kasima Tharnpipitchai
Kunat Pipatanakul
AuLLM
122
7
0
17 Sep 2024
MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders
Wentao Zhang
Shuo Sun
Bin Wang
Xunlong Zou
Zhuohan Liu
Yingxu He
Geyu Lin
Nancy F. Chen
Ai Ti Aw
AuLLM
121
1
0
10 Sep 2024
Surprisingly Fragile: Assessing and Addressing Prompt Instability in Multimodal Foundation Models
Ian Stewart
Sameera Horawalavithana
Brendan Kennedy
Sai Munikoti
Karl Pazdernik
AAML
66
2
0
26 Aug 2024
On the Evaluation of Speech Foundation Models for Spoken Language Understanding
Siddhant Arora
Ankita Pasad
Chung-Ming Chien
Jionghao Han
Roshan S. Sharma
...
William Chen
Suwon Shon
Hung-yi Lee
Karen Livescu
Shinji Watanabe
ELM
87
6
0
14 Jun 2024
Multi-Modal Retrieval For Large Language Model Based Speech Recognition
J. Kolehmainen
Aditya Gourav
Prashanth Gurunath Shivakumar
Yile Gu
Ankur Gandhe
Ariya Rastrow
Grant P. Strimel
I. Bulyko
90
5
0
13 Jun 2024
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
Yanis Labrak
Adel Moumen
Richard Dufour
Mickael Rouvier
ELM
LM&MA
MedIm
74
1
0
09 Jun 2024
VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension
Thinh P. Ngo
Khoa Tran Anh Dang
Son T. Luu
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
134
0
0
05 Feb 2024
SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering
Chyi-Jiunn Lin
Guan-Ting Lin
Yung-Sung Chuang
Wei-Lun Wu
Shang-Wen Li
Abdelrahman Mohamed
Hung-yi Lee
Lin-shan Lee
RALM
69
5
0
24 Jan 2024
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Kevin Everson
Yile Gu
Huck Yang
Prashanth Gurunath Shivakumar
Guan-Ting Lin
...
Shalini Ghosh
Wael Hamza
Hung-yi Lee
Ariya Rastrow
A. Stolcke
70
6
0
05 Jan 2024
GSQA: An End-to-End Model for Generative Spoken Question Answering
Min-Han Shih
Ho-Lam Chung
Yu-Chi Pai
Ming-Hao Hsu
Guan-Ting Lin
Shang-Wen Li
Hung-yi Lee
ELM
AuLLM
86
2
0
15 Dec 2023
LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models
Zihan Zhao
Yiyang Jiang
Heyang Liu
Yanfeng Wang
Yu Wang
80
4
0
20 Aug 2023
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants
Anuj Diwan
Eunsol Choi
David Harwath
81
0
0
14 Jun 2023
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation
Mutian He
Philip N. Garner
100
4
0
16 May 2023
MeeQA: Natural Questions in Meeting Transcripts
Reut Apel
Tom Braude
Amir Kantor
Eyal Kolman
RALM
59
2
0
15 May 2023
HeySQuAD: A Spoken Question Answering Dataset
Yijing Wu
Sai Krishna Rallabandi
R. Srinivasamurthy
Parag Dakle
Alolika Gon
Preethi Raghavan
103
6
0
26 Apr 2023
A Mixed-Methods Approach to Understanding User Trust after Voice Assistant Failures
Amanda Baughan
Allison Mercurio
Ariel Liu
Xuezhi Wang
Jilin Chen
Xiao Ma
76
15
0
01 Mar 2023
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks
Suwon Shon
Siddhant Arora
Chyi-Jiunn Lin
Ankita Pasad
Felix Wu
Roshan S. Sharma
Wei Wu
Hung-yi Lee
Karen Livescu
Shinji Watanabe
ELM
80
33
0
20 Dec 2022
Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task
Shailaja Keyur Sampat
Pratyay Banerjee
Yezhou Yang
Chitta Baral
108
2
0
07 Dec 2022
On the Impact of Speech Recognition Errors in Passage Retrieval for Spoken Question Answering
Georgios Sidiropoulos
Svitlana Vakulenko
Evangelos Kanoulas
RALM
112
8
0
26 Sep 2022
Video-Guided Curriculum Learning for Spoken Video Grounding
Yan Xia
Zhou Zhao
Shangwei Ye
Yang Zhao
Haoyuan Li
Yi Ren
77
11
0
01 Sep 2022
End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Chenyu You
Nuo Chen
Fenglin Liu
Shen Ge
Xian Wu
Yuexian Zou
AuLLM
63
44
0
29 Apr 2022
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification
Georgios P. Spithourakis
Ivan Vulić
M. Lis
I. Casanueva
Paweł Budzianowski
61
5
0
28 Apr 2022
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering
Guan-Ting Lin
Yung-Sung Chuang
Ho-Lam Chung
Shu-Wen Yang
Hsuan-Jui Chen
Shuyan Dong
Shang-Wen Li
Abdel-rahman Mohamed
Hung-yi Lee
Lin-Shan Lee
123
22
0
09 Mar 2022
Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Manaal Faruqui
Dilek Z. Hakkani-Tür
90
22
0
10 Dec 2021
SD-QA: Spoken Dialectal Question Answering for the Real World
Fahim Faisal
Sharlina Keshava
ibn Alam
Antonios Anastasopoulos
147
32
0
24 Sep 2021
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining
Xiao Dong
Xunlin Zhan
Yangxin Wu
Yunchao Wei
Michael C. Kampffmeyer
Xiaoyong Wei
Minlong Lu
Yaowei Wang
Xiaodan Liang
116
38
0
09 Sep 2021
Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
SSL
102
64
0
08 Sep 2021
QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension
Anna Rogers
Matt Gardner
Isabelle Augenstein
135
168
0
27 Jul 2021
An Initial Investigation of Non-Native Spoken Question-Answering
V. Raina
Mark Gales
66
1
0
09 Jul 2021
Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering
Aditya Gupta
Jiacheng Xu
Shyam Upadhyay
Diyi Yang
Manaal Faruqui
90
33
0
08 Jun 2021
Self-supervised Dialogue Learning for Spoken Conversational Question Answering
Nuo Chen
Chenyu You
Yuexian Zou
SSL
87
34
0
04 Jun 2021
NoiseQA: Challenge Set Evaluation for User-Centric Question Answering
Abhilasha Ravichander
Siddharth Dalmia
Maria Ryskina
Florian Metze
Eduard H. Hovy
A. Black
ELM
59
32
0
16 Feb 2021
Two-stage Textual Knowledge Distillation for End-to-End Spoken Language Understanding
Seongbin Kim
Gyuwan Kim
Seongjin Shin
Sangmin Lee
VLM
59
20
0
25 Oct 2020
Knowledge Distillation for Improved Accuracy in Spoken Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
108
52
0
21 Oct 2020
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
105
38
0
21 Oct 2020
Towards Data Distillation for End-to-end Spoken Conversational Question Answering
Chenyu You
Nuo Chen
Fenglin Liu
Dongchao Yang
Yuexian Zou
77
48
0
18 Oct 2020
WER we are and WER we think we are
Piotr Szymañski
Piotr Żelasko
Mikolaj Morzy
Adrian Szymczak
Marzena Zyla-Hoppe
Joanna Banaszczak
Lukasz Augustyniak
Jan Mizgajski
Yishay Carmiel
71
46
0
07 Oct 2020
1
2
Next