Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1708.06073
Cited By
v1
v2 (latest)
The Microsoft 2017 Conversational Speech Recognition System
21 August 2017
Wayne Xiong
Lingfeng Wu
F. Alleva
J. Droppo
Xuedong Huang
A. Stolcke
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Microsoft 2017 Conversational Speech Recognition System"
50 / 144 papers shown
xMem: A CPU-Based Approach for Accurate Estimation of GPU Memory in Deep Learning Training Workloads
Jiabo Shi
Dimitrios Pezaros
Yehia Elkhatib
105
0
0
23 Oct 2025
Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis
Jiabo Shi
Yehia Elkhatib
3DH
VLM
202
1
0
04 Apr 2025
Communication Access Real-Time Translation Through Collaborative Correction of Automatic Speech Recognition
Korbinian Kuhn
Verena Kersken
Gottfried Zimmermann
153
1
0
19 Mar 2025
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Chen Zhang
Conghui Tan
Rongzhong Lian
MoMe
296
1
0
21 Oct 2024
Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models
Spoken Language Technology Workshop (SLT), 2024
Xiaoxue Gao
Nancy F. Chen
Mamba
206
12
0
27 Sep 2024
Measuring the Accuracy of Automatic Speech Recognition Solutions
ACM Transactions on Accessible Computing (TACCESS), 2023
Korbinian Kuhn
Verena Kersken
Benedikt Reuter
Niklas Egger
Gottfried Zimmermann
199
43
0
29 Aug 2024
Child Speech Recognition in Human-Robot Interaction: Problem Solved?
R. Janssens
Eva Verhelst
Giulio Antonio Abbo
Qiaoqiao Ren
Maria Jose Pinto Bernal
Tony Belpaeme
160
7
0
26 Apr 2024
Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods
Zheyu Zhang
AAML
149
1
0
23 Feb 2024
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models
A. Ogawa
Naohiro Tawara
Marc Delcroix
S. Araki
199
3
0
20 Dec 2023
Assessing SATNet's Ability to Solve the Symbol Grounding Problem
Neural Information Processing Systems (NeurIPS), 2023
Oscar Chang
Lampros Flokas
Hod Lipson
Michael Spranger
NAI
187
24
0
13 Dec 2023
SAPIEN: Affective Virtual Agents Powered by Large Language Models
Masum Hasan
Cengiz Ozel
Sammy Potter
E. Hoque
VLM
LLMAG
177
16
0
06 Aug 2023
Leveraging Cross-Utterance Context For ASR Decoding
Interspeech (Interspeech), 2023
Robert Flynn
Anton Ragni
191
1
0
29 Jun 2023
Personalized Predictive ASR for Latency Reduction in Voice Assistants
Interspeech (Interspeech), 2023
A. Schwarz
Di He
Maarten Van Segbroeck
Mohammed Hethnawi
Ariya Rastrow
207
6
0
23 May 2023
Modular Domain Adaptation for Conformer-Based Streaming ASR
Interspeech (Interspeech), 2023
Qiujia Li
Yue Liu
DongSeon Hwang
Tara N. Sainath
P. M. Mengibar
190
13
0
22 May 2023
Neural Delay Differential Equations: System Reconstruction and Image Classification
International Conference on Learning Representations (ICLR), 2021
Qunxi Zhu
Yao Guo
Wei Lin
174
39
0
11 Apr 2023
From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition
International Conference on Human Factors in Computing Systems (CHI), 2023
Colin S. Lea
Zifang Huang
Lauren Tooley
Jaya Narain
Dianna Yee
P. Georgiou
Dung Tien Tran
Jeffrey P. Bigham
Leah Findlater
409
43
0
17 Feb 2023
Using Kaldi for Automatic Speech Recognition of Conversational Austrian German
J. Linke
Saskia Wepner
G. Kubin
Barbara Schuppler
168
10
0
16 Jan 2023
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Jielin Qiu
Franck Dernoncourt
Trung Bui
Zhaowen Wang
Ding Zhao
Hailin Jin
AI4TS
137
7
0
12 Oct 2022
Audio-driven Neural Gesture Reenactment with Video Motion Graphs
Computer Vision and Pattern Recognition (CVPR), 2022
Yang Zhou
Jimei Yang
Dingzeyu Li
Jun Saito
Deepali Aneja
E. Kalogerakis
DiffM
SLR
221
28
0
23 Jul 2022
Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification
Neural Information Processing Systems (NeurIPS), 2022
Massimiliano Patacchiola
J. Bronskill
Aliaksandra Shysheya
Katja Hofmann
Sebastian Nowozin
Richard Turner
VLM
351
12
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
International Conference on Learning Representations (ICLR), 2022
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard Turner
3DH
FedML
256
35
0
17 Jun 2022
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Qiu-shi Zhu
Jie Zhang
Zitian Zhang
Lirong Dai
192
18
0
26 May 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
254
133
0
25 Apr 2022
MyMove: Facilitating Older Adults to Collect In-Situ Activity Labels on a Smartwatch with Speech
International Conference on Human Factors in Computing Systems (CHI), 2022
Young-Ho Kim
Diana Chou
Bongshin Lee
M. Danilovich
Amanda Lazar
D. Conroy
Hernisa Kacorri
E. Choe
142
36
0
01 Apr 2022
Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR
International Conference on Language Resources and Evaluation (LREC), 2022
Nina Markl
S. McNulty
159
14
0
25 Feb 2022
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Mario Esparza
167
0
0
21 Feb 2022
I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy
H.C.M. Turner
Giulio Lovisotto
Simon Eberz
Ivan Martinovic
86
1
0
13 Feb 2022
Recent Progress in the CUHK Dysarthric Speech Recognition System
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Shansong Liu
Mengzhe Geng
Shoukang Hu
Xurong Xie
Mingyu Cui
Jianwei Yu
Xunying Liu
Helen Meng
147
82
0
15 Jan 2022
Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Interspeech (Interspeech), 2020
Mengzhe Geng
Xurong Xie
Shansong Liu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
132
74
0
14 Jan 2022
Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor Data
IEEE Internet of Things Journal (IEEE IoT J.), 2022
Eunyeong Jeon
Anirudh Som
Ankita Shukla
Kristina Hasanaj
M. Buman
Pavan Turaga
108
14
0
01 Jan 2022
Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition
C. Li
Ngoc Thang Vu
107
0
0
19 Dec 2021
ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees
Neural Information Processing Systems (NeurIPS), 2021
Kuan-Lin Chen
Ching-Hua Lee
H. Garudadri
Bhaskar D. Rao
AI4TS
284
7
0
10 Nov 2021
Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Pengfei Zhang
Cuiling Lan
Wenjun Zeng
Junliang Xing
Jianru Xue
Nanning Zheng
243
8
0
07 Nov 2021
On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics
Nan Feng
Guodong Zhang
Kapil Khandelwal
AI4CE
76
1
0
03 Nov 2021
Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition
Haozhe Chen
Weiming Zhang
Kunlin Liu
Kejiang Chen
Han Fang
Nenghai Yu
94
4
0
19 Oct 2021
Graphs as Tools to Improve Deep Learning Methods
Carlos Lassance
Myriam Bontonou
Mounia Hamidouche
Bastien Pasdeloup
Lucas Drumetz
Vincent Gripon
GNN
AI4CE
AAML
125
0
0
08 Oct 2021
Input Length Matters: Improving RNN-T and MWER Training for Long-form Telephony Speech Recognition
Zhiyun Lu
Yanwei Pan
Thibault Doutre
Parisa Haghani
Liangliang Cao
Rohit Prabhavalkar
Chuxu Zhang
Trevor Strohman
AuLLM
206
15
0
08 Oct 2021
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
206
27
0
17 Aug 2021
Edge service resource allocation strategy based on intelligent prediction
Yujie Wang
Xin Du
Xuzhao Chen
Zhihui Lu
82
0
0
27 Jul 2021
Large-Scale News Classification using BERT Language Model: Spark NLP Approach
International Conference on Sustainable Information Engineering and Technology (ICSIET), 2021
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
N. Yudistira
150
29
0
14 Jul 2021
Dive into Deep Learning
Journal of the American College of Radiology (JACR), 2020
Aston Zhang
Zachary Chase Lipton
Mu Li
Alexander J. Smola
VLM
354
646
0
21 Jun 2021
ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
Ashish Shenoy
S. Bodapati
Katrin Kirchhoff
194
15
0
15 Jun 2021
Drivers' Manoeuvre Modelling and Prediction for Safe HRI
Erwin Jose López Pulgarín
G. Herrmann
U. Leonards
77
0
0
03 Jun 2021
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect
Binbin Xu
Chongyang Tao
Z. Feng
Youssef Raqui
Sylvie Ranwez
161
17
0
07 May 2021
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
183
53
0
03 May 2021
Adapting Long Context NLM for ASR Rescoring in Conversational Agents
Interspeech (Interspeech), 2021
Ashish Shenoy
S. Bodapati
Monica Sunkara
S. Ronanki
Katrin Kirchhoff
232
21
0
21 Apr 2021
On Architectures and Training for Raw Waveform Feature Extraction in ASR
Automatic Speech Recognition & Understanding (ASRU), 2021
Peter Vieting
Christoph Luscher
Wilfried Michel
Ralf Schluter
Hermann Ney
164
11
0
09 Apr 2021
Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomes
Statistical Methods in Medical Research (Stat Med), 2021
Jamie Yap
John J. Dziak
David Kabiito
Claire Babirye
J. McKay
Bibhas Chakraborty
J. Nakatumba‐Nabende
191
26
0
31 Mar 2021
Platform for Situated Intelligence
D. Bohus
Sean Andrist
Ashley Feniello
Nick Saw
Mihai Jalobeanu
Patrick Sweeney
Anne Loomis Thompson
Eric Horvitz
116
50
0
29 Mar 2021
"Weak AI" is Likely to Never Become "Strong AI", So What is its Greatest Value for us?
B. Liu
119
9
0
29 Mar 2021
1
2
3
Next