1708.06073
The Microsoft 2017 Conversational Speech Recognition System
21 August 2017
Wayne Xiong
Lingfeng Wu
F. Alleva
J. Droppo
Xuedong Huang
A. Stolcke
Papers citing
"The Microsoft 2017 Conversational Speech Recognition System"
50 / 144 papers shown
xMem: A CPU-Based Approach for Accurate Estimation of GPU Memory in Deep Learning Training Workloads
Jiabo Shi
Dimitrios Pezaros
Yehia Elkhatib
108
0
0
23 Oct 2025
Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis
Jiabo Shi
Yehia Elkhatib
3DH
VLM
208
1
0
04 Apr 2025
Communication Access Real-Time Translation Through Collaborative Correction of Automatic Speech Recognition
Korbinian Kuhn
Verena Kersken
Gottfried Zimmermann
158
1
0
19 Mar 2025
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Chen Zhang
Conghui Tan
Rongzhong Lian
MoMe
305
1
0
21 Oct 2024
Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models
Spoken Language Technology Workshop (SLT), 2024
Xiaoxue Gao
Nancy F. Chen
Mamba
223
12
0
27 Sep 2024
Measuring the Accuracy of Automatic Speech Recognition Solutions
ACM Transactions on Accessible Computing (TACCESS), 2023
Korbinian Kuhn
Verena Kersken
Benedikt Reuter
Niklas Egger
Gottfried Zimmermann
199
45
0
29 Aug 2024
Child Speech Recognition in Human-Robot Interaction: Problem Solved?
R. Janssens
Eva Verhelst
Giulio Antonio Abbo
Qiaoqiao Ren
Maria Jose Pinto Bernal
Tony Belpaeme
172
7
0
26 Apr 2024
Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods
Zheyu Zhang
AAML
155
1
0
23 Feb 2024
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models
A. Ogawa
Naohiro Tawara
Marc Delcroix
S. Araki
210
3
0
20 Dec 2023
Assessing SATNet's Ability to Solve the Symbol Grounding Problem
Neural Information Processing Systems (NeurIPS), 2023
Oscar Chang
Lampros Flokas
Hod Lipson
Michael Spranger
NAI
187
24
0
13 Dec 2023
SAPIEN: Affective Virtual Agents Powered by Large Language Models
Masum Hasan
Cengiz Ozel
Sammy Potter
E. Hoque
VLM
LLMAG
194
18
0
06 Aug 2023
Leveraging Cross-Utterance Context For ASR Decoding
Interspeech (Interspeech), 2023
Robert Flynn
Anton Ragni
200
1
0
29 Jun 2023
Personalized Predictive ASR for Latency Reduction in Voice Assistants
Interspeech (Interspeech), 2023
A. Schwarz
Di He
Maarten Van Segbroeck
Mohammed Hethnawi
Ariya Rastrow
211
6
0
23 May 2023
Modular Domain Adaptation for Conformer-Based Streaming ASR
Interspeech (Interspeech), 2023
Qiujia Li
Yue Liu
DongSeon Hwang
Tara N. Sainath
P. M. Mengibar
200
13
0
22 May 2023
Neural Delay Differential Equations: System Reconstruction and Image Classification
International Conference on Learning Representations (ICLR), 2021
Qunxi Zhu
Yao Guo
Wei Lin
180
39
0
11 Apr 2023
From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition
International Conference on Human Factors in Computing Systems (CHI), 2023
Colin S. Lea
Zifang Huang
Lauren Tooley
Jaya Narain
Dianna Yee
P. Georgiou
Dung Tien Tran
Jeffrey P. Bigham
Leah Findlater
428
44
0
17 Feb 2023
Using Kaldi for Automatic Speech Recognition of Conversational Austrian German
J. Linke
Saskia Wepner
G. Kubin
Barbara Schuppler
186
10
0
16 Jan 2023
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Jielin Qiu
Franck Dernoncourt
Trung Bui
Zhaowen Wang
Ding Zhao
Hailin Jin
AI4TS
149
7
0
12 Oct 2022
Audio-driven Neural Gesture Reenactment with Video Motion Graphs
Computer Vision and Pattern Recognition (CVPR), 2022
Yang Zhou
Jimei Yang
Dingzeyu Li
Jun Saito
Deepali Aneja
E. Kalogerakis
DiffM
SLR
260
28
0
23 Jul 2022
Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification
Neural Information Processing Systems (NeurIPS), 2022
Massimiliano Patacchiola
J. Bronskill
Aliaksandra Shysheya
Katja Hofmann
Sebastian Nowozin
Richard Turner
VLM
362
13
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
International Conference on Learning Representations (ICLR), 2022
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard Turner
3DH
FedML
262
35
0
17 Jun 2022
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR
Qiu-shi Zhu
Jie Zhang
Zitian Zhang
Lirong Dai
200
18
0
26 May 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
259
133
0
25 Apr 2022
MyMove: Facilitating Older Adults to Collect In-Situ Activity Labels on a Smartwatch with Speech
International Conference on Human Factors in Computing Systems (CHI), 2022
Young-Ho Kim
Diana Chou
Bongshin Lee
M. Danilovich
Amanda Lazar
D. Conroy
Hernisa Kacorri
E. Choe
151
36
0
01 Apr 2022
Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR
International Conference on Language Resources and Evaluation (LREC), 2022
Nina Markl
S. McNulty
169
15
0
25 Feb 2022
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Mario Esparza
180
0
0
21 Feb 2022
I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy
H.C.M. Turner
Giulio Lovisotto
Simon Eberz
Ivan Martinovic
95
1
0
13 Feb 2022
Recent Progress in the CUHK Dysarthric Speech Recognition System
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Shansong Liu
Mengzhe Geng
Shoukang Hu
Xurong Xie
Mingyu Cui
Jianwei Yu
Xunying Liu
Helen Meng
163
83
0
15 Jan 2022
Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Interspeech (Interspeech), 2020
Mengzhe Geng
Xurong Xie
Shansong Liu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
142
74
0
14 Jan 2022
Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor Data
IEEE Internet of Things Journal (IEEE IoT J.), 2022
Eunyeong Jeon
Anirudh Som
Ankita Shukla
Kristina Hasanaj
M. Buman
Pavan Turaga
109
14
0
01 Jan 2022
Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition
C. Li
Ngoc Thang Vu
111
0
0
19 Dec 2021
ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees
Neural Information Processing Systems (NeurIPS), 2021
Kuan-Lin Chen
Ching-Hua Lee
H. Garudadri
Bhaskar D. Rao
AI4TS
307
7
0
10 Nov 2021
Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Pengfei Zhang
Cuiling Lan
Wenjun Zeng
Junliang Xing
Jianru Xue
Nanning Zheng
247
8
0
07 Nov 2021
On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics
Nan Feng
Guodong Zhang
Kapil Khandelwal
AI4CE
79
1
0
03 Nov 2021
Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition
Haozhe Chen
Weiming Zhang
Kunlin Liu
Kejiang Chen
Han Fang
Nenghai Yu
104
4
0
19 Oct 2021
Graphs as Tools to Improve Deep Learning Methods
Carlos Lassance
Myriam Bontonou
Mounia Hamidouche
Bastien Pasdeloup
Lucas Drumetz
Vincent Gripon
GNN
AI4CE
AAML
134
0
0
08 Oct 2021
Input Length Matters: Improving RNN-T and MWER Training for Long-form Telephony Speech Recognition
Zhiyun Lu
Yanwei Pan
Thibault Doutre
Parisa Haghani
Liangliang Cao
Rohit Prabhavalkar
Chuxu Zhang
Trevor Strohman
AuLLM
225
15
0
08 Oct 2021
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
222
27
0
17 Aug 2021
Edge service resource allocation strategy based on intelligent prediction
Yujie Wang
Xin Du
Xuzhao Chen
Zhihui Lu
93
0
0
27 Jul 2021
Large-Scale News Classification using BERT Language Model: Spark NLP Approach
International Conference on Sustainable Information Engineering and Technology (ICSIET), 2021
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
N. Yudistira
155
31
0
14 Jul 2021
Dive into Deep Learning
Journal of the American College of Radiology (JACR), 2020
Aston Zhang
Zachary Chase Lipton
Mu Li
Alexander J. Smola
VLM
362
650
0
21 Jun 2021
ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
Ashish Shenoy
S. Bodapati
Katrin Kirchhoff
201
15
0
15 Jun 2021
Drivers' Manoeuvre Modelling and Prediction for Safe HRI
Erwin Jose López Pulgarín
G. Herrmann
U. Leonards
85
0
0
03 Jun 2021
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect
Binbin Xu
Chongyang Tao
Z. Feng
Youssef Raqui
Sylvie Ranwez
162
17
0
07 May 2021
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
201
53
0
03 May 2021
Adapting Long Context NLM for ASR Rescoring in Conversational Agents
Interspeech (Interspeech), 2021
Ashish Shenoy
S. Bodapati
Monica Sunkara
S. Ronanki
Katrin Kirchhoff
241
21
0
21 Apr 2021
On Architectures and Training for Raw Waveform Feature Extraction in ASR
Automatic Speech Recognition & Understanding (ASRU), 2021
Peter Vieting
Christoph Luscher
Wilfried Michel
Ralf Schluter
Hermann Ney
174
11
0
09 Apr 2021
Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomes
Statistical Methods in Medical Research (Stat Med), 2021
Jamie Yap
John J. Dziak
David Kabiito
Claire Babirye
J. McKay
Bibhas Chakraborty
J. Nakatumba‐Nabende
207
27
0
31 Mar 2021
Platform for Situated Intelligence
D. Bohus
Sean Andrist
Ashley Feniello
Nick Saw
Mihai Jalobeanu
Patrick Sweeney
Anne Loomis Thompson
Eric Horvitz
116
50
0
29 Mar 2021
"Weak AI" is Likely to Never Become "Strong AI", So What is its Greatest Value for us?
B. Liu
123
9
0
29 Mar 2021