v1v2 (latest)

The Microsoft 2017 Conversational Speech Recognition System

21 August 2017

Papers citing "The Microsoft 2017 Conversational Speech Recognition System"

50 / 144 papers shown

xMem: A CPU-Based Approach for Accurate Estimation of GPU Memory in Deep Learning Training Workloads

Jiabo Shi

Dimitrios Pezaros

Yehia Elkhatib

105

23 Oct 2025

Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis

Jiabo Shi

Yehia Elkhatib

3DH VLM

202

04 Apr 2025

Communication Access Real-Time Translation Through Collaborative Correction of Automatic Speech Recognition

Korbinian Kuhn

Verena Kersken

Gottfried Zimmermann

153

19 Mar 2025

Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation

296

21 Oct 2024

Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces ModelsSpoken Language Technology Workshop (SLT), 2024

Xiaoxue Gao

Nancy F. Chen

Mamba

206

27 Sep 2024

Measuring the Accuracy of Automatic Speech Recognition SolutionsACM Transactions on Accessible Computing (TACCESS), 2023

199

29 Aug 2024

Child Speech Recognition in Human-Robot Interaction: Problem Solved?

Maria Jose Pinto Bernal

Tony Belpaeme

160

26 Apr 2024

Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods

Zheyu Zhang

AAML

149

23 Feb 2024

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models

199

20 Dec 2023

Assessing SATNet's Ability to Solve the Symbol Grounding ProblemNeural Information Processing Systems (NeurIPS), 2023

187

13 Dec 2023

SAPIEN: Affective Virtual Agents Powered by Large Language Models

177

06 Aug 2023

Leveraging Cross-Utterance Context For ASR DecodingInterspeech (Interspeech), 2023

Robert Flynn

Anton Ragni

191

29 Jun 2023

Personalized Predictive ASR for Latency Reduction in Voice AssistantsInterspeech (Interspeech), 2023

A. Schwarz

Di He

Maarten Van Segbroeck

Mohammed Hethnawi

Ariya Rastrow

207

23 May 2023

Modular Domain Adaptation for Conformer-Based Streaming ASRInterspeech (Interspeech), 2023

190

22 May 2023

Neural Delay Differential Equations: System Reconstruction and Image ClassificationInternational Conference on Learning Representations (ICLR), 2021

Qunxi Zhu

Yao Guo

Wei Lin

174

11 Apr 2023

From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech RecognitionInternational Conference on Human Factors in Computing Systems (CHI), 2023

409

17 Feb 2023

Using Kaldi for Automatic Speech Recognition of Conversational Austrian German

168

16 Jan 2023

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream VideosIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Ding Zhao

137

12 Oct 2022

Audio-driven Neural Gesture Reenactment with Video Motion GraphsComputer Vision and Pattern Recognition (CVPR), 2022

221

23 Jul 2022

Contextual Squeeze-and-Excitation for Efficient Few-Shot Image ClassificationNeural Information Processing Systems (NeurIPS), 2022

Massimiliano Patacchiola

351

20 Jun 2022

FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image ClassificationInternational Conference on Learning Representations (ICLR), 2022

Aliaksandra Shysheya

J. Bronskill

Massimiliano Patacchiola

Sebastian Nowozin

Richard Turner

3DH FedML

256

17 Jun 2022

Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR

192

26 May 2022

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Zhijian Liu

Song Han

254

133

25 Apr 2022

MyMove: Facilitating Older Adults to Collect In-Situ Activity Labels on a Smartwatch with SpeechInternational Conference on Human Factors in Computing Systems (CHI), 2022

142

01 Apr 2022

Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASRInternational Conference on Language Resources and Evaluation (LREC), 2022

Nina Markl

S. McNulty

159

25 Feb 2022

Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments

Mario Esparza

167

21 Feb 2022

I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy

H.C.M. Turner

Giulio Lovisotto

Simon Eberz

Ivan Martinovic

13 Feb 2022

Recent Progress in the CUHK Dysarthric Speech Recognition SystemIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

147

15 Jan 2022

Investigation of Data Augmentation Techniques for Disordered Speech RecognitionInterspeech (Interspeech), 2020

132

14 Jan 2022

Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor DataIEEE Internet of Things Journal (IEEE IoT J.), 2022

108

01 Jan 2022

Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition

C. Li

Ngoc Thang Vu

107

19 Dec 2021

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation GuaranteesNeural Information Processing Systems (NeurIPS), 2021

284

10 Nov 2021

Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition

Jianru Xue

243

07 Nov 2021

On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics

03 Nov 2021

Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition

19 Oct 2021

Graphs as Tools to Improve Deep Learning Methods

Vincent Gripon

125

08 Oct 2021

Input Length Matters: Improving RNN-T and MWER Training for Long-form Telephony Speech Recognition

206

08 Oct 2021

Look Who's Talking: Active Speaker Detection in the Wild

206

17 Aug 2021

Edge service resource allocation strategy based on intelligent prediction

27 Jul 2021

Large-Scale News Classification using BERT Language Model: Spark NLP ApproachInternational Conference on Sustainable Information Engineering and Technology (ICSIET), 2021

Kuncahyo Setyo Nugroho

Anantha Yullian Sukmadewa

N. Yudistira

150

14 Jul 2021

Dive into Deep LearningJournal of the American College of Radiology (JACR), 2020

354

646

21 Jun 2021

ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling

Ashish Shenoy

S. Bodapati

Katrin Kirchhoff

194

15 Jun 2021

Drivers' Manoeuvre Modelling and Prediction for Safe HRI

Erwin Jose López Pulgarín

G. Herrmann

U. Leonards

03 Jun 2021

A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

161

07 May 2021

On the limit of English conversational speech recognitionInterspeech (Interspeech), 2021

Zoltán Tüske

G. Saon

Brian Kingsbury

183

03 May 2021

Adapting Long Context NLM for ASR Rescoring in Conversational AgentsInterspeech (Interspeech), 2021

232

21 Apr 2021

On Architectures and Training for Raw Waveform Feature Extraction in ASRAutomatic Speech Recognition & Understanding (ASRU), 2021

164

09 Apr 2021

Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomesStatistical Methods in Medical Research (Stat Med), 2021

John J. Dziak

Bibhas Chakraborty

191

31 Mar 2021

Platform for Situated Intelligence

D. Bohus

Sean Andrist

Ashley Feniello

116

29 Mar 2021

"Weak AI" is Likely to Never Become "Strong AI", So What is its Greatest Value for us?

B. Liu

119

29 Mar 2021