v1v2 (latest)

The Microsoft 2017 Conversational Speech Recognition System

21 August 2017

Papers citing "The Microsoft 2017 Conversational Speech Recognition System"

50 / 144 papers shown

xMem: A CPU-Based Approach for Accurate Estimation of GPU Memory in Deep Learning Training Workloads

Jiabo Shi

Dimitrios Pezaros

Yehia Elkhatib

151

23 Oct 2025

Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis

Jiabo Shi

Yehia Elkhatib

3DH VLM

252

04 Apr 2025

Communication Access Real-Time Translation Through Collaborative Correction of Automatic Speech Recognition

Korbinian Kuhn

Verena Kersken

Gottfried Zimmermann

196

19 Mar 2025

Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation

329

21 Oct 2024

Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces ModelsSpoken Language Technology Workshop (SLT), 2024

Xiaoxue Gao

Nancy F. Chen

Mamba

256

27 Sep 2024

Measuring the Accuracy of Automatic Speech Recognition SolutionsACM Transactions on Accessible Computing (TACCESS), 2023

240

29 Aug 2024

Child Speech Recognition in Human-Robot Interaction: Problem Solved?

Maria Jose Pinto Bernal

Tony Belpaeme

206

26 Apr 2024

Reinforcement Learning-Based Approaches for Enhancing Security and Resilience in Smart Control: A Survey on Attack and Defense Methods

Zheyu Zhang

AAML

178

23 Feb 2024

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models

235

20 Dec 2023

Assessing SATNet's Ability to Solve the Symbol Grounding ProblemNeural Information Processing Systems (NeurIPS), 2023

235

13 Dec 2023

SAPIEN: Affective Virtual Agents Powered by Large Language Models

222

06 Aug 2023

Leveraging Cross-Utterance Context For ASR DecodingInterspeech (Interspeech), 2023

Robert Flynn

Anton Ragni

228

29 Jun 2023

Personalized Predictive ASR for Latency Reduction in Voice AssistantsInterspeech (Interspeech), 2023

A. Schwarz

Di He

Maarten Van Segbroeck

Mohammed Hethnawi

Ariya Rastrow

272

23 May 2023

Modular Domain Adaptation for Conformer-Based Streaming ASRInterspeech (Interspeech), 2023

250

22 May 2023

Neural Delay Differential Equations: System Reconstruction and Image ClassificationInternational Conference on Learning Representations (ICLR), 2021

Qunxi Zhu

Yao Guo

Wei Lin

228

11 Apr 2023

From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech RecognitionInternational Conference on Human Factors in Computing Systems (CHI), 2023

511

17 Feb 2023

Using Kaldi for Automatic Speech Recognition of Conversational Austrian German

199

16 Jan 2023

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream VideosIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Ding Zhao

159

12 Oct 2022

Audio-driven Neural Gesture Reenactment with Video Motion GraphsComputer Vision and Pattern Recognition (CVPR), 2022

287

23 Jul 2022

Contextual Squeeze-and-Excitation for Efficient Few-Shot Image ClassificationNeural Information Processing Systems (NeurIPS), 2022

Massimiliano Patacchiola

421

20 Jun 2022

FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image ClassificationInternational Conference on Learning Representations (ICLR), 2022

Aliaksandra Shysheya

J. Bronskill

Massimiliano Patacchiola

Sebastian Nowozin

Richard Turner

3DH FedML

312

17 Jun 2022

Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR

234

26 May 2022

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Zhijian Liu

Song Han

285

134

25 Apr 2022

MyMove: Facilitating Older Adults to Collect In-Situ Activity Labels on a Smartwatch with SpeechInternational Conference on Human Factors in Computing Systems (CHI), 2022

165

01 Apr 2022

Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASRInternational Conference on Language Resources and Evaluation (LREC), 2022

Nina Markl

S. McNulty

196

25 Feb 2022

Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments

Mario Esparza

199

21 Feb 2022

I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy

H.C.M. Turner

Giulio Lovisotto

Simon Eberz

Ivan Martinovic

100

13 Feb 2022

Recent Progress in the CUHK Dysarthric Speech Recognition SystemIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

189

15 Jan 2022

Investigation of Data Augmentation Techniques for Disordered Speech RecognitionInterspeech (Interspeech), 2020

198

14 Jan 2022

Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor DataIEEE Internet of Things Journal (IEEE IoT J.), 2022

137

01 Jan 2022

Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition

C. Li

Ngoc Thang Vu

125

19 Dec 2021

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation GuaranteesNeural Information Processing Systems (NeurIPS), 2021

343

10 Nov 2021

Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition

Jianru Xue

311

07 Nov 2021

On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics

03 Nov 2021

Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition

154

19 Oct 2021

Graphs as Tools to Improve Deep Learning Methods

Vincent Gripon

171

08 Oct 2021

Input Length Matters: Improving RNN-T and MWER Training for Long-form Telephony Speech Recognition

324

08 Oct 2021

Look Who's Talking: Active Speaker Detection in the Wild

246

17 Aug 2021

Edge service resource allocation strategy based on intelligent prediction

107

27 Jul 2021

Large-Scale News Classification using BERT Language Model: Spark NLP ApproachInternational Conference on Sustainable Information Engineering and Technology (ICSIET), 2021

Kuncahyo Setyo Nugroho

Anantha Yullian Sukmadewa

N. Yudistira

169

14 Jul 2021

Dive into Deep LearningJournal of the American College of Radiology (JACR), 2020

464

663

21 Jun 2021

ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling

Ashish Shenoy

S. Bodapati

Katrin Kirchhoff

256

15 Jun 2021

Drivers' Manoeuvre Modelling and Prediction for Safe HRI

Erwin Jose López Pulgarín

G. Herrmann

U. Leonards

03 Jun 2021

A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

196

07 May 2021

On the limit of English conversational speech recognitionInterspeech (Interspeech), 2021

Zoltán Tüske

G. Saon

Brian Kingsbury

296

03 May 2021

Adapting Long Context NLM for ASR Rescoring in Conversational AgentsInterspeech (Interspeech), 2021

286

21 Apr 2021

On Architectures and Training for Raw Waveform Feature Extraction in ASRAutomatic Speech Recognition & Understanding (ASRU), 2021

253

09 Apr 2021

Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomesStatistical Methods in Medical Research (Stat Med), 2021

John J. Dziak

Bibhas Chakraborty

256

31 Mar 2021

Platform for Situated Intelligence

D. Bohus

Sean Andrist

Ashley Feniello

142

29 Mar 2021

"Weak AI" is Likely to Never Become "Strong AI", So What is its Greatest Value for us?

B. Liu

162

29 Mar 2021