ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.06073
  4. Cited By
The Microsoft 2017 Conversational Speech Recognition System
v1v2 (latest)

The Microsoft 2017 Conversational Speech Recognition System

21 August 2017
Wayne Xiong
Lingfeng Wu
F. Alleva
J. Droppo
Xuedong Huang
A. Stolcke
ArXiv (abs)PDFHTML

Papers citing "The Microsoft 2017 Conversational Speech Recognition System"

50 / 144 papers shown
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent
  Forecasting
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent ForecastingIEEE International Conference on Computer Vision (ICCV), 2021
Ye Yuan
Xinshuo Weng
Yanglan Ou
Kris Kitani
AI4TS
504
596
0
25 Mar 2021
Generating Human Readable Transcript for Automatic Speech Recognition
  with Pre-trained Language Model
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language ModelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Junwei Liao
Yu Shi
Ming Gong
Linjun Shou
Sefik Emre Eskimez
Liyang Lu
Hong Qu
Michael Zeng
104
9
0
22 Feb 2021
Transformer Language Models with LSTM-based Cross-utterance Information
  Representation
Transformer Language Models with LSTM-based Cross-utterance Information RepresentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
G. Sun
Chuxu Zhang
P. Woodland
224
35
0
12 Feb 2021
Evaluating Models of Robust Word Recognition with Serial Reproduction
Evaluating Models of Robust Word Recognition with Serial Reproduction
Stephan C. Meylan
Sathvik Nair
Thomas Griffiths
100
4
0
24 Jan 2021
Practical Speech Re-use Prevention in Voice-driven Services
Practical Speech Re-use Prevention in Voice-driven ServicesInternational Symposium on Recent Advances in Intrusion Detection (RAID), 2021
Yangyong Zhang
Maliheh Shirvanian
Sunpreet S. Arora
Jianwei Huang
G. Gu
139
2
0
12 Jan 2021
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization
  Framework
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Sung-En Chang
Yanyu Li
Mengshu Sun
Runbin Shi
Hayden Kwok-Hay So
Xuehai Qian
Yanzhi Wang
Xue Lin
MQ
326
105
0
08 Dec 2020
Context-aware RNNLM Rescoring for Conversational Speech Recognition
Context-aware RNNLM Rescoring for Conversational Speech RecognitionInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2020
Kun Wei
Pengcheng Guo
Hang Lv
Zhen Tu
Lei Xie
206
5
0
18 Nov 2020
Gated Recurrent Fusion with Joint Training Framework for Robust
  End-to-End Speech Recognition
Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition
Cunhang Fan
Jiangyan Yi
Jianhua Tao
Zhengkun Tian
Bin Liu
Zhengqi Wen
122
87
0
09 Nov 2020
HarperValleyBank: A Domain-Specific Spoken Dialog Corpus
HarperValleyBank: A Domain-Specific Spoken Dialog Corpus
Mike Wu
J. Nafziger
A. Scodary
Andrew L. Maas
158
18
0
26 Oct 2020
Super-Human Performance in Online Low-latency Recognition of
  Conversational Speech
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
334
42
0
07 Oct 2020
Machine learning based forecasting of significant daily returns in
  foreign exchange markets
Machine learning based forecasting of significant daily returns in foreign exchange marketsInternational Journal of Business Intelligence and Data Mining (IJBIDM), 2020
Firuz Kamalov
Ikhlaas Gurrib
123
5
0
21 Sep 2020
Cross-Utterance Language Models with Acoustic Error Sampling
Cross-Utterance Language Models with Acoustic Error Sampling
G. Sun
Chuxu Zhang
P. Woodland
142
2
0
19 Aug 2020
Deep learning for photoacoustic imaging: a survey
Deep learning for photoacoustic imaging: a survey
Changchun Yang
Hengrong Lan
Feng Gao
Fei Gao
VLMMedIm
317
21
0
10 Aug 2020
A Multi-Task Learning Approach for Human Activity Segmentation and
  Ergonomics Risk Assessment
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment
Behnoosh Parsa
A. Banerjee
210
2
0
07 Aug 2020
Privacy-preserving Artificial Intelligence Techniques in Biomedicine
Privacy-preserving Artificial Intelligence Techniques in Biomedicine
Reihaneh Torkzadehmahani
Reza Nasirigerdeh
David B. Blumenthal
T. Kacprowski
M. List
...
Harald H. H. W. Schmidt
A. Schwalber
Christof Tschohl
Andrea Wohner
Jan Baumbach
276
78
0
22 Jul 2020
Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech
  Recognition
Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition
Maarten Van Segbroeck
Sri Harish Reddy Mallidi
Brian King
I-Fan Chen
Gurpreet Chadha
Roland Maas
VLMAI4TS
108
7
0
30 Jun 2020
User Intent Inference for Web Search and Conversational Agents
User Intent Inference for Web Search and Conversational AgentsWeb Search and Data Mining (WSDM), 2020
Ali Ahmadvand
126
4
0
28 May 2020
End-to-End Far-Field Speech Recognition with Unified Dereverberation and
  Beamforming
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
Wangyou Zhang
Aswin Shanmugam Subramanian
Xuankai Chang
Shinji Watanabe
Y. Qian
132
27
0
21 May 2020
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech
  Recognition
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition
Jing Pan
Joshua Shapiro
Jeremy Wohlwend
Kyu Jeong Han
Tao Lei
T. Ma
155
23
0
21 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory
  Prediction
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
377
581
0
18 May 2020
The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR
  Challenge
The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR Challenge
Tien-Hong Lo
Fu-An Chao
Shi-Yan Weng
Berlin Chen
162
11
0
18 May 2020
Fast and Robust Unsupervised Contextual Biasing for Speech Recognition
Fast and Robust Unsupervised Contextual Biasing for Speech Recognition
Young Mo Kang
Yingbo Zhou
132
13
0
04 May 2020
Multi-level Binarized LSTM in EEG Classification for Wearable Devices
Multi-level Binarized LSTM in EEG Classification for Wearable DevicesInternational Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP), 2020
Najmeh Nazari
Seyed Ahmad Mirsalari
Sima Sinaei
M. Salehi
Masoud Daneshtalab
MQ
105
13
0
19 Apr 2020
Multilevel Minimization for Deep Residual Networks
Multilevel Minimization for Deep Residual NetworksESAIM Proceedings and Surveys (ESAIM Proc. Surv.), 2020
Lisa Gaedke-Merzhäuser
Alena Kopanicáková
Rolf Krause
204
17
0
13 Apr 2020
Improving Readability for Automatic Speech Recognition Transcription
Improving Readability for Automatic Speech Recognition Transcription
Junwei Liao
Sefik Emre Eskimez
Liyang Lu
Yu Shi
Ming Gong
Linjun Shou
Hong Qu
Michael Zeng
143
60
0
09 Apr 2020
Direct Speech-to-image Translation
Direct Speech-to-image TranslationIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2020
Jiguo Li
Xinfeng Zhang
Chuanmin Jia
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
178
29
0
07 Apr 2020
Improving noise robust automatic speech recognition with single-channel
  time-domain enhancement network
Improving noise robust automatic speech recognition with single-channel time-domain enhancement networkIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
136
110
0
09 Mar 2020
Deep segmental phonetic posterior-grams based discovery of
  non-categories in L2 English speech
Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech
Xu Li
Xixin Wu
Xunying Liu
Helen Meng
45
1
0
01 Feb 2020
Single headed attention based sequence-to-sequence model for
  state-of-the-art results on Switchboard
Single headed attention based sequence-to-sequence model for state-of-the-art results on SwitchboardInterspeech (Interspeech), 2020
Zoltán Tüske
G. Saon
Kartik Audhkhasi
Brian Kingsbury
BDL
172
70
0
20 Jan 2020
Utterance-level Permutation Invariant Training with Latency-controlled
  BLSTM for Single-channel Multi-talker Speech Separation
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech SeparationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2019
Lu Huang
Gaofeng Cheng
Pengyuan Zhang
Yi Yang
Shumin Xu
Jiasong Sun
110
8
0
25 Dec 2019
A Cycle-GAN Approach to Model Natural Perturbations in Speech for ASR
  Applications
A Cycle-GAN Approach to Model Natural Perturbations in Speech for ASR Applications
Sri Harsha Dumpala
Imran A. Sheikh
Rupayan Chakraborty
Sunil Kumar Kopparapu
GAN
53
7
0
18 Dec 2019
End-to-end training of time domain audio separation and recognition
End-to-end training of time domain audio separation and recognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Thilo von Neumann
K. Kinoshita
Lukas Drude
Christoph Boeddeker
Marc Delcroix
Tomohiro Nakatani
Reinhold Haeb-Umbach
204
35
0
18 Dec 2019
Towards Explainable Deep Neural Networks (xDNN)
Towards Explainable Deep Neural Networks (xDNN)Neural Networks (NN), 2019
Plamen Angelov
Eduardo Soares
AAML
339
283
0
05 Dec 2019
Machine learning for music genre: multifaceted review and
  experimentation with audioset
Machine learning for music genre: multifaceted review and experimentation with audiosetJournal of Intelligence and Information Systems (JIIS), 2019
Jaime Ramírez
M. Flores
VLM
135
57
0
28 Nov 2019
Improving N-gram Language Models with Pre-trained Deep Transformer
Improving N-gram Language Models with Pre-trained Deep Transformer
Yiren Wang
Hongzhao Huang
Zhe Liu
Yutong Pang
Yongqiang Wang
Chengxiang Zhai
Fuchun Peng
73
8
0
22 Nov 2019
Forecasting significant stock price changes using neural networks
Forecasting significant stock price changes using neural networks
F. Kamalov
AIFin
229
91
0
21 Nov 2019
Towards a Model for Spoken Conversational Search
Towards a Model for Spoken Conversational SearchInformation Processing & Management (IPM), 2019
Johanne R. Trippas
Damiano Spina
Paul Thomas
Mark Sanderson
Hideo Joho
L. Cavedon
198
81
0
29 Oct 2019
An Empirical Study of Efficient ASR Rescoring with Transformers
An Empirical Study of Efficient ASR Rescoring with Transformers
Hongzhao Huang
Fuchun Peng
KELM
89
24
0
24 Oct 2019
Detecting Multiple Speech Disfluencies using a Deep Residual Network
  with Bidirectional Long Short-Term Memory
Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term MemoryIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Tedd Kourkounakis
Amirhossein Hajavi
Ali Etemad
134
83
0
17 Oct 2019
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units
  for speech recognition
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Atanas G. Atanasov
Tim Ng
Leo Liu
Henry Mason
Xiaodan Zhuang
Daben Liu
175
41
0
04 Oct 2019
Acoustic Model Adaptation from Raw Waveforms with SincNet
Acoustic Model Adaptation from Raw Waveforms with SincNetAutomatic Speech Recognition & Understanding (ASRU), 2019
Joachim Fainberg
Ondˇrej Klejch
Erfan Loweimi
P. Bell
Steve Renals
109
15
0
30 Sep 2019
A High-Fidelity Open Embodied Avatar with Lip Syncing and Expression
  Capabilities
A High-Fidelity Open Embodied Avatar with Lip Syncing and Expression CapabilitiesInternational Conference on Multimodal Interaction (ICMI), 2019
Deepali Aneja
Daniel J. McDuff
S. Shah
98
39
0
19 Sep 2019
High-Throughput In-Memory Computing for Binary Deep Neural Networks with
  Monolithically Integrated RRAM and 90nm CMOS
High-Throughput In-Memory Computing for Binary Deep Neural Networks with Monolithically Integrated RRAM and 90nm CMOSIEEE Transactions on Electron Devices (IEEE TED), 2019
Shihui Yin
Xiaoyu Sun
Shimeng Yu
Jae-sun Seo
MQ
95
121
0
16 Sep 2019
Feature Engineering and Forecasting via Derivative-free Optimization and
  Ensemble of Sequence-to-sequence Networks with Applications in Renewable
  Energy
Feature Engineering and Forecasting via Derivative-free Optimization and Ensemble of Sequence-to-sequence Networks with Applications in Renewable Energy
Mohammad Pirhooshyaran
K. Scheinberg
L. Snyder
AI4TS
244
28
0
12 Sep 2019
Beyond Human-Level Accuracy: Computational Challenges in Deep Learning
Beyond Human-Level Accuracy: Computational Challenges in Deep LearningACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2019
Joel Hestness
Newsha Ardalani
G. Diamos
111
79
0
03 Sep 2019
Deaf, Hard of Hearing, and Hearing Perspectives on using Automatic
  Speech Recognition in Conversation
Deaf, Hard of Hearing, and Hearing Perspectives on using Automatic Speech Recognition in ConversationInternational ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2017
Abraham Glasser
Kesavan R. Kushalnagar
Raja S. Kushalnagar
88
34
0
03 Sep 2019
Towards Better Understanding of Spontaneous Conversations: Overcoming
  Automatic Speech Recognition Errors With Intent Recognition
Towards Better Understanding of Spontaneous Conversations: Overcoming Automatic Speech Recognition Errors With Intent Recognition
Piotr Żelasko
Jan Mizgajski
Mikolaj Morzy
Adrian Szymczak
Piotr Szymañski
Lukasz Augustyniak
Yishay Carmiel
210
0
0
21 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DVVLMAI4TS
372
226
0
16 Aug 2019
An Inter-Layer Weight Prediction and Quantization for Deep Neural
  Networks based on a Smoothly Varying Weight Hypothesis
An Inter-Layer Weight Prediction and Quantization for Deep Neural Networks based on a Smoothly Varying Weight Hypothesis
Kang-Ho Lee
Joonhyun Jeong
Sung-Ho Bae
131
4
0
16 Jul 2019
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion
  Recognition
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion RecognitionIEEE Transactions on Affective Computing (TAC), 2019
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
J. Epps
Björn W. Schuller
325
111
0
13 Jul 2019
Previous
123
Next