Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1911.01102
Cited By
What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
4 November 2019
Chung-Yi Li
Pei-Chieh Yuan
Hung-yi Lee
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis"
16 / 16 papers shown
Tailored Design of Audio-Visual Speech Recognition Models using Branchformers
David Gimeno-Gómez
Carlos David Martínez Hinarejos
421
6
0
09 Jul 2024
Exploration of Adapter for Noise Robust Automatic Speech Recognition
Hao Shi
Tatsuya Kawahara
276
6
0
28 Feb 2024
MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition
Interspeech (Interspeech), 2023
Jiamin Xie
John H. L. Hansen
156
5
0
27 Oct 2023
Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition
International Conference on Artificial Neural Networks (ICANN), 2023
Patrick Eickhoff
M. Möller
Theresa Pekarek-Rosin
Johannes Twiefel
Stefan Wermter
151
4
0
05 Sep 2023
Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Haitao Tang
Yu Fu
Lei Sun
Jiabin Xue
Dan Liu
...
Zhiqiang Ma
Minghui Wu
Jia Pan
Genshun Wan
Ming’En Zhao
163
5
0
27 Jun 2023
Hardware Acceleration of Explainable Artificial Intelligence
Zhixin Pan
Prabhat Mishra
290
2
0
04 May 2023
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Lingwei Meng
Jiawen Kang
Mingyu Cui
Yuejiao Wang
Xixin Wu
Helen M. Meng
196
21
0
20 Feb 2023
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yuan-Jui Chen
Ming Tu
Tang-Chun Li
Xin Li
Qiuqiang Kong
Jiaxin Li
Zhichao Wang
Qiao Tian
Yuping Wang
Yuxuan Wang
222
16
0
27 Oct 2022
Privacy-Preserving Speech Representation Learning using Vector Quantization
Pierre Champion
D. Jouvet
Anthony Larcher
SSL
83
0
0
15 Mar 2022
Towards Relatable Explainable AI with the Perceptual Process
International Conference on Human Factors in Computing Systems (CHI), 2021
Wencan Zhang
Brian Y. Lim
AAML
XAI
268
72
0
28 Dec 2021
Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition
Salima Mdhaffar
J. Bonastre
Marc Tommasi
N. Tomashenko
Yannick Esteve
170
12
0
07 Nov 2021
Focus on the present: a regularization method for the ASR source-target attention layer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Nanxin Chen
Piotr Żelasko
Jesús Villalba
Najim Dehak
200
3
0
02 Nov 2020
Probing Acoustic Representations for Phonetic Properties
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Danni Ma
Neville Ryant
M. Liberman
345
53
0
25 Oct 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
537
394
0
12 Jul 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
324
159
0
18 May 2020
Utterance-level Aggregation For Speaker Recognition In The Wild
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
237
364
0
26 Feb 2019
1
Page 1 of 1