ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.01102
  4. Cited By
What does a network layer hear? Analyzing hidden representations of
  end-to-end ASR through speech synthesis

What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
4 November 2019
Chung-Yi Li
Pei-Chieh Yuan
Hung-yi Lee
ArXiv (abs)PDFHTML

Papers citing "What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis"

16 / 16 papers shown
Tailored Design of Audio-Visual Speech Recognition Models using Branchformers
Tailored Design of Audio-Visual Speech Recognition Models using Branchformers
David Gimeno-Gómez
Carlos David Martínez Hinarejos
421
6
0
09 Jul 2024
Exploration of Adapter for Noise Robust Automatic Speech Recognition
Exploration of Adapter for Noise Robust Automatic Speech Recognition
Hao Shi
Tatsuya Kawahara
276
6
0
28 Feb 2024
MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition
MixRep: Hidden Representation Mixup for Low-Resource Speech RecognitionInterspeech (Interspeech), 2023
Jiamin Xie
John H. L. Hansen
156
5
0
27 Oct 2023
Bring the Noise: Introducing Noise Robustness to Pretrained Automatic
  Speech Recognition
Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech RecognitionInternational Conference on Artificial Neural Networks (ICANN), 2023
Patrick Eickhoff
M. Möller
Theresa Pekarek-Rosin
Johannes Twiefel
Stefan Wermter
151
4
0
05 Sep 2023
Reducing the gap between streaming and non-streaming Transducer-based
  ASR by adaptive two-stage knowledge distillation
Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Haitao Tang
Yu Fu
Lei Sun
Jiabin Xue
Dan Liu
...
Zhiqiang Ma
Minghui Wu
Jia Pan
Genshun Wan
Ming’En Zhao
163
5
0
27 Jun 2023
Hardware Acceleration of Explainable Artificial Intelligence
Hardware Acceleration of Explainable Artificial Intelligence
Zhixin Pan
Prabhat Mishra
290
2
0
04 May 2023
A Sidecar Separator Can Convert a Single-Talker Speech Recognition
  System to a Multi-Talker One
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker OneIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Lingwei Meng
Jiawen Kang
Mingyu Cui
Yuejiao Wang
Xixin Wu
Helen M. Meng
196
21
0
20 Feb 2023
Streaming Voice Conversion Via Intermediate Bottleneck Features And
  Non-streaming Teacher Guidance
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher GuidanceIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yuan-Jui Chen
Ming Tu
Tang-Chun Li
Xin Li
Qiuqiang Kong
Jiaxin Li
Zhichao Wang
Qiao Tian
Yuping Wang
Yuxuan Wang
222
16
0
27 Oct 2022
Privacy-Preserving Speech Representation Learning using Vector
  Quantization
Privacy-Preserving Speech Representation Learning using Vector Quantization
Pierre Champion
D. Jouvet
Anthony Larcher
SSL
83
0
0
15 Mar 2022
Towards Relatable Explainable AI with the Perceptual Process
Towards Relatable Explainable AI with the Perceptual ProcessInternational Conference on Human Factors in Computing Systems (CHI), 2021
Wencan Zhang
Brian Y. Lim
AAMLXAI
268
72
0
28 Dec 2021
Retrieving Speaker Information from Personalized Acoustic Models for
  Speech Recognition
Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition
Salima Mdhaffar
J. Bonastre
Marc Tommasi
N. Tomashenko
Yannick Esteve
170
12
0
07 Nov 2021
Focus on the present: a regularization method for the ASR source-target
  attention layer
Focus on the present: a regularization method for the ASR source-target attention layerIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Nanxin Chen
Piotr Żelasko
Jesús Villalba
Najim Dehak
200
3
0
02 Nov 2020
Probing Acoustic Representations for Phonetic Properties
Probing Acoustic Representations for Phonetic PropertiesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Danni Ma
Neville Ryant
M. Liberman
345
53
0
25 Oct 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for
  Speech
TERA: Self-Supervised Learning of Transformer Encoder Representation for SpeechIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
537
394
0
12 Jul 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio
  Representation
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
324
159
0
18 May 2020
Utterance-level Aggregation For Speaker Recognition In The Wild
Utterance-level Aggregation For Speaker Recognition In The WildIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Weidi Xie
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
237
364
0
26 Feb 2019
1
Page 1 of 1