ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.03240
  4. Cited By
An Unsupervised Autoregressive Model for Speech Representation Learning
v1v2 (latest)

An Unsupervised Autoregressive Model for Speech Representation Learning

5 April 2019
Yu-An Chung
Wei-Ning Hsu
Hao Tang
James R. Glass
    SSL
ArXiv (abs)PDFHTML

Papers citing "An Unsupervised Autoregressive Model for Speech Representation Learning"

19 / 269 papers shown
Does Visual Self-Supervision Improve Learning of Speech Representations
  for Emotion Recognition?
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?IEEE Transactions on Affective Computing (IEEE TAC), 2020
Abhinav Shukla
Stavros Petridis
Maja Pantic
SSL
417
33
0
04 May 2020
Improved Speech Representations with Multi-Target Autoregressive
  Predictive Coding
Improved Speech Representations with Multi-Target Autoregressive Predictive CodingAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Yu-An Chung
James R. Glass
SSL
206
57
0
11 Apr 2020
Towards Learning a Universal Non-Semantic Representation of Speech
Towards Learning a Universal Non-Semantic Representation of SpeechInterspeech (Interspeech), 2020
Joel Shor
A. Jansen
Ronnie Maor
Oran Lang
Omry Tuval
Félix de Chaumont Quitry
Marco Tagliasacchi
Ira Shavitt
Dotan Emanuel
Yinnon A. Haviv
SSL
556
166
0
25 Feb 2020
Unsupervised Pre-training of Bidirectional Speech Encoders via Masked
  Reconstruction
Unsupervised Pre-training of Bidirectional Speech Encoders via Masked ReconstructionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Weiran Wang
Qingming Tang
Karen Livescu
SSL
229
99
0
28 Jan 2020
Visually Guided Self Supervised Learning of Speech Representations
Visually Guided Self Supervised Learning of Speech RepresentationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Abhinav Shukla
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Maja Pantic
SSL
166
30
0
13 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent
  Advances, and Future Trends
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
346
88
0
02 Jan 2020
Deep Contextualized Acoustic Representations For Semi-Supervised Speech
  Recognition
Deep Contextualized Acoustic Representations For Semi-Supervised Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Shaoshi Ling
Yuzong Liu
Julian Salazar
Katrin Kirchhoff
SSL
221
145
0
03 Dec 2019
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded
  Speech
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded SpeechInternational Conference on Learning Representations (ICLR), 2019
David Harwath
Wei-Ning Hsu
James R. Glass
170
88
0
21 Nov 2019
Effectiveness of self-supervised pre-training for speech recognition
Effectiveness of self-supervised pre-training for speech recognition
Alexei Baevski
Michael Auli
Abdel-rahman Mohamed
SSL
349
156
0
10 Nov 2019
Speaker-invariant Affective Representation Learning via Adversarial
  Training
Speaker-invariant Affective Representation Learning via Adversarial TrainingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Haoqi Li
Ming Tu
Jing-ling Huang
Shrikanth Narayanan
P. Georgiou
345
60
0
04 Nov 2019
Towards Unsupervised Speech Recognition and Synthesis with Quantized
  Speech Representation Learning
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Alexander H. Liu
Tao Tu
Hung-yi Lee
Lin-Shan Lee
SSL
227
52
0
28 Oct 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep
  Bidirectional Transformer Encoders
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
466
391
0
25 Oct 2019
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Generative Pre-Training for Speech with Autoregressive Predictive CodingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Yu-An Chung
James R. Glass
SSL
330
182
0
23 Oct 2019
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention
  Networks
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention NetworksInterspeech (Interspeech), 2019
Xingcheng Song
Guangsen Wang
Zhiyong Wu
Yiheng Huang
Jane Polak Scowcroft
Dong Yu
Helen Meng
SSL
202
54
0
23 Oct 2019
Improving Transformer-based Speech Recognition Using Unsupervised
  Pre-training
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
263
105
0
22 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
vq-wav2vec: Self-Supervised Learning of Discrete Speech RepresentationsInternational Conference on Learning Representations (ICLR), 2019
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
560
716
0
12 Oct 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Transfer Learning from Audio-Visual Grounding to Speech RecognitionInterspeech (Interspeech), 2019
Wei-Ning Hsu
David Harwath
James R. Glass
SSL
118
33
0
09 Jul 2019
BERTphone: Phonetically-Aware Encoder Representations for
  Utterance-Level Speaker and Language Recognition
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language RecognitionThe Speaker and Language Recognition Workshop (Odyssey), 2019
Shaoshi Ling
Julian Salazar
Yuzong Liu
Katrin Kirchhoff
SSL
200
28
0
30 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep
  Pre-Trained Language Models
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
146
27
0
17 Jun 2019
Previous
123456