Unsupervised Cross-lingual Representation Learning for Speech Recognition

24 June 2020
Alexis Conneau
Alexei Baevski
R. Collobert
Abdel-rahman Mohamed
Michael Auli
    SSL

Papers citing "Unsupervised Cross-lingual Representation Learning for Speech Recognition"

50 / 402 papers shown
Temporal Knowledge Distillation for On-device Audio Classification
Kwanghee Choi
Martin Kersner
Jacob Morton
Buru Chang
6
26
0
27 Oct 2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
Ankur Bapna
Yu-An Chung
Na Wu
Anmol Gulati
Ye Jia
J. Clark
Melvin Johnson
Jason Riesa
Alexis Conneau
Yu Zhang
VLM
51
94
0
20 Oct 2021
ASR4REAL: An extended benchmark for speech models
M. Rivière
Jade Copet
Gabriel Synnaeve
AuLLM
39
15
0
16 Oct 2021
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation
Danni Liu
Changhan Wang
Hongyu Gong
Xutai Ma
Yun Tang
J. Pino
17
4
0
15 Oct 2021
Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Rimita Lahiri
K. Kumatani
Eric Sun
Yao Qian
47
6
0
15 Oct 2021
K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables
Jounghee Kim
Pilsung Kang
VLM
15
6
0
11 Oct 2021
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models
Matthew Wiesner
Desh Raj
Sanjeev Khudanpur
51
6
0
10 Oct 2021
Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0
Sameer Khurana
Antoine Laurent
James R. Glass
VLM
35
18
0
07 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
17
217
0
07 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
23
1
0
29 Sep 2021
Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Raluca Alexandra Fetic
Mikkel Jordahn
Lucas Chaves Lima
R. A. F. Egebæk
Martin Carsten Nielsen
Benjamin Biering
Lars Kai Hansen
23
1
0
25 Sep 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
27
77
0
23 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
24
26
0
19 Sep 2021
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition
Felix Wu
Kwangyoun Kim
Jing Pan
Kyu Jeong Han
Kilian Q. Weinberger
Yoav Artzi
25
71
0
14 Sep 2021
Ensuring the Inclusive Use of Natural Language Processing in the Global Response to COVID-19
A. Luccioni
K. H. Pham
C. Lam
Joseph Aylett-Bullock
M. Luengo-Oroz
16
3
0
11 Aug 2021
CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
Anirudh Gupta
Harveen Singh Chadha
Priyanshi Shah
Neeraj Chimmwal
Ankur Dhuriya
Rishabh Gaur
Vivek Raghavan
28
37
0
15 Jul 2021
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
Yun Tang
Hongyu Gong
Xian Li
Changhan Wang
J. Pino
Holger Schwenk
Naman Goyal
34
10
0
14 Jul 2021
Layer-wise Analysis of a Self-supervised Speech Representation Model
Ankita Pasad
Ju-Chieh Chou
Karen Livescu
SSL
26
287
0
10 Jul 2021
Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Andros Tjandra
Diptanu Gon Choudhury
Frank Zhang
Kritika Singh
Alexis Conneau
Alexei Baevski
Assaf Sela
Yatharth Saraf
Michael Auli
VLM
SSL
24
35
0
08 Jul 2021
Pretext Tasks selection for multitask self-supervised speech representation learning
Salah Zaiem
Titouan Parcollet
S. Essid
Abdel Heba
SSL
14
12
0
01 Jul 2021
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Pavel Denisov
Manuel Mager
Ngoc Thang Vu
32
6
0
30 Jun 2021
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition
Cheng-I Jeff Lai
Yang Zhang
Alexander H. Liu
Shiyu Chang
Yi-Lun Liao
Yung-Sung Chuang
Kaizhi Qian
Sameer Khurana
David D. Cox
James R. Glass
VLM
49
70
0
10 Jun 2021
Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition
Yihong Dong
Ying Peng
Muqiao Yang
Songtao Lu
Qingjiang Shi
38
9
0
05 Jun 2021
Multitask Learning for Grapheme-to-Phoneme Conversion of Anglicisms in German Speech Recognition
Julia Pritzen
Michael Gref
Dietlind Zühlke
C. Schmidt
12
1
0
26 May 2021
Unsupervised Speech Recognition
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
12
270
0
24 May 2021
Exploiting Adapters for Cross-lingual Low-resource Speech Recognition
Wenxin Hou
Hanlin Zhu
Yidong Wang
Jindong Wang
Tao Qin
Renjun Xu
T. Shinozaki
19
63
0
18 May 2021
Scaling End-to-End Models for Large-Scale Multilingual ASR
Bo-wen Li
Ruoming Pang
Tara N. Sainath
Anmol Gulati
Yu Zhang
James Qin
Parisa Haghani
W. R. Huang
Min Ma
Junwen Bai
CLL
26
76
0
30 Apr 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
F. Ringeval
D. Schwab
Laurent Besacier
SSL
17
70
0
23 Apr 2021
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
E. Razumovskaia
Goran Glavaš
Olga Majewska
E. Ponti
Anna Korhonen
Ivan Vulić
18
32
0
17 Apr 2021
Multilingual and Cross-Lingual Intent Detection from Spoken Data
D. Gerz
Pei-hao Su
Razvan Kusztos
Avishek Mondal
M. Lis
Eshan Singhal
N. Mrksic
Tsung-Hsien Wen
Ivan Vulić
15
35
0
17 Apr 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
Changhan Wang
Anne Wu
J. Pino
Alexei Baevski
Michael Auli
Alexis Conneau
SSL
31
44
0
14 Apr 2021
On Architectures and Training for Raw Waveform Feature Extraction in ASR
Peter Vieting
Christoph Lüscher
Wilfried Michel
Ralf Schlüter
Hermann Ney
22
9
0
09 Apr 2021
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model
Apoorv Vyas
S. Madikeri
H. Bourlard
11
15
0
06 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
14
235
0
02 Apr 2021
Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages
Nay San
Martijn Bartelds
Mitchell Browne
Lily Clifford
Fiona Gibson
...
Jane Simpson
Myfany Turpin
Maria Vollmer
Sasha Wilmoth
Dan Jurafsky
13
15
0
26 Mar 2021
Let Your Heart Speak in its Mother Tongue: Multilingual Captioning of Cardiac Signals
Dani Kiyasseh
T. Zhu
David A. Clifton
22
0
0
19 Mar 2021
Self-Supervised Learning of Audio Representations from Permutations with Differentiable Ranking
Andrew N. Carr
Quentin Berthet
Mathieu Blondel
O. Teboul
Neil Zeghidour
SSL
8
24
0
17 Mar 2021
XLST: Cross-lingual Self-training to Learn Multilingual Representation for Low Resource Speech Recognition
Zi-qiang Zhang
Yan Song
Ming Wu
Xin Fang
Lirong Dai
SSL
22
21
0
15 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
13
12
0
13 Mar 2021
Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech Recognition
A. Laptev
A. Andrusenko
Ivan Podluzhny
Anton Mitrofanov
Ivan Medennikov
Yuri N. Matveev
VLM
18
14
0
12 Mar 2021
CDPAM: Contrastive learning for perceptual audio similarity
Pranay Manocha
Zeyu Jin
Richard Y. Zhang
Adam Finkelstein
17
68
0
09 Feb 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OT
SSL
30
112
0
19 Jan 2021
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition
Cheng Yi
Shiyu Zhou
Bo Xu
49
40
0
17 Jan 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Changhan Wang
M. Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
J. Pino
Emmanuel Dupoux
SSL
21
459
0
02 Jan 2021
A comparison of self-supervised speech representations as input features for unsupervised acoustic word embeddings
Lisa van Staden
Herman Kamper
SSL
15
16
0
14 Dec 2020
Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition
Genta Indra Winata
Guangsen Wang
Caiming Xiong
S. Hoi
VLM
8
50
0
03 Dec 2020
Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios
Peter Wu
Yifan Zhong
A. Black
18
3
0
01 Dec 2020
Neural Representations for Modeling Variation in Speech
Martijn Bartelds
Wietse de Vries
Faraz Sanal
Caitlin Richter
M. Liberman
Martijn B. Wieling
SSL
DRL
14
22
0
25 Nov 2020
Towards Semi-Supervised Semantics Understanding from Speech
Cheng-I Jeff Lai
Jin Cao
S. Bodapati
Shang-Wen Li
SSL
14
7
0
11 Nov 2020
Supervised Contrastive Learning for Pre-trained Language Model Fine-tuning
Beliz Gunel
Jingfei Du
Alexis Conneau
Ves Stoyanov
15
497
0
03 Nov 2020