Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.04699
Cited By
TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation
12 May 2018
François Hernandez
Vincent Nguyen
Sahar Ghannay
N. Tomashenko
Yannick Esteve
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation"
50 / 205 papers shown
Title
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
M. Musaev
Saida Mussakhojayeva
Ilyos Khujayorov
Yerbolat Khassanov
M. Ochilov
H. A. Varol
21
19
0
30 Jul 2021
Learning De-identified Representations of Prosody from Raw Audio
J. Weston
R. Lenain
U. Meepegama
E. Fristed
SSL
29
15
0
17 Jul 2021
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition
Christian Huber
Juan Hussain
Sebastian Stüker
A. Waibel
29
24
0
05 Jul 2021
Arabic Code-Switching Speech Recognition using Monolingual Data
Ahmed M. Ali
Shammur A. Chowdhury
A. Hussein
Yasser Hifny
23
23
0
04 Jul 2021
Dealing with training and test segmentation mismatch: FBK@IWSLT2021
Sara Papi
Marco Gaido
Matteo Negri
Marco Turchi
39
6
0
23 Jun 2021
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Yosuke Higuchi
Niko Moritz
Jonathan Le Roux
Takaaki Hori
VLM
35
51
0
16 Jun 2021
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Guoguo Chen
Shuzhou Chai
Guan-Bo Wang
Jiayu Du
Weiqiang Zhang
...
Xuchen Yao
Yongqing Wang
Yujun Wang
Zhao You
Zhiyong Yan
60
351
0
13 Jun 2021
Balanced End-to-End Monolingual pre-training for Low-Resourced Indic Languages Code-Switching Speech Recognition
A. Hussein
Shammur A. Chowdhury
Najim Dehak
Ahmed M. Ali
21
2
0
10 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
24
752
0
08 Jun 2021
CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings
Tatiana Likhomanenko
Qiantong Xu
Gabriel Synnaeve
R. Collobert
A. Rogozhnikov
OOD
ViT
33
55
0
06 Jun 2021
Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?
L. Bentivogli
Mauro Cettolo
Marco Gaido
Alina Karakanta
A. Martinelli
Matteo Negri
Marco Turchi
21
79
0
02 Jun 2021
The Volctrans Neural Speech Translation System for IWSLT 2021
Chengqi Zhao
Zhicheng Liu
Jian-Fei Tong
Tao Wang
Mingxuan Wang
Rong Ye
Qianqian Dong
Jun Cao
Lei Li
32
8
0
16 May 2021
Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation
Marco Gaido
Matteo Negri
Mauro Cettolo
Marco Turchi
VLM
58
25
0
23 Apr 2021
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network
Janne Pylkkönen
Antti Ukkonen
Juho Kilpikoski
Samu Tamminen
Hannes Heikinheimo
26
27
0
22 Apr 2021
EasyCall corpus: a dysarthric speech dataset
Rosanna Turrisi
Arianna Braccia
M. Emanuele
Simone Giulietti
M. Pugliatti
M. Sensi
Luciano Fadiga
Leonardo Badino
19
22
0
06 Apr 2021
SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network
William Chan
Daniel S. Park
Chris A. Lee
Yu Zhang
Quoc V. Le
Mohammad Norouzi
AI4TS
40
136
0
05 Apr 2021
SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition
Patrick K. O’Neill
Vitaly Lavrukhin
Somshubra Majumdar
Vahid Noroozi
Yuekai Zhang
...
Keenan Freyberg
Michael D. Shulman
Boris Ginsburg
Shinji Watanabe
Georg Kucsko
AI4TS
28
59
0
05 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
25
237
0
02 Apr 2021
Construction of a Large-scale Japanese ASR Corpus on TV Recordings
Shintaro Ando
Hiromasa Fujihara
11
21
0
26 Mar 2021
Gaussian Kernelized Self-Attention for Long Sequence Data and Its Application to CTC-based Speech Recognition
Yosuke Kashiwagi
E. Tsunoo
Shinji Watanabe
AI4TS
29
7
0
18 Feb 2021
Improving speech recognition models with small samples for air traffic control systems
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
34
32
0
16 Feb 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OT
SSL
38
112
0
19 Jan 2021
On Knowledge Distillation for Direct Speech Translation
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
30
14
0
09 Dec 2020
Breeding Gender-aware Direct Speech Translation Systems
Marco Gaido
Beatrice Savoldi
L. Bentivogli
Matteo Negri
Marco Turchi
48
20
0
09 Dec 2020
End to End ASR System with Automatic Punctuation Insertion
Yushi Guan
3DV
27
5
0
03 Dec 2020
Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training
Sameer Khurana
Niko Moritz
Takaaki Hori
Jonathan Le Roux
24
54
0
26 Nov 2020
On the use of Self-supervised Pre-trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition
Manon Macary
Marie Tahon
Yannick Esteve
Anthony Rousseau
SSL
22
54
0
18 Nov 2020
DNN-Based Semantic Model for Rescoring N-best Speech Recognition List
Dominique Fohr
Irina Illina
KELM
8
1
0
02 Nov 2020
Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Tatiana Likhomanenko
Qiantong Xu
Vineel Pratap
Paden Tomasello
Jacob Kahn
Gilad Avidov
R. Collobert
Gabriel Synnaeve
39
98
0
22 Oct 2020
Analysis of the BUT Diarization System for VoxConverse Challenge
Federico Landini
O. Glembek
P. Matejka
Johan Rohdin
L. Burget
Mireia Díez
Anna Silnova
21
32
0
22 Oct 2020
They are wearing a mask! Identification of Subjects Wearing a Surgical Mask from their Speech by means of x-vectors and Fisher Vectors
José Vicente Egas López
19
0
0
23 Aug 2020
"This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II)
Arsenii Gorin
Daniil Kulko
Steven Grima
Alex Glasman
17
3
0
04 Aug 2020
Leverage Unlabeled Data for Abstractive Speech Summarization with Self-Supervised Learning and Back-Summarization
P. Tardy
Louis de Seynes
François Hernandez
Vincent Nguyen
D. Janiszek
Yannick Esteve
12
1
0
30 Jul 2020
Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation
P. Tardy
D. Janiszek
Yannick Esteve
Vincent Nguyen
23
10
0
15 Jul 2020
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020
Marco Gaido
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
19
53
0
04 Jun 2020
ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
Maha Elbayad
H. Nguyen
Fethi Bougares
N. Tomashenko
Antoine Caubrière
Benjamin Lecouteux
Yannick Esteve
Laurent Besacier
17
12
0
24 May 2020
Relative Positional Encoding for Speech Recognition and Direct Translation
Ngoc-Quan Pham
Thanh-Le Ha
Tuan-Nam Nguyen
T. Nguyen
Elizabeth Salesky
S. Stueker
Jan Niehues
A. Waibel
20
36
0
20 May 2020
A language score based output selection method for multilingual speech recognition
V. Nguyen
Thi Quynh Khanh Dinh
Truong Thao Nguyen
Dang-Khoa Mac
6
0
0
02 May 2020
Direct Speech-to-image Translation
Jiguo Li
Xinfeng Zhang
Chuanmin Jia
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
36
29
0
07 Apr 2020
Gender Representation in Open Source Speech Resources
Mahault Garnerin
Solange Rossato
Laurent Besacier
24
6
0
18 Mar 2020
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data
Vincent Roger
Jérôme Farinas
J. Pinquier
33
23
0
09 Mar 2020
Toward Cross-Domain Speech Recognition with End-to-End Models
T. Nguyen
Sebastian Stüker
A. Waibel
13
7
0
09 Mar 2020
Learning Robust and Multilingual Speech Representations
Kazuya Kawakami
Luyu Wang
Chris Dyer
Phil Blunsom
Aaron van den Oord
SSL
22
98
0
29 Jan 2020
Transformer-based language modeling and decoding for conversational speech recognition
Kareem Nassar
18
0
0
04 Jan 2020
ATCSpeech: a multilingual pilot-controller speech corpus from real Air Traffic Control environment
Bo Yang
Xianlong Tan
Zhengmao Chen
Bing Wang
Dan Li
Zhongping Yang
Xiping Wu
Yi Lin
16
21
0
26 Nov 2019
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task
H. Nguyen
N. Tomashenko
Marcely Zanon Boito
Antoine Caubrière
Fethi Bougares
Mickael Rouvier
Laurent Besacier
Yannick Esteve
15
8
0
30 Oct 2019
L2RS: A Learning-to-Rescore Mechanism for Automatic Speech Recognition
Yuanfeng Song
Di Jiang
Xuefang Zhao
Qian Xu
Raymond Chi-Wing Wong
Lixin Fan
Qiang Yang
29
17
0
25 Oct 2019
A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
37
716
0
13 Sep 2019
IMS-Speech: A Speech to Text Tool
Pavel Denisov
Ngoc Thang Vu
19
11
0
13 Aug 2019
A Speech Test Set of Practice Business Presentations with Additional Relevant Texts
Dominik Machácek
J. Kratochvíl
Tereza Vojtechová
Ondrej Bojar
14
9
0
02 Aug 2019
Previous
1
2
3
4
5
Next