ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation

arXiv: 2112.06223

12 December 2021
Holy Lovenia
Samuel Cahyawijaya
Genta Indra Winata
Peng-Tao Xu
Xu Yan
Zihan Liu
Rita Frieske
Tiezheng Yu
Wenliang Dai
Elham J. Barezi
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung

Papers citing "ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation"

24 / 24 papers shown
CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition
Jiaming Zhou
Yujie Guo
S. Zhao
Haoqin Sun
Hui Wang
...
Shiyao Wang
Xi Yang
Y. Wang
Yonghua Lin
Yong Qin
41
0
0
26 Feb 2025
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores
Jiaming Zhou
S. Zhao
Hui Wang
Tian-Hao Zhang
Haoqin Sun
Xuechen Wang
Yong Qin
136
3
0
20 Jan 2025
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Pan Zhang
Xiaoyi Dong
Yuhang Cao
Yuhang Zang
Rui Qian
...
X. Zhang
K. Chen
Yu Qiao
D. Lin
Jiaqi Wang
KELM
84
12
0
12 Dec 2024
Enhancing Multilingual Speech Generation and Recognition Abilities in LLMs with Constructed Code-switched Data
Jing Xu
Daxin Tan
Jiaqi Wang
Xiao Chen
19
0
0
17 Sep 2024
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data
Liang-Hsuan Tseng
Zih-Ching Chen
Wei-Shun Chang
Cheng-Kuang Lee
Tsung-Ren Huang
Hung-yi Lee
36
1
0
15 Jul 2024
MaskLID: Code-Switching Language Identification through Iterative Masking
Amir Hossein Kargaran
François Yvon
Hinrich Schütze
21
1
0
10 Jun 2024
Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper
Chih-Kai Yang
Kuan Po Huang
Hung-yi Lee
40
3
0
09 Jun 2024
Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices
Gwantae Kim
Bokyeung Lee
Donghyeon Kim
Hanseok Ko
OffRL
18
0
0
24 Apr 2024
Code-Switched Language Identification is Harder Than You Think
Laurie Burchell
Alexandra Birch
Robert P. Thompson
Kenneth Heafield
11
0
0
02 Feb 2024
Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision
Chih-Kai Yang
Kuan-Po Huang
Ke-Han Lu
Chun-Yi Kuan
Chi-Yuan Hsiao
Hung-yi Lee
45
7
0
30 Dec 2023
IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages
Muhammad Farid Adilazuarda
Samuel Cahyawijaya
Genta Indra Winata
Pascale Fung
Ayu Purwarianti
30
11
0
21 Nov 2023
Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation
A. Seza Doğruöz
Sunayana Sitaram
Zheng-Xin Yong
21
13
0
31 Oct 2023
Adapting the adapters for code-switching in multilingual ASR
Atharva Kulkarni
Ajinkya Kulkarni
Miguel Couceiro
Hanan Aldarmaki
28
3
0
11 Oct 2023
Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition
Samuel Cahyawijaya
Holy Lovenia
Willy Chung
Rita Frieske
Zihan Liu
Pascale Fung
37
1
0
26 Jun 2023
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Puyuan Peng
Brian Yan
Shinji Watanabe
David F. Harwath
VLM
LRM
30
45
0
18 May 2023
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
Zheng-Xin Yong
Ruochen Zhang
Jessica Zosa Forde
Skyler Wang
Arjun Subramonian
...
Yinghua Tan
Long Phan
Rowena Garcia
Thamar Solorio
Alham Fikri Aji
LRM
44
46
0
23 Mar 2023
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization
Ruochen Zhang
Carsten Eickhoff
32
5
0
07 Mar 2023
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges
Genta Indra Winata
Alham Fikri Aji
Zheng-Xin Yong
Thamar Solorio
25
31
0
19 Dec 2022
EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching
Chenxi Whitehouse
Fenia Christopoulou
Ignacio Iacobacci
21
9
0
22 Oct 2022
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation
Thien Nguyen
Nathalie Tran
Liuhui Deng
Thiago Fraga da Silva
Matthew Radzihovsky
...
Honza Silovsky
Arnab Ghoshal
M. Martel
Bharat Ram Ambati
Mohamed Ali
14
5
0
21 Oct 2022
Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands
Wenliang Dai
Samuel Cahyawijaya
Tiezheng Yu
Elham J. Barezi
Pascale Fung
11
1
0
06 Jul 2022
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
Wenliang Dai
Samuel Cahyawijaya
Tiezheng Yu
Elham J. Barezi
Peng-Tao Xu
...
Genta Indra Winata
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung
17
11
0
11 Jan 2022
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset
Tiezheng Yu
Rita Frieske
Peng-Tao Xu
Samuel Cahyawijaya
Cheuk Tung Shadow Yiu
...
Elham J. Barezi
Qifeng Chen
Xiaojuan Ma
Bertram E. Shi
Pascale Fung
RALM
34
9
0
07 Jan 2022
Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences
Genta Indra Winata
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
SyDa
124
92
0
18 Sep 2019