Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.01176
Cited By
Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
2 July 2020
Brian Roark
Lawrence Wolf-Sonkin
Christo Kirov
Sabrina J. Mielke
Cibu Johny
Isin Demirsahin
Keith B. Hall
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset"
12 / 12 papers shown
Title
Improving Informally Romanized Language Identification
Adrian Benton
Alexander Gutkin
Christo Kirov
Brian Roark
50
0
0
30 Apr 2025
Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
Umer Butt
Stalin Veranasi
Günter Neumann
48
0
0
27 Mar 2025
IndoNLP 2025: Shared Task on Real-Time Reverse Transliteration for Romanized Indo-Aryan languages
Deshan Sumanathilaka
Isuri Anuradha
Ruvan Weerasinghe
Nicholas Micallef
Julian Hough
42
0
0
10 Jan 2025
Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs
Tamzeed Mahfuz
Satak Kumar Dey
Ruwad Naswan
Hasnaen Adil
Khondker Salman Sayeed
Haz Sameen Shahgir
31
0
0
29 Jun 2024
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
M. Kabir
Mohammed Saidul Islam
Md Tahmid Rahman Laskar
Mir Tafseer Nayeem
M Saiful Bari
Enamul Hoque
LM&MA
24
15
0
22 Sep 2023
Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency
Shigeki Karita
R. Sproat
Haruko Ishikawa
27
4
0
07 Jun 2023
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder
J. Clark
Alexander Gutkin
Mihir Kale
Min Ma
...
Dan Garrette
R. Ingle
Melvin Johnson
Dmitry Panteleev
Partha P. Talukdar
ELM
22
38
0
19 May 2023
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data
Akshat Gahoi
Jayant Duneja
Anshul Padhi
Shivam Mangale
Saransh Rajput
Tanvi Kamble
D. Sharma
Vasudeva Varma
25
3
0
21 Oct 2022
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts
Charangan Vasantharajan
Uthayasanker Thayasivam
13
38
0
24 Aug 2021
MuRIL: Multilingual Representations for Indian Languages
Simran Khanuja
Diksha Bansal
Sarvesh Mehtani
Savya Khosla
Atreyee Dey
...
Shachi Dave
Shruti Gupta
Subhash Chandra Bose Gali
Vishnu Subramanian
Partha P. Talukdar
41
277
0
19 Mar 2021
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots
Samson Tan
Shafiq R. Joty
AAML
29
35
0
17 Mar 2021
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015
1