2103.10730
MuRIL: Multilingual Representations for Indian Languages
19 March 2021
Simran Khanuja
Diksha Bansal
Sarvesh Mehtani
Savya Khosla
Atreyee Dey
Balaji Gopalan
D. Margam
Pooja Aggarwal
Rajiv Teja Nagipogu
Shachi Dave
Shruti Gupta
Subhash Chandra Bose Gali
Vishnu Subramanian
Partha P. Talukdar
Papers citing
"MuRIL: Multilingual Representations for Indian Languages"
27 / 27 papers shown
Title
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
M. Asim
A. Rehman
Andreas Dengel
31
0
0
08 May 2025
IndicSQuAD: A Comprehensive Multilingual Question Answering Dataset for Indic Languages
Sharvi Endait
Ruturaj Ghatage
Aditya Kulkarni
Rajlaxmi Patil
Raviraj Joshi
32
0
0
06 May 2025
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey
Xinyu Wang
Wenbo Zhang
Sarah Rajtmajer
29
1
0
24 Oct 2024
Towards Robust Knowledge Representations in Multilingual LLMs for Equivalence and Inheritance based Consistent Reasoning
Gaurav Arora
Srujana Merugu
Shreya Jain
Vaibhav Saxena
LRM
27
0
0
18 Oct 2024
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
Raviraj Joshi
Kanishk Singla
Anusha Kamath
Raunak Kalani
Rakesh Paul
Utkarsh Vaidya
Sanjay Singh Chauhan
Niranjan Wartikar
Eileen Long
SyDa
CLL
31
2
0
18 Oct 2024
Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs
Tamzeed Mahfuz
Satak Kumar Dey
Ruwad Naswan
Hasnaen Adil
Khondker Salman Sayeed
Haz Sameen Shahgir
29
0
0
29 Jun 2024
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Md. Tanvir Rouf Shawon
G. M. Shahariar
18
1
0
25 Jan 2024
From Multilingual Complexity to Emotional Clarity: Leveraging Commonsense to Unveil Emotions in Code-Mixed Dialogues
Shivani Kumar
S. Ramaneswaran
Md. Shad Akhtar
Tanmoy Chakraborty
19
23
0
19 Oct 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip H. S. Torr
Adel Bibi
16
96
0
17 May 2023
Overview of the HASOC Subtrack at FIRE 2022: Offensive Language Identification in Marathi
Tharindu Ranasinghe
Kai North
Damith Premasiri
Marcos Zampieri
14
13
0
18 Nov 2022
Progressive Sentiment Analysis for Code-Switched Text Data
Sudhanshu Ranjan
Dheeraj Mekala
Jingbo Shang
16
3
0
25 Oct 2022
HiNER: A Large Hindi Named Entity Recognition Dataset
Rudra Murthy
Pallab Bhattacharjee
R. Sharnagat
Jyotsana Khatri
Diptesh Kanojia
P. Bhattacharyya
29
13
0
28 Apr 2022
MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages
Gokul Karthik Kumar
Abhishek Singh Gehlot
Sahal Shaji Mullappilly
Karthik Nandakumar
21
13
0
12 Apr 2022
hate-alert@DravidianLangTech-ACL2022: Ensembling Multi-Modalities for Tamil TrollMeme Classification
Mithun Das
Somnath Banerjee
Animesh Mukherjee
VLM
22
6
0
25 Mar 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
38
98
0
24 Mar 2022
Multilingual Abusiveness Identification on Code-Mixed Social Media Text
Ekagra Ranjan
Naman Poddar
8
0
0
01 Mar 2022
TamilEmo: Finegrained Emotion Detection Dataset for Tamil
Charangan Vasantharajan
Sean Benhur
Prasanna Kumar Kumaresan
Rahul Ponnusamy
S. Thangasamy
...
Thenmozhi Durairaj
Kanchana Sivanraju
Anbukkarasi Sampath
Bharathi Raja Chakravarthi
John P. Mccrae
6
5
0
09 Feb 2022
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages
Tushar Abhishek
Shivprasad Sagare
Bhavyajeet Singh
Anubhav Sharma
Manish Gupta
Vasudeva Varma
8
9
0
01 Feb 2022
Hypers at ComMA@ICON: Modelling Aggressiveness, Gender Bias and Communal Bias Identification
Sean Benhur
Roshan Nayak
Kanchana Sivanraju
Adeep Hande
S. Navaneethakrishnan
R. Priyadharshini
Bharathi Raja Chakravarthi
19
1
0
31 Dec 2021
Multilingual Text Classification for Dravidian Languages
Xiaotian Lin
Nankai Lin
Kanoksak Wattanachote
Shengyi Jiang
Lianxi Wang
53
3
0
03 Dec 2021
Ceasing hate with MoH: Hate Speech Detection in Hindi-English Code-Switched Language
Arushi Sharma
Anubha Kabra
Minni Jain
11
51
0
18 Oct 2021
Pretrained Transformers for Offensive Language Identification in Tanglish
Sean Benhur
Kanchana Sivanraju
VLM
29
5
0
06 Oct 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation
Raj Dabre
Himani Shrotriya
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
Pratyush Kumar
23
70
0
07 Sep 2021
Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling
Adeep Hande
Karthik Puranik
Konthala Yasaswini
R. Priyadharshini
Sajeetha Thavareesan
Anbukkarasi Sampath
Kogilavani Shanmugavadivel
D. Thenmozhi
Bharathi Raja Chakravarthi
17
29
0
27 Aug 2021
A Primer on Pretrained Multilingual Language Models
Sumanth Doddapaneni
Gowtham Ramesh
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
LRM
35
73
0
01 Jul 2021
Improving Multilingual Models with Language-Clustered Vocabularies
Hyung Won Chung
Dan Garrette
Kiat Chuan Tan
Jason Riesa
VLM
58
65
0
24 Oct 2020
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,740
0
26 Sep 2016