1906.01502
How multilingual is Multilingual BERT?
4 June 2019
Telmo Pires
Eva Schlinger
Dan Garrette
LRM
VLM
Papers citing "How multilingual is Multilingual BERT?"
50 / 655 papers shown
ALLaM: Large Language Models for Arabic and English
M Saiful Bari
Yazeed Alnumay
Norah A. Alzahrani
Nouf M. Alotaibi
H. A. Alyahya
...
Jeril Kuriakose
Abdalghani Abujabal
Nora Al-Twairesh
Areeb Alowisheq
Haidar Khan
34
11
0
22 Jul 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
Nandini Mundra
Aditya Nanda Kishore
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Mitesh Khapra
30
3
0
08 Jul 2024
Sociocultural Considerations in Monitoring Anti-LGBTQ+ Content on Social Media
Sidney G.-J. Wong
19
0
0
01 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark J. F. Gales
Kate Knill
20
1
0
01 Jul 2024
Self-Translate-Train: A Simple but Strong Baseline for Cross-lingual Transfer of Large Language Models
Ryokan Ri
Shun Kiyono
Sho Takase
SyDa
21
0
0
29 Jun 2024
Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
Orgest Xhelili
Yihong Liu
Hinrich Schütze
31
6
0
28 Jun 2024
Multilingual Knowledge Editing with Language-Agnostic Factual Neurons
Xue Zhang
Yunlong Liang
Fandong Meng
Songming Zhang
Yufeng Chen
Jinan Xu
Jie Zhou
KELM
42
4
0
24 Jun 2024
Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models
Lynn Chua
Badih Ghazi
Yangsibo Huang
Pritish Kamath
Ravi Kumar
Pasin Manurangsi
Amer Sinha
Chulin Xie
Chiyuan Zhang
53
1
0
23 Jun 2024
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
67
4
0
22 Jun 2024
VAIYAKARANA: A Benchmark for Automatic Grammar Correction in Bangla
Pramit Bhattacharyya
Arnab Bhattacharya
25
0
0
20 Jun 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
31
1
0
20 Jun 2024
AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought
Yongheng Zhang
Qiguang Chen
Min Li
Wanxiang Che
Libo Qin
LRM
38
5
0
20 Jun 2024
Probing the Emergence of Cross-lingual Alignment during LLM Training
Hetong Wang
Pasquale Minervini
E. Ponti
36
7
0
19 Jun 2024
Synergizing Foundation Models and Federated Learning: A Survey
Shenghui Li
Fanghua Ye
Meng Fang
Jiaxu Zhao
Yun-Hin Chan
Edith C.-H. Ngai
Thiemo Voigt
AI4CE
47
5
0
18 Jun 2024
Multilingual Large Language Models and Curse of Multilinguality
Daniil Gurgurov
Tanja Bäumel
Tatiana Anikina
78
4
0
15 Jun 2024
Investigating the translation capabilities of Large Language Models trained on parallel data only
Javier García Gilabert
Carlos Escolano
Aleix Sant Savall
Francesca de Luca Fornaciari
Audrey Mash
Xixian Liao
Maite Melero
LRM
42
2
0
13 Jun 2024
Decipherment-Aware Multilingual Learning in Jointly Trained Language Models
Grandee Lee
34
0
0
11 Jun 2024
ThaiCoref: Thai Coreference Resolution Dataset
Pontakorn Trakuekul
Wei Qi Leong
Charin Polpanumas
Jitkapat Sawatphol
William-Chandra Tjhi
Attapol T. Rutherford
16
0
0
10 Jun 2024
An Open Multilingual System for Scoring Readability of Wikipedia
Mykola Trokhymovych
Indira Sen
Martin Gerlach
35
3
0
03 Jun 2024
Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition
Zhuojun Ding
Wei Wei
Xiaoye Qu
Dangyang Chen
31
2
0
03 Jun 2024
Exploring Alignment in Shared Cross-lingual Spaces
Basel Mousi
Nadir Durrani
Fahim Dalvi
Majd Hawasly
Ahmed Abdelali
30
0
0
23 May 2024
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
Yihong Liu
Chunlan Ma
Haotian Ye
Hinrich Schütze
36
4
0
16 May 2024
For the Misgendered Chinese in Gender Bias Research: Multi-Task Learning with Knowledge Distillation for Pinyin Name-Gender Prediction
Xiaocong Du
Haipeng Zhang
CVBM
24
1
0
10 May 2024
From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences
Prashant Kodali
Anmol Goel
Likhith Asapu
Vamshi Bonagiri
Anirudh Govil
Monojit Choudhury
Manish Shrivastava
Ponnurangam Kumaraguru
42
0
0
09 May 2024
Vietnamese AI Generated Text Detection
Quang-Dan Tran
Van-Quan Nguyen
Quang-Huy Pham
K. B. T. Nguyen
Trong-Hop Do
DeLMO
22
1
0
06 May 2024
Unknown Script: Impact of Script on Cross-Lingual Transfer
Wondimagegnhue Tufa
Ilia Markov
Piek Vossen
37
0
0
29 Apr 2024
Interpreting Answers to Yes-No Questions in Dialogues from Multiple Domains
Zijie Wang
Farzana Rashid
Eduardo Blanco
30
0
0
25 Apr 2024
Understanding the role of FFNs in driving multilingual behaviour in LLMs
Sunit Bhattacharya
Ondrej Bojar
21
2
0
22 Apr 2024
mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models?
Tianze Hua
Tian Yun
Ellie Pavlick
LRM
24
9
0
18 Apr 2024
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment
Zhaofeng Wu
Ananth Balashankar
Yoon Kim
Jacob Eisenstein
Ahmad Beirami
46
13
0
18 Apr 2024
Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation
Shaomu Tan
Di Wu
Christof Monz
MoMe
34
7
0
17 Apr 2024
ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
Quan Van Nguyen
Dan Quang Tran
Huy Quang Pham
Thang Kien-Bao Nguyen
Nghia Hieu Nguyen
Kiet Van Nguyen
N. Nguyen
CoGe
37
3
0
16 Apr 2024
Measuring Cross-lingual Transfer in Bytes
Leandro Rodrigues de Souza
Thales Sales Almeida
R.A. Lotufo
Rodrigo Nogueira
CLL
27
3
0
12 Apr 2024
The Role of Language Imbalance in Cross-lingual Generalisation: Insights from Cloned Language Experiments
Anton Schäfer
Shauli Ravfogel
Thomas Hofmann
Tiago Pimentel
Imanol Schlag
55
3
0
11 Apr 2024
Language-Independent Representations Improve Zero-Shot Summarization
V. Solovyev
Danni Liu
Jan Niehues
27
0
0
08 Apr 2024
Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly
Changjiang Gao
Hongda Hu
Peng Hu
Jiajun Chen
Jixing Li
Shujian Huang
36
17
0
06 Apr 2024
IITK at SemEval-2024 Task 1: Contrastive Learning and Autoencoders for Semantic Textual Relatedness in Multilingual Texts
Udvas Basak
Rajarshi Dutta
Shivam Pandey
Ashutosh Modi
31
2
0
06 Apr 2024
Adaptive Cross-lingual Text Classification through In-Context One-Shot Demonstrations
Emilio Villa-Cueva
A. P. López-Monroy
Fernando Sánchez-Vega
Thamar Solorio
VLM
38
3
0
03 Apr 2024
Africa-Centric Self-Supervised Pre-Training for Multilingual Speech Representation in a Sub-Saharan Context
Antoine Caubrière
Elodie Gauthier
19
1
0
02 Apr 2024
A Systematic Analysis of Subwords and Cross-Lingual Transfer in Multilingual Translation
Francois Meyer
Jan Buys
29
1
0
29 Mar 2024
Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets
Shadi Manafi
Nikhil Krishnaswamy
AAML
40
0
0
29 Mar 2024
Multilingual Sentence-T5: Scalable Sentence Encoders for Multilingual Applications
Chihiro Yano
Akihiko Fukuchi
Shoko Fukasawa
Hideyuki Tachibana
Yotaro Watanabe
39
2
0
26 Mar 2024
MOGAM: A Multimodal Object-oriented Graph Attention Model for Depression Detection
Junyeop Cha
Seoyun Kim
Dongjae Kim
Eunil Park
15
2
0
21 Mar 2024
X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment
Dongjae Shin
Hyunseok Lim
Inho Won
Changsu Choi
Minjun Kim
Seungwoo Song
Hangyeol Yoo
Sangmin Kim
Kyungtae Lim
21
5
0
18 Mar 2024
Pre-Trained Language Models Represent Some Geographic Populations Better Than Others
Jonathan Dunn
Benjamin Adams
Harish Tayyar Madabushi
24
3
0
16 Mar 2024
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
Changsu Choi
Yongbin Jeong
Seoyoon Park
Inho Won
HyeonSeok Lim
...
Yiseul Lee
HyeJin Lee
Younggyun Hahm
Hansaem Kim
Kyungtae Lim
29
11
0
16 Mar 2024
GlossLM: Multilingual Pretraining for Low-Resource Interlinear Glossing
Michael Ginn
Lindia Tjuatja
Taiqi He
Enora Rice
Graham Neubig
Alexis Palmer
Lori Levin
35
3
0
11 Mar 2024
Persian Slang Text Conversion to Formal and Deep Learning of Persian Short Texts on Social Media for Sentiment Classification
Mohsen Khazeni
Mohammad Heydari
Amir Albadvi
26
0
0
09 Mar 2024
Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge
Xin Zhao
Naoki Yoshinaga
Daisuke Oba
KELM
HILM
27
10
0
08 Mar 2024
Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text
Frances Adriana Laureano De Leon
Harish Tayyar Madabushi
Mark Lee
33
3
0
07 Mar 2024