First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

26 January 2021

Papers citing "First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT"

50 / 53 papers shown

Model-Based Ranking of Source Languages for Zero-Shot Cross-Lingual Transfer

Abteen Ebrahimi

Adam Wiemerslage

Katharina von der Wense

LRM

167

03 Oct 2025

Safe and Efficient In-Context Learning via Risk Control

104

02 Oct 2025

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

158

28 Jun 2025

Large Language Models as Psychological Simulators: A Methodological Guide

Zhicheng Lin

LLMAG

239

20 Jun 2025

Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline

338

26 May 2025

The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs

Lucas Bandarkar

Nanyun Peng

MoMe LRM

307

23 May 2025

High-Dimensional Interlingual Representations of Large Language Models

562

14 Mar 2025

Language Models' Factuality Depends on the Language of Inquiry

293

25 Feb 2025

Beyond Literal Token Overlap: Token Alignability for MultilingualityNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

194

10 Feb 2025

Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zheng Zhao

Yftah Ziser

Shay B. Cohen

195

25 Oct 2024

The Same But Different: Structural Similarities and Differences in Multilingual Language ModelingInternational Conference on Learning Representations (ICLR), 2024

255

11 Oct 2024

MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Amir Hossein Kargaran

336

08 Oct 2024

Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

443

02 Oct 2024

Probing the Emergence of Cross-lingual Alignment during LLM TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Hetong Wang

Pasquale Minervini

Edoardo Ponti

357

19 Jun 2024

Understanding the role of FFNs in driving multilingual behaviour in LLMs

Sunit Bhattacharya

Ondrej Bojar

157

22 Apr 2024

Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred KnowledgeConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024

181

08 Mar 2024

Analysis of Multi-Source Language Training in Cross-Lingual Transfer

233

21 Feb 2024

The Hidden Space of Transformer Language Adapters

Jesujoba Oluwadara Alabi

363

20 Feb 2024

Do Llamas Work in English? On the Latent Language of Multilingual Transformers

549

213

16 Feb 2024

Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models

Sara Rajaee

Christof Monz

238

03 Feb 2024

Discovering Low-rank Subspaces for Language-agnostic Multilingual RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zhihui Xie

Handong Zhao

Tong Yu

Shuai Li

219

11 Jan 2024

MELA: Multilingual Evaluation of Linguistic AcceptabilityAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Ziyin Zhang

Rui Wang

292

15 Nov 2023

A Joint Matrix Factorization Analysis of Multilingual RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

252

24 Oct 2023

Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization

Xuanjing Huang

299

19 Oct 2023

Comparing Styles across Languages: A Cross-Cultural Exploration of PolitenessConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

456

11 Oct 2023

Few-Shot Spoken Language Understanding via Joint Speech-Text ModelsAutomatic Speech Recognition & Understanding (ASRU), 2023

242

09 Oct 2023

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language VariantsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Luke Zettlemoyer

Madian Khabsa

360

233

31 Aug 2023

Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language ModelsInternational Conference on Machine Learning (ICML), 2023

Phillip Rust

Anders Søgaard

174

17 Aug 2023

Gradient Sparsification For Masked Fine-Tuning of TransformersIEEE International Joint Conference on Neural Network (IJCNN), 2023

J. Ó. Neill

Sourav Dutta

153

19 Jul 2023

CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity RecognitionAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Huiqiang Jiang

245

24 May 2023

mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language ModelsFindings (Findings), 2023

André F. T. Martins

223

23 May 2023

How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Rochelle Choenni

Dan Garrette

Ekaterina Shutova

328

22 May 2023

Measuring Cross-Lingual Transferability of Multilingual Transformers on Sentence Classification

Zewen Chi

Heyan Huang

Xian-Ling Mao

243

15 May 2023

Identifying the Correlation Between Language Distance and Cross-Lingual Transfer in a Multilingual Representation Space

Fred Philippy

Siwen Guo

Shohreh Haddadan

152

03 May 2023

ContraSim -- A Similarity Measure Based on Contrastive LearningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

Adir Rahamim

Yonatan Belinkov

SSL

224

29 Mar 2023

In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across LanguagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

224

23 Feb 2023

Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Xuanjing Huang

226

21 Dec 2022

WIDER & CLOSER: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity RecognitionConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

184

07 Dec 2022

Cross-lingual Similarity of Multilingual Representations Revisited

Maksym Del

Mark Fishel

132

04 Dec 2022

Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer

229

04 Dec 2022

A Commonsense-Infused Language-Agnostic Learning Framework for Enhancing Prediction of Political Polarity in Multilingual News HeadlinesKnowledge-Based Systems (KBS), 2022

Swati Swati

Adrian Mladenic Grobelnik

Dunja Mladenić

M. Grobelnik

209

01 Dec 2022

Discovering Language-neutral Sub-networks in Multilingual Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

272

25 May 2022

Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Terra Blevins

Hila Gonen

Luke Zettlemoyer

LRM

231

24 May 2022

OneAligner: Zero-shot Cross-lingual Transfer with One Rich-Resource Language Pair for Low-Resource Sentence RetrievalFindings (Findings), 2022

Yingbo Zhou

119

17 May 2022

Feature Aggregation in Zero-Shot Cross-Lingual Transfer Using Multilingual BERTInternational Conference on Pattern Recognition (ICPR), 2022

200

17 May 2022

Combining Static and Contextualised Multilingual EmbeddingsFindings (Findings), 2022

Katharina Hämmerl

Jindrich Libovický

Kangyang Luo

217

17 Mar 2022

Cross-Lingual Ability of Multilingual Masked Language Models: A Study of Language StructureAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

158

16 Mar 2022

Multi-Level Contrastive Learning for Cross-Lingual AlignmentIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

238

26 Feb 2022

Does Transliteration Help Multilingual Language Modeling?Findings (Findings), 2022

Ibraheem Muhammad Moosa

Mahmud Elahi Akhter

Ashfia Binte Habib

302

29 Jan 2022

Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?

Arij Riabi

Benoît Sagot

Djamé Seddah

267

26 Oct 2021