2004.04721
Translation Artifacts in Cross-lingual Transfer Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
9 April 2020
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Papers citing "Translation Artifacts in Cross-lingual Transfer Learning"
50 / 87 papers shown
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
Marii Ojastu
Hele-Andra Kuulmets
Aleksei Dorkin
Marika Borovikova
Dage Särg
Kairit Sirts
201
0
0
21 Nov 2025
TransAlign: Machine Translation Encoders are Strong Word Aligners, Too
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Benedikt Ebing
Christian Goldschmied
Goran Glavaš
136
0
0
31 Oct 2025
Multilingual Dialogue Generation and Localization with Dialogue Act Scripting
Justin Vasselli
Eunike Andriani Kardinata
Yusuke Sakai
Taro Watanabe
73
0
0
26 Sep 2025
Everyday Physics in Korean Contexts: A Culturally Grounded Physical Reasoning Benchmark
Jihae Jeong
DaeYeop Lee
DongGeon Lee
Hwanjo Yu
207
2
0
22 Sep 2025
Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
Samuel J. Bell
Eduardo Sánchez
David Dale
Pontus Stenetorp
Mikel Artetxe
Marta R. Costa-jussá
128
0
0
17 Sep 2025
Evaluating Multilingual and Code-Switched Alignment in LLMs via Synthetic Natural Language Inference
Samir Abdaljalil
E. Serpedin
K. Qaraqe
Hasan Kurban
155
1
0
20 Aug 2025
Metaphor and Large Language Models: When Surface Features Matter More than Deep Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Elisa Sanchez-Bayona
Rodrigo Agerri
219
3
0
21 Jul 2025
Lost in Variation? Evaluating NLI Performance in Basque and Spanish Geographical Variants
Jaione Bengoetxea
Itziar Gonzalez-Dios
Rodrigo Agerri
234
2
0
18 Jun 2025
Emergent Abilities of Large Language Models under Continued Pretraining for Language Adaptation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ahmed Elhady
Eneko Agirre
Mikel Artetxe
CLL
KELM
ELM
352
2
0
30 May 2025
The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Benedikt Ebing
Goran Glavaš
327
1
0
15 May 2025
Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs
Kartik Ravisankar
HyoJung Han
Marine Carpuat
421
7
0
13 Apr 2025
Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study
Calvin Cheng
Scott A. Hale
1.2K
2
0
04 Feb 2025
Comparable Corpora: Opportunities for New Research Directions
Kenneth Church
149
0
0
24 Jan 2025
Language Fusion for Parameter-Efficient Cross-lingual Transfer
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Philipp Borchert
Ivan Vulić
Marie-Francine Moens
Jochen De Weerdt
382
3
0
12 Jan 2025
SiTSE: Sinhala Text Simplification Dataset and Evaluation
Surangika Ranathunga
Rumesh Sirithunga
Himashi Rathnayake
Lahiru De Silva
Thamindu Aluthwala
Saman Peramuna
Ravi Shekhar
394
1
0
02 Dec 2024
Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection
Barah Fazili
Ashish Agrawal
Preethi Jyothi
273
4
0
15 Jul 2024
Mitigating Translationese in Low-resource Languages: The Storyboard Approach
Garry Kuwanto
E. Urua
Priscilla Amuok
Shamsuddeen Hassan Muhammad
Anuoluwapo Aremu
...
Deontae Smith
Praise-EL Michaels
David Ifeoluwa Adelani
Derry Wijaya
Anietie U Andy
226
2
0
14 Jul 2024
M2QA: Multi-domain Multilingual Question Answering
Leon Arne Engländer
Hannah Sterz
Clifton A. Poth
Jonas Pfeiffer
Ilia Kuznetsov
Iryna Gurevych
VLM
271
6
0
01 Jul 2024
SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages
G. Ghazaryan
Erik Arakelyan
Pasquale Minervini
Isabelle Augenstein
SyDa
259
0
0
20 Jun 2024
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Pinzhen Chen
Simon Yu
Zhicheng Guo
Barry Haddow
ELM
376
3
0
18 Jun 2024
BertaQA: How Much Do Language Models Know About Local Culture?
Julen Etxaniz
Gorka Azkune
A. Soroa
Oier López de Lacalle
Mikel Artetxe
290
18
0
11 Jun 2024
Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
Yujin Baek
Koanho Lee
Hyesu Lim
Jaeseok Kim
Junmo Park
Yu-Jung Heo
Du-Seong Chang
Jaegul Choo
161
3
0
04 Jun 2024
The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Wenhao Zhu
Shujian Huang
Fei Yuan
Cheng Chen
Jiajun Chen
Alexandra Birch
LRM
429
7
0
02 May 2024
Evaluation of Few-Shot Learning for Classification Tasks in the Polish Language
Tsimur Hadeliya
D. Kajtoch
256
2
0
27 Apr 2024
XNLIeu: a dataset for cross-lingual NLI in Basque
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Maite Heredia
Julen Etxaniz
Muitze Zulaika
X. Saralegi
Jeremy Barnes
A. Soroa
140
5
0
10 Apr 2024
Meta4XNLI: A Crosslingual Parallel Corpus for Metaphor Detection and Interpretation
Computational Linguistics (CL), 2024
Elisa Sanchez-Bayona
Rodrigo Agerri
282
4
0
10 Apr 2024
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
T. Osório
Bernardo Leite
Henrique Lopes Cardoso
Luís Gomes
João Rodrigues
Rodrigo Santos
António Branco
353
5
0
08 Apr 2024
Latxa: An Open Language Model and Evaluation Suite for Basque
Julen Etxaniz
Oscar Sainz
Naiara Pérez
Itziar Aldabe
German Rigau
Eneko Agirre
Aitor Ormazabal
Mikel Artetxe
A. Soroa
ELM
222
56
0
29 Mar 2024
Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation
International Conference on Language Resources and Evaluation (LREC), 2024
Jaione Bengoetxea
Yi-Ling Chung
Marco Guerini
Rodrigo Agerri
277
13
0
14 Mar 2024
Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning
Ashish Agrawal
Barah Fazili
Preethi Jyothi
251
9
0
03 Feb 2024
Explanatory Argument Extraction of Correct Answers in Resident Medical Exams
Iakes Goenaga
Aitziber Atutxa
Koldo Gojenola
Maite Oronoz
Rodrigo Agerri
ELM
222
10
0
01 Dec 2023
Optimal strategies to perform multilingual analysis of social content for a novel dataset in the tourism domain
Maxime Masson
Rodrigo Agerri
C. Sallaberry
M. Bessagnet
A. L. Parc-Lacayrelle
Philippe Roose
177
5
0
20 Nov 2023
To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Benedikt Ebing
Goran Glavaš
264
7
0
15 Nov 2023
A Material Lens on Coloniality in NLP
William B. Held
Camille Harris
Michael Best
Diyi Yang
339
22
0
14 Nov 2023
Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study
Maarten De Raedt
Semere Kiros Bitew
Fréderic Godin
Thomas Demeester
Chris Develder
246
4
0
11 Nov 2023
Translating away Translationese without Parallel Data
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Rricha Jalota
Koel Dutta Chowdhury
C. España-Bonet
Josef van Genabith
195
7
0
28 Oct 2023
Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation
François Remy
Pieter Delobelle
Bettina Berendt
Kris Demuynck
Thomas Demeester
232
7
0
05 Oct 2023
Promoting Generalized Cross-lingual Question Answering in Few-resource Scenarios via Self-knowledge Distillation
C. Carrino
Carlos Escolano
José A. R. Fonollosa
218
1
0
29 Sep 2023
OYXOY: A Modern NLP Test Suite for Modern Greek
Findings (Findings), 2023
Konstantinos Kogkalidis
S. Chatzikyriakidis
Eirini Chrysovalantou Giannikouri
Vassiliki Katsouli
Christina Klironomou
...
Dimitris Papadakis
Thelka Pasparaki
Erofili Psaltaki
E. Sakellariou
Hara Soupiona
180
0
0
13 Sep 2023
Measuring Spurious Correlation in Classification: 'Clever Hans' in Translationese
Recent Advances in Natural Language Processing (RANLP), 2023
Angana Borah
Daria Pylypenko
C. España-Bonet
Josef van Genabith
164
6
0
25 Aug 2023
Do Multilingual Language Models Think Better in English?
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Julen Etxaniz
Gorka Azkune
Aitor Soroa Etxabe
Oier López de Lacalle
Mikel Artetxe
LRM
250
105
0
02 Aug 2023
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
Transactions of the Association for Computational Linguistics (TACL), 2023
Songbo Hu
Han Zhou
Mete Hergul
Milan Gritta
Guchun Zhang
Ignacio Iacobacci
Ivan Vulić
Anna Korhonen
384
17
0
26 Jul 2023
On Evaluating Multilingual Compositional Generalization with Translated Datasets
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zi Wang
Daniel Hershcovich
335
8
0
20 Jun 2023
Lost in Translation: Large Language Models in Non-English Content Analysis
Gabriel Nicholas
Aliya Bhatia
ELM
280
62
0
12 Jun 2023
Why Does Zero-Shot Cross-Lingual Generation Fail? An Explanation and a Solution
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tianjian Li
Kenton W. Murray
245
28
0
27 May 2023
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tingting Ma
Qianhui Wu
Huiqiang Jiang
Börje F. Karlsson
Tiejun Zhao
Chin-Yew Lin
308
8
0
24 May 2023
Revisiting Machine Translation for Cross-lingual Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mikel Artetxe
Vedanuj Goswami
Shruti Bhosale
Angela Fan
Luke Zettlemoyer
LRM
206
49
0
23 May 2023
Detecting and Mitigating Hallucinations in Multilingual Summarisation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yifu Qiu
Yftah Ziser
Anna Korhonen
Edoardo Ponti
Shay B. Cohen
HILM
380
62
0
23 May 2023
How Good are Commercial Large Language Models on African Languages?
Jessica Ojo
Kelechi Ogueji
178
6
0
11 May 2023
Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Robert Litschko
Ekaterina Artemova
Barbara Plank
212
8
0
09 May 2023