Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.00436
Cited By
A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models
29 June 2024
Peiqin Lin
André F. T. Martins
Hinrich Schütze
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models"
8 / 8 papers shown
Title
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Shaoxiong Ji
Hengyu Luo
Jörg Tiedemann
CLL
40
0
0
05 Apr 2025
AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought
Xin Huang
Tarun K. Vangani
Zhengyuan Liu
Bowei Zou
A. Aw
LRM
AI4CE
53
2
0
27 Jan 2025
Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora
Surangika Ranathunga
Nisansa de Silva
Menan Velayuthan
Aloka Fernando
Charitha Rathnayake
25
10
0
12 Feb 2024
Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning
Duarte M. Alves
Nuno M. Guerreiro
Joao Alves
José P. Pombal
Ricardo Rei
José G. C. de Souza
Pierre Colombo
André F.T. Martins
45
47
0
20 Oct 2023
CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task
Ricardo Rei
Marcos Vinícius Treviso
Nuno M. Guerreiro
Chrysoula Zerva
Ana C. Farinha
...
T. Glushkova
Duarte M. Alves
A. Lavie
Luísa Coheur
André F. T. Martins
52
137
0
13 Sep 2022
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
Ouyang Xuan
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua-Hong Wu
Haifeng Wang
51
100
0
31 Dec 2020
MLQA: Evaluating Cross-lingual Extractive Question Answering
Patrick Lewis
Barlas Oğuz
Ruty Rinott
Sebastian Riedel
Holger Schwenk
ELM
239
489
0
16 Oct 2019
Word Translation Without Parallel Data
Alexis Conneau
Guillaume Lample
MarcÁurelio Ranzato
Ludovic Denoyer
Hervé Jégou
158
1,630
0
11 Oct 2017
1