Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.07580
Cited By
mGPT: Few-Shot Learners Go Multilingual
15 April 2022
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
Re-assign community
ArXiv
PDF
HTML
Papers citing
"mGPT: Few-Shot Learners Go Multilingual"
50 / 110 papers shown
Title
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Subrit Dikshit
Ritu Tiwari
Priyank Jain
2
0
0
14 May 2025
PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment
Jong Myoung Kim
Young-Jun_Lee
Ho-Jin Choi
Sangkeun Jung
58
0
0
24 Mar 2025
Strategic resource allocation in memory encoding: An efficiency principle shaping language processing
Weijie Xu
Richard Futrell
48
1
0
18 Mar 2025
Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
Fajri Koto
Rituraj Joshi
Nurdaulet Mukhituly
Y. Wang
Zhuohan Xie
...
Avraham Sheinin
Natalia Vassilieva
Neha Sengupta
Larry Murray
Preslav Nakov
ALM
KELM
41
0
0
03 Mar 2025
Multilingual Language Model Pretraining using Machine-translated Data
Jiayi Wang
Yao Lu
Maurice Weber
Max Ryabinin
David Ifeoluwa Adelani
Yihong Chen
Raphael Tang
Pontus Stenetorp
LRM
75
2
0
20 Feb 2025
LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy
Zhiwen Ruan
Yixia Li
He Zhu
Longyue Wang
Weihua Luo
Kaifu Zhang
Y. Chen
Guanhua Chen
41
0
0
17 Feb 2025
One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Pengfei Cao
Yuheng Chen
Zhuoran Jin
Yubo Chen
Kang-Jun Liu
Jun Zhao
KELM
65
0
0
26 Nov 2024
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
VLM
75
0
0
21 Nov 2024
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
Jiayi Wang
Yao Lu
Maurice Weber
Max Ryabinin
Yihong Chen
Raphael Tang
Pontus Stenetorp
LRM
39
1
0
31 Oct 2024
MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas
Srinath Sridhar
71
1
0
22 Oct 2024
Linguistically-Informed Multilingual Instruction Tuning: Is There an Optimal Set of Languages to Tune?
Gürkan Soykan
Gözde Gül Şahin
23
0
0
10 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
32
1
0
06 Oct 2024
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
Akhilesh Aravapalli
Mounika Marreddy
S. Oota
R. Mamidi
Manish Gupta
29
0
0
03 Oct 2024
LangSAMP: Language-Script Aware Multilingual Pretraining
Yihong Liu
Haotian Ye
Chunlan Ma
Mingyang Wang
Hinrich Schütze
VLM
29
0
0
26 Sep 2024
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Shaoxiong Ji
Zihao Li
Indraneil Paul
Jaakko Paavola
Peiqin Lin
...
Dayyán O'Brien
Hengyu Luo
Hinrich Schütze
Jörg Tiedemann
Barry Haddow
CLL
35
3
0
26 Sep 2024
How Transliterations Improve Crosslingual Alignment
Yihong Liu
Mingyang Wang
Amir Hossein Kargaran
Ayyoob Imani
Orgest Xhelili
Haotian Ye
Chunlan Ma
François Yvon
Hinrich Schütze
31
2
0
25 Sep 2024
Pruning Multilingual Large Language Models for Multilingual Inference
Hwichan Kim
Jun Suzuki
Tosho Hirasawa
Mamoru Komachi
19
0
0
25 Sep 2024
On the Role of Context in Reading Time Prediction
Andreas Opedal
Eleanor Chodroff
Ryan Cotterell
Ethan Gotlieb Wilcox
33
7
0
12 Sep 2024
PsychoLex: Unveiling the Psychological Mind of Large Language Models
Mohammad Amin Abbasi
Farnaz Sadat Mirnezami
Hassan Naderi
LM&MA
30
1
0
16 Aug 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
M. Shoeybi
Bryan Catanzaro
34
5
0
08 Jul 2024
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs
Ekaterina Taktasheva
Maxim Bazhukov
Kirill Koncha
Alena Fenogenova
Ekaterina Artemova
Vladislav Mikhailov
31
9
0
27 Jun 2024
Preference Tuning For Toxicity Mitigation Generalizes Across Languages
Xiaochen Li
Zheng-Xin Yong
Stephen H. Bach
CLL
26
13
0
23 Jun 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
19
1
0
20 Jun 2024
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages
Fabian David Schmidt
Philipp Borchert
Ivan Vulić
Goran Glavaš
42
5
0
18 Jun 2024
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
Jiakuan Xie
Pengfei Cao
Yuheng Chen
Yubo Chen
Kang Liu
Jun Zhao
KELM
32
3
0
17 Jun 2024
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
Trinh Pham
Khoi M. Le
Luu Anh Tuan
29
1
0
14 Jun 2024
MACT: Model-Agnostic Cross-Lingual Training for Discourse Representation Structure Parsing
Jiangming Liu
37
1
0
03 Jun 2024
Multilingual Text Style Transfer: Datasets & Models for Indian Languages
Sourabrata Mukherjee
Atul Kr. Ojha
Akanksha Bansal
D. Alok
John P. Mccrae
Ondrej Dusek
VLM
30
7
0
31 May 2024
LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language
Cagri Toraman
VLM
30
5
0
13 May 2024
Bridging the Bosphorus: Advancing Turkish Large Language Models through Strategies for Low-Resource Language Adaptation and Benchmarking
Emre Can Acikgoz
Mete Erdogan
Deniz Yuret
30
7
0
07 May 2024
What Drives Performance in Multilingual Language Models?
Sina Bagheri Nezhad
Ameeta Agrawal
LRM
35
9
0
29 Apr 2024
Introducing cosmosGPT: Monolingual Training for Turkish Language Models
Himmet Toprak Kesgin
M. K. Yuce
Eren Dogan
M. E. Uzun
Atahan Uz
H. E. Seyrek
Ahmed Zeer
M. Amasyalı
35
9
0
26 Apr 2024
Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models
Eren Dogan
M. E. Uzun
Atahan Uz
H. E. Seyrek
Ahmed Zeer
Ezgi Sevi
Himmet Toprak Kesgin
M. K. Yuce
M. Amasyalı
ELM
21
0
0
25 Apr 2024
Understanding Cross-Lingual Alignment -- A Survey
Katharina Hämmerl
Jindvrich Libovický
Alexander M. Fraser
36
10
0
09 Apr 2024
SambaLingo: Teaching Large Language Models New Languages
Zoltan Csaki
Bo Li
Jonathan Li
Qiantong Xu
Pian Pawakapan
Leon Zhang
Yun Du
Hengyu Zhao
Changran Hu
Urmish Thakker
32
6
0
08 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Yinghui Li
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
47
36
0
07 Apr 2024
MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages
Daryna Dementieva
N. Babakov
Alexander Panchenko
35
6
0
02 Apr 2024
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Taishi Nakamura
Mayank Mishra
Simone Tedeschi
Yekun Chai
Jason T Stillerman
...
Virendra Mehta
Matthew Blumberg
Victor May
Huu Nguyen
S. Pyysalo
LRM
21
7
0
30 Mar 2024
Latxa: An Open Language Model and Evaluation Suite for Basque
Julen Etxaniz
Oscar Sainz
Naiara Pérez
Itziar Aldabe
German Rigau
Eneko Agirre
Aitor Ormazabal
Mikel Artetxe
A. Soroa
ELM
34
22
0
29 Mar 2024
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context
Nihar Ranjan Sahoo
Pranamya Prashant Kulkarni
Narjis Asad
Arif Ahmad
Tanu Goyal
Aparna Garimella
Pushpak Bhattacharyya
24
8
0
29 Mar 2024
Attention-aware semantic relevance predicting Chinese sentence reading
Kun Sun
16
1
0
27 Mar 2024
RuBia: A Russian Language Bias Detection Dataset
Veronika Grigoreva
Anastasiia Ivanova
I. Alimova
Ekaterina Artemova
30
1
0
26 Mar 2024
Computational Sentence-level Metrics Predicting Human Sentence Comprehension
Kun Sun
Rong Wang
44
0
0
23 Mar 2024
GlossLM: Multilingual Pretraining for Low-Resource Interlinear Glossing
Michael Ginn
Lindia Tjuatja
Taiqi He
Enora Rice
Graham Neubig
Alexis Palmer
Lori Levin University of Colorado
23
3
0
11 Mar 2024
From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models
Luiza Amador Pozzobon
Patrick Lewis
Sara Hooker
B. Ermiş
36
7
0
06 Mar 2024
Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?
E. Razumovskaia
Ivan Vulić
Anna Korhonen
29
6
0
04 Mar 2024
On the Scaling Laws of Geographical Representation in Language Models
Nathan Godey
Eric Villemonte de la Clergerie
Benoît Sagot
31
6
0
29 Feb 2024
Advancing Generative AI for Portuguese with Open Decoder Gervásio PT*
Rodrigo Santos
Joao Silva
Luís Gomes
João Rodrigues
António Branco
39
10
0
29 Feb 2024
Spot the bot: Coarse-Grained Partition of Semantic Paths for Bots and Humans
Vasilii A. Gromov
A. S. Kogan
27
1
0
27 Feb 2024
GlórIA -- A Generative and Open Large Language Model for Portuguese
Ricardo Lopes
João Magalhães
David Semedo
25
8
0
20 Feb 2024
1
2
3
Next