1906.01502
How multilingual is Multilingual BERT?
4 June 2019
Telmo Pires
Eva Schlinger
Dan Garrette
LRM
VLM
Papers citing "How multilingual is Multilingual BERT?"
50 / 655 papers shown
Title
A Scalable Unsupervised Framework for multi-aspect labeling of Multilingual and Multi-Domain Review Data
Jiin Park
Misuk Kim
13
0
0
14 May 2025
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Subrit Dikshit
Ritu Tiwari
Priyank Jain
16
0
0
14 May 2025
Investigating the Effect of Parallel Data in the Cross-Lingual Transfer for Vision-Language Encoders
Andrei-Alexandru Manea
Jindřich Libovický
VLM
52
0
0
30 Apr 2025
MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs
Jaap Jumelet
Leonie Weissweiler
Arianna Bisazza
38
2
0
03 Apr 2025
Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training
Zhijun Wang
Jiahuan Li
Hao Zhou
Rongxiang Weng
J. Wang
Xin Huang
Xue Han
Junlan Feng
Chao Deng
Shujian Huang
LRM
48
1
0
02 Apr 2025
Redefining technology for indigenous languages
Silvia Fernandez-Sabido
Laura Peniche-Sabido
31
0
0
02 Apr 2025
Advancing Sentiment Analysis in Tamil-English Code-Mixed Texts: Challenges and Transformer-Based Solutions
Mikhail Krasitskii
Olga Kolesnikova
Liliana Chanona Hernandez
Grigori Sidorov
Alexander Gelbukh
49
1
0
30 Mar 2025
Untangling the Influence of Typology, Data and Model Architecture on Ranking Transfer Languages for Cross-Lingual POS Tagging
Enora Rice
Ali Marashian
Hannah Haynie
K. Wense
Alexis Palmer
44
0
0
25 Mar 2025
High-Dimensional Interlingual Representations of Large Language Models
Bryan Wilie
Samuel Cahyawijaya
Junxian He
Pascale Fung
52
0
0
14 Mar 2025
Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models
Julian Spravil
Sebastian Houben
Sven Behnke
VLM
68
0
0
12 Mar 2025
CareerBERT: Matching Resumes to ESCO Jobs in a Shared Embedding Space for Generic Job Recommendations
Julian Rosenberger
Lukas Wolfrum
Sven Weinzierl
Mathias Kraus
Patrick Zschech
50
0
0
03 Mar 2025
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Masahiro Kaneko
Alham Fikri Aji
Timothy Baldwin
67
0
0
17 Feb 2025
Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation
Vera Neplenbroek
Arianna Bisazza
Raquel Fernández
100
0
0
17 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Yuexian Zou
83
2
0
10 Feb 2025
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models
Hieu Man
Nghia Trung Ngo
Viet Dac Lai
Ryan Rossi
Franck Dernoncourt
T. Nguyen
127
0
0
01 Jan 2025
Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation
Samin Mahdizadeh Sani
Pouya Sadeghi
Thuy-Trang Vu
Yadollah Yaghoobzadeh
Gholamreza Haffari
71
2
0
17 Dec 2024
Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Models
Sina Bagheri Nezhad
Ameeta Agrawal
Rhitabrat Pokharel
LRM
74
2
0
17 Dec 2024
The Evolution and Future Perspectives of Artificial Intelligence Generated Content
Chengzhang Zhu
Luobin Cui
Ying Tang
Jiacun Wang
89
1
0
02 Dec 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
68
2
0
28 Nov 2024
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
Thai-Binh Nguyen
Alexander Waibel
74
1
0
27 Nov 2024
One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Pengfei Cao
Yuheng Chen
Zhuoran Jin
Yubo Chen
Kang-Jun Liu
Jun Zhao
KELM
70
0
0
26 Nov 2024
Development of Pre-Trained Transformer-based Models for the Nepali Language
Prajwal Thapa
Jinu Nyachhyon
Mridul Sharma
Bal Krishna Bal
71
0
0
24 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
77
0
0
20 Nov 2024
Deploying Multi-task Online Server with Large Language Model
Yincen Qu
Chao Ma
Xiangying Dai
Hui Zhou
Yiting Wu
Hengyue Liu
26
0
0
06 Nov 2024
Investigating Idiomaticity in Word Representations
Wei He
Tiago Kramer Vieira
Marcos García
Carolina Scarton
M. Idiart
Aline Villavicencio
34
1
0
04 Nov 2024
Do LLMs Know to Respect Copyright Notice?
Jialiang Xu
Shenglan Li
Zhaozhuo Xu
Denghui Zhang
21
2
0
02 Nov 2024
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents
Ankan Mullick
Sombit Bose
Abhilash Nandy
G. Chaitanya
Pawan Goyal
24
0
0
29 Oct 2024
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Haoyu Song
W. Zhang
Kaiyan Zhang
Ting Liu
32
3
0
26 Oct 2024
Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch
Donglin Di
Weinan Zhang
Yue Zhang
Fanglin Wang
23
1
0
24 Oct 2024
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Guijin Son
Dongkeun Yoon
Juyoung Suk
Javier Aula-Blasco
Mano Aslan
Vu Trong Kim
Shayekh Bin Islam
Jaume Prats-Cristià
Lucía Tormo-Bañuelos
Seungone Kim
ELM
LRM
25
0
0
23 Oct 2024
Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis
Yiyi Chen
Qiongxiu Li
Russa Biswas
Johannes Bjerva
34
1
0
17 Oct 2024
MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
Boyang Xue
Hongru Wang
Rui Wang
Sheng Wang
Zezhong Wang
Yiming Du
Bin Liang
Kam-Fai Wong
29
0
0
16 Oct 2024
Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models
Hongchuan Zeng
Senyu Han
Lu Chen
Kai Yu
57
6
0
15 Oct 2024
Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios?
Zeno Vandenbulcke
Lukas Vermeire
Miryam de Lhoneux
26
0
0
14 Oct 2024
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling
Ruochen Zhang
Qinan Yu
Matianyu Zang
Carsten Eickhoff
Ellie Pavlick
45
1
0
11 Oct 2024
M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought
G. Kumari
Kirtan Jain
Asif Ekbal
18
1
0
11 Oct 2024
DEPT: Decoupled Embeddings for Pre-training Language Models
Alex Iacob
Lorenzo Sani
Meghdad Kurmanji
William F. Shen
Xinchi Qiu
Dongqi Cai
Yan Gao
Nicholas D. Lane
VLM
115
0
0
07 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
32
1
0
06 Oct 2024
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
Akhilesh Aravapalli
Mounika Marreddy
S. Oota
R. Mamidi
Manish Gupta
34
0
0
03 Oct 2024
Concept Space Alignment in Multilingual LLMs
Qiwei Peng
Anders Søgaard
33
3
0
01 Oct 2024
PclGPT: A Large Language Model for Patronizing and Condescending Language Detection
Hongbo Wang
Mingda Li
Junyu Lu
Hebin Xia
Liang Yang
Bo Xu
Ruizhu Liu
Hongfei Lin
27
0
0
01 Oct 2024
How Transliterations Improve Crosslingual Alignment
Yihong Liu
Mingyang Wang
Amir Hossein Kargaran
Ayyoob Imani
Orgest Xhelili
Haotian Ye
Chunlan Ma
François Yvon
Hinrich Schütze
34
2
0
25 Sep 2024
Mitigating Semantic Leakage in Cross-lingual Embeddings via Orthogonality Constraint
Dayeon Ki
Cheonbok Park
H. Kim
FedML
31
0
0
24 Sep 2024
DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models
Sangyeon Cho
Jangyeong Jeon
Dongjoon Lee
Changhee Lee
Junyeong Kim
14
1
0
23 Sep 2024
Goldfish: Monolingual Language Models for 350 Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
36
4
0
19 Aug 2024
LogogramNLP: Comparing Visual and Textual Representations of Ancient Logographic Writing Systems for NLP
Danlu Chen
Freda Shi
Aditi Agarwal
Jacobo Myerston
Taylor Berg-Kirkpatrick
29
2
0
08 Aug 2024
DanModCap: Designing a Danmaku Moderation Tool for Video-Sharing Platforms that Leverages Impact Captions
Siying Hu
Huanchen Wang
Yu Zhang
Piaohong Wang
Zhicong Lu
16
0
0
05 Aug 2024
Investigating the Impact of Semi-Supervised Methods with Data Augmentation on Offensive Language Detection in Romanian Language
Elena Beatrice Nicola
Dumitru-Clementin Cercel
Florin-Catalin Pop
18
1
0
29 Jul 2024
FarSSiBERT: A Novel Transformer-based Model for Semantic Similarity Measurement of Persian Social Networks Informal Texts
Seyed Mojtaba Sadjadi
Zeinab Rajabi
Leila Rabiei
M. Moin
26
2
0
27 Jul 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
50
10
0
26 Jul 2024