How multilingual is Multilingual BERT?

4 June 2019
Telmo Pires
Eva Schlinger
Dan Garrette
    LRM
    VLM

Papers citing "How multilingual is Multilingual BERT?"

50 / 655 papers shown
A Scalable Unsupervised Framework for multi-aspect labeling of Multilingual and Multi-Domain Review Data
Jiin Park
Misuk Kim
13
0
0
14 May 2025
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
Subrit Dikshit
Ritu Tiwari
Priyank Jain
16
0
0
14 May 2025
Investigating the Effect of Parallel Data in the Cross-Lingual Transfer for Vision-Language Encoders
Andrei-Alexandru Manea
Jindřich Libovický
VLM
52
0
0
30 Apr 2025
MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs
Jaap Jumelet
Leonie Weissweiler
Arianna Bisazza
38
2
0
03 Apr 2025
Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training
Zhijun Wang
Jiahuan Li
Hao Zhou
Rongxiang Weng
J. Wang
Xin Huang
Xue Han
Junlan Feng
Chao Deng
Shujian Huang
LRM
48
1
0
02 Apr 2025
Redefining technology for indigenous languages
Silvia Fernandez-Sabido
Laura Peniche-Sabido
31
0
0
02 Apr 2025
Advancing Sentiment Analysis in Tamil-English Code-Mixed Texts: Challenges and Transformer-Based Solutions
Mikhail Krasitskii
Olga Kolesnikova
Liliana Chanona Hernandez
Grigori Sidorov
Alexander Gelbukh
49
1
0
30 Mar 2025
Untangling the Influence of Typology, Data and Model Architecture on Ranking Transfer Languages for Cross-Lingual POS Tagging
Enora Rice
Ali Marashian
Hannah Haynie
K. Wense
Alexis Palmer
44
0
0
25 Mar 2025
High-Dimensional Interlingual Representations of Large Language Models
Bryan Wilie
Samuel Cahyawijaya
Junxian He
Pascale Fung
52
0
0
14 Mar 2025
Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models
Julian Spravil
Sebastian Houben
Sven Behnke
VLM
68
0
0
12 Mar 2025
CareerBERT: Matching Resumes to ESCO Jobs in a Shared Embedding Space for Generic Job Recommendations
Julian Rosenberger
Lukas Wolfrum
Sven Weinzierl
Mathias Kraus
Patrick Zschech
50
0
0
03 Mar 2025
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Masahiro Kaneko
Alham Fikri Aji
Timothy Baldwin
67
0
0
17 Feb 2025
Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation
Vera Neplenbroek
Arianna Bisazza
Raquel Fernández
100
0
0
17 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Yuexian Zou
83
2
0
10 Feb 2025
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models
Hieu Man
Nghia Trung Ngo
Viet Dac Lai
Ryan Rossi
Franck Dernoncourt
T. Nguyen
127
0
0
01 Jan 2025
Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation
Samin Mahdizadeh Sani
Pouya Sadeghi
Thuy-Trang Vu
Yadollah Yaghoobzadeh
Gholamreza Haffari
71
2
0
17 Dec 2024
Beyond Data Quantity: Key Factors Driving Performance in Multilingual Language Models
Sina Bagheri Nezhad
Ameeta Agrawal
Rhitabrat Pokharel
LRM
74
2
0
17 Dec 2024
The Evolution and Future Perspectives of Artificial Intelligence Generated Content
Chengzhang Zhu
Luobin Cui
Ying Tang
Jiacun Wang
89
1
0
02 Dec 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
68
2
0
28 Nov 2024
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
Thai-Binh Nguyen
Alexander Waibel
74
1
0
27 Nov 2024
One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Pengfei Cao
Yuheng Chen
Zhuoran Jin
Yubo Chen
Kang-Jun Liu
Jun Zhao
KELM
70
0
0
26 Nov 2024
Development of Pre-Trained Transformer-based Models for the Nepali Language
Prajwal Thapa
Jinu Nyachhyon
Mridul Sharma
Bal Krishna Bal
71
0
0
24 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
77
0
0
20 Nov 2024
Deploying Multi-task Online Server with Large Language Model
Yincen Qu
Chao Ma
Xiangying Dai
Hui Zhou
Yiting Wu
Hengyue Liu
26
0
0
06 Nov 2024
Investigating Idiomaticity in Word Representations
Wei He
Tiago Kramer Vieira
Marcos García
Carolina Scarton
M. Idiart
Aline Villavicencio
34
1
0
04 Nov 2024
Do LLMs Know to Respect Copyright Notice?
Jialiang Xu
Shenglan Li
Zhaozhuo Xu
Denghui Zhang
21
2
0
02 Nov 2024
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents
Ankan Mullick
Sombit Bose
Abhilash Nandy
G. Chaitanya
Pawan Goyal
24
0
0
29 Oct 2024
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Haoyu Song
W. Zhang
Kaiyan Zhang
Ting Liu
32
3
0
26 Oct 2024
Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch
Donglin Di
Weinan Zhang
Yue Zhang
Fanglin Wang
23
1
0
24 Oct 2024
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Guijin Son
Dongkeun Yoon
Juyoung Suk
Javier Aula-Blasco
Mano Aslan
Vu Trong Kim
Shayekh Bin Islam
Jaume Prats-Cristià
Lucía Tormo-Bañuelos
Seungone Kim
ELM
LRM
25
0
0
23 Oct 2024
Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis
Yiyi Chen
Qiongxiu Li
Russa Biswas
Johannes Bjerva
34
1
0
17 Oct 2024
MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
Boyang Xue
Hongru Wang
Rui Wang
Sheng Wang
Zezhong Wang
Yiming Du
Bin Liang
Kam-Fai Wong
29
0
0
16 Oct 2024
Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models
Hongchuan Zeng
Senyu Han
Lu Chen
Kai Yu
57
6
0
15 Oct 2024
Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios?
Zeno Vandenbulcke
Lukas Vermeire
Miryam de Lhoneux
26
0
0
14 Oct 2024
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling
Ruochen Zhang
Qinan Yu
Matianyu Zang
Carsten Eickhoff
Ellie Pavlick
45
1
0
11 Oct 2024
M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought
G. Kumari
Kirtan Jain
Asif Ekbal
18
1
0
11 Oct 2024
DEPT: Decoupled Embeddings for Pre-training Language Models
Alex Iacob
Lorenzo Sani
Meghdad Kurmanji
William F. Shen
Xinchi Qiu
Dongqi Cai
Yan Gao
Nicholas D. Lane
VLM
115
0
0
07 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
32
1
0
06 Oct 2024
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
Akhilesh Aravapalli
Mounika Marreddy
S. Oota
R. Mamidi
Manish Gupta
34
0
0
03 Oct 2024
Concept Space Alignment in Multilingual LLMs
Qiwei Peng
Anders Søgaard
33
3
0
01 Oct 2024
PclGPT: A Large Language Model for Patronizing and Condescending Language Detection
Hongbo Wang
Mingda Li
Junyu Lu
Hebin Xia
Liang Yang
Bo Xu
Ruizhu Liu
Hongfei Lin
27
0
0
01 Oct 2024
How Transliterations Improve Crosslingual Alignment
Yihong Liu
Mingyang Wang
Amir Hossein Kargaran
Ayyoob Imani
Orgest Xhelili
Haotian Ye
Chunlan Ma
François Yvon
Hinrich Schütze
34
2
0
25 Sep 2024
Mitigating Semantic Leakage in Cross-lingual Embeddings via Orthogonality Constraint
Dayeon Ki
Cheonbok Park
H. Kim
FedML
31
0
0
24 Sep 2024
DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models
Sangyeon Cho
Jangyeong Jeon
Dongjoon Lee
Changhee Lee
Junyeong Kim
14
1
0
23 Sep 2024
Goldfish: Monolingual Language Models for 350 Languages
Tyler A. Chang
Catherine Arnett
Zhuowen Tu
Benjamin Bergen
LRM
36
4
0
19 Aug 2024
LogogramNLP: Comparing Visual and Textual Representations of Ancient Logographic Writing Systems for NLP
Danlu Chen
Freda Shi
Aditi Agarwal
Jacobo Myerston
Taylor Berg-Kirkpatrick
29
2
0
08 Aug 2024
DanModCap: Designing a Danmaku Moderation Tool for Video-Sharing Platforms that Leverages Impact Captions
Siying Hu
Huanchen Wang
Yu Zhang
Piaohong Wang
Zhicong Lu
16
0
0
05 Aug 2024
Investigating the Impact of Semi-Supervised Methods with Data Augmentation on Offensive Language Detection in Romanian Language
Elena Beatrice Nicola
Dumitru-Clementin Cercel
Florin-Catalin Pop
18
1
0
29 Jul 2024
FarSSiBERT: A Novel Transformer-based Model for Semantic Similarity Measurement of Persian Social Networks Informal Texts
Seyed Mojtaba Sadjadi
Zeinab Rajabi
Leila Rabiei
M. Moin
26
2
0
27 Jul 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
50
10
0
26 Jul 2024