Rethinking embedding coupling in pre-trained language models

International Conference on Learning Representations (ICLR), 2020
24 October 2020
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder

Papers citing "Rethinking embedding coupling in pre-trained language models"

50 / 70 papers shown
Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets
Muhammad Muneeb
David B. Ascher
Ahsan Baidar Bakht
120
0
0
29 Nov 2025
PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models
Zaur Gouliev
Jennifer Waters
Chengqian Wang
132
1
0
12 Sep 2025
Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?
Khloud Al Jallad
Nada Ghneim
Ghaida Rebdawi
LM&MA
ELM
269
0
0
27 Jul 2025
POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization
Usman Naseem
Juan Ren
S. Anwar
Sarah Kohail
Rudy Alexandro Garrido Veliz
...
Adem Chanie Ali
Martin Semmann
Chris Biemann
Shamsuddeen Hassan Muhammad
Seid Muhie Yimam
247
0
0
27 May 2025
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Jesujoba Oluwadara Alabi
Michael A. Hedderich
David Ifeoluwa Adelani
Dietrich Klakow
554
9
0
27 May 2025
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages
Tadesse Destaw Belay
Dawit Ketema Gete
Abinew Ali Ayele
Olga Kolesnikova
Grigori Sidorov
Seid Muhie Yimam
218
3
0
24 Mar 2025
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text
Tadesse Destaw Belay
Israel Abebe Azime
Ibrahim Said Ahmad
David Ifeoluwa Adelani
Idris Abdulmumin
Abinew Ali Ayele
Shamsuddeen Hassan Muhammad
Seid Muhie Yimam
538
2
0
24 Mar 2025
LuxVeri at GenAI Detection Task 1: Inverse Perplexity Weighted Ensemble for Robust Detection of AI-Generated Text across English and Multilingual Contexts
Md Kamrujjaman Mobin
Md Saiful Islam
DeLMO
174
5
0
21 Jan 2025
Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Stefano Perrella
Lorenzo Proietti
Pere-Lluís Huguet Cabot
Edoardo Barba
Roberto Navigli
349
10
0
07 Oct 2024
Zero-Shot Tokenizer Transfer
Neural Information Processing Systems (NeurIPS), 2024
Benjamin Minixhofer
Edoardo Ponti
Ivan Vulić
VLM
375
28
0
13 May 2024
Understanding Cross-Lingual Alignment -- A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Katharina Hämmerl
Jindřich Libovický
Kangyang Luo
358
35
0
09 Apr 2024
Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains
Ashok Vardhan Makkuva
Marco Bondaschi
Adway Girish
Alliot Nagle
Martin Jaggi
Hyeji Kim
Michael C. Gastpar
OffRL
492
42
0
06 Feb 2024
An Empirical Analysis of Diversity in Argument Summarization
Michiel van der Meer
Piek T. J. M. Vossen
Catholijn M. Jonker
P. Murukannaiah
318
10
0
02 Feb 2024
Efficient slot labelling
Vladimir Vlasov
252
0
0
17 Jan 2024
Using fine-tuning and min lookahead beam search to improve Whisper
Andrea Do
Oscar Brown
Zhengjie Wang
Nikhil Mathew
Zixin Liu
Jawwad Ahmed
Cheng Yu
192
4
0
19 Sep 2023
Extending an Event-type Ontology: Adding Verbs and Classes Using Fine-tuned LLMs Suggestions
Law (LAW), 2023
Jana Straková
Eva Fucíková
Jan Hajic
Zdenka Uresová
157
6
0
03 Jun 2023
Distilling Efficient Language-Specific Models for Cross-Lingual Transfer
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Alan Ansell
Edoardo Ponti
Anna Korhonen
Ivan Vulić
249
6
0
02 Jun 2023
RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts
Computational Linguistics and Intellectual Technologies (CLIT), 2023
A. Golubev
Nicolay Rusnachenko
Natalia Loukachevitch
161
8
0
28 May 2023
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Cheikh M. Bamba Dione
David Ifeoluwa Adelani
Peter Nabende
Jesujoba Oluwadara Alabi
Thapelo Sindane
...
Seydou T. Traoré
C. Uchechukwu
Aliyu Yusuf
M. Abdullahi
Dietrich Klakow
311
21
0
23 May 2023
DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning
International Workshop on Semantic Evaluation (SemEval), 2023
Daniil Homskiy
Narek Maloyan
167
2
0
04 May 2023
ScandEval: A Benchmark for Scandinavian Natural Language Processing
Nordic Conference of Computational Linguistics (NODALIDA), 2023
Dan Saattrup Nielsen
ELM
278
20
0
03 Apr 2023
Hitachi at SemEval-2023 Task 3: Exploring Cross-lingual Multi-task Strategies for Genre and Framing Detection in Online News
International Workshop on Semantic Evaluation (SemEval), 2023
Yuta Koreeda
Ken-ichi Yokote
Hiroaki Ozaki
Atsuki Yamaguchi
Masaya Tsunokake
Yasuhiro Sogawa
225
3
0
03 Mar 2023
Enhancing Model Performance in Multilingual Information Retrieval with Comprehensive Data Engineering Techniques
Qi Zhang
Zijian Yang
Yi-Li Huang
Ze Chen
Zijian Cai
Kangxu Wang
Jiewen Zheng
Jiarong He
Jin Gao
LRM
VLM
191
1
0
14 Feb 2023
Leveraging Semantic Representations Combined with Contextual Word Representations for Recognizing Textual Entailment in Vietnamese
National Foundation for Science and Technology Development Conference on Information and Computer Science (TDICS), 2022
Quoc-Loc Duong
Duc-Vu Nguyen
Ngan Luu-Thuy Nguyen
164
2
0
01 Jan 2023
Cramming: Training a Language Model on a Single GPU in One Day
International Conference on Machine Learning (ICML), 2022
Jonas Geiping
Tom Goldstein
MoE
407
108
0
28 Dec 2022
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Ananya B. Sai
Vignesh Nagarajan
Tanay Dixit
Mary Dabre
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
419
37
0
20 Dec 2022
SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Wenda Xu
Xian Qian
Mingxuan Wang
Lei Li
William Yang Wang
179
14
0
19 Dec 2022
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
International Conference on Language Resources and Evaluation (LREC), 2022
Jianhao Yan
Jin Xu
Fandong Meng
Jie Zhou
Yue Zhang
388
4
0
08 Dec 2022
Word-Level Representation From Bytes For Language Modeling
Chul Lee
Qipeng Guo
Xipeng Qiu
233
1
0
23 Nov 2022
Prompting PaLM for Translation: Assessing Strategies and Performance
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
David Vilar
Markus Freitag
Colin Cherry
Jiaming Luo
Viresh Ratnakar
George F. Foster
LRM
398
220
0
16 Nov 2022
Dialect-robust Evaluation of Generated Text
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jiao Sun
Thibault Sellam
Elizabeth Clark
Tu Vu
Timothy Dozat
Dan Garrette
Aditya Siddhant
Jacob Eisenstein
Sebastian Gehrmann
292
26
0
02 Nov 2022
RuCoLA: Russian Corpus of Linguistic Acceptability
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
389
36
0
23 Oct 2022
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
David Ifeoluwa Adelani
Graham Neubig
Sebastian Ruder
Shruti Rijhwani
Michael Beukman
...
Idris Abdulmumin
Odunayo Ogundepo
Oreen Yousuf
Tatiana Moteu Ngoli
Dietrich Klakow
325
58
0
22 Oct 2022
HashFormers: Towards Vocabulary-independent Pre-trained Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Huiyin Xue
Nikolaos Aletras
205
5
0
14 Oct 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianxiang Sun
Junliang He
Xipeng Qiu
Xuanjing Huang
256
65
0
14 Oct 2022
Findings of the Shared Task on Multilingual Coreference Resolution
Zdeněk Žabokrtský
Miloslav Konopík
A. Nedoluzhko
Michal Novák
Maciej Ogrodniczuk
Martin Popel
Ondřej Pražák
Jakub Sido
Daniel Zeman
Yilun Zhu
LRM
171
25
0
16 Sep 2022
ÚFAL CorPipe at CRAC 2022: Effectivity of Multilingual Models for Coreference Resolution
Milan Straka
Jana Straková
LRM
167
14
0
15 Sep 2022
CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task
Conference on Machine Translation (WMT), 2022
Ricardo Rei
Marcos Vinícius Treviso
Nuno M. Guerreiro
Chrysoula Zerva
Ana C. Farinha
...
T. Glushkova
Duarte M. Alves
A. Lavie
Luísa Coheur
Marcely Zanon Boito
1.3K
224
0
13 Sep 2022
5q032e@SMM4H'22: Transformer-based classification of premise in tweets related to COVID-19
Vadim Porvatov
Natalia Semenova
197
2
0
08 Sep 2022
Predicting Query-Item Relationship using Adversarial Training and Robust Modeling Techniques
Min Seok Kim
150
0
0
23 Aug 2022
Sort by Structure: Language Model Ranking as Dependency Probing
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
262
3
0
10 Jun 2022
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages
Kabir Ahuja
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
LRM
236
20
0
12 May 2022
Lifting the Curse of Multilinguality by Pre-training Modular Transformers
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Jonas Pfeiffer
Naman Goyal
Xi Lin
Xian Li
James Cross
Sebastian Riedel
Mikel Artetxe
LRM
281
168
0
12 May 2022
Quality-Aware Decoding for Neural Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Patrick Fernandes
António Farinhas
Ricardo Rei
José G. C. de Souza
Perez Ogayo
Graham Neubig
Marcely Zanon Boito
363
62
0
02 May 2022
SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding
International Workshop on Semantic Evaluation (SemEval), 2022
Harish Tayyar Madabushi
Edward Gow-Smith
Marcos García
Carolina Scarton
M. Idiart
Aline Villavicencio
258
68
0
21 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
Transactions of the Association for Computational Linguistics (TACL), 2022
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
465
197
0
15 Apr 2022
Disentangling Uncertainty in Machine Translation Evaluation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Chrysoula Zerva
T. Glushkova
Ricardo Rei
André F.T. Martins
UD
UQCV
384
9
0
13 Apr 2022
Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning
International Conference on Computational Linguistics (COLING), 2022
Jesujoba Oluwadara Alabi
David Ifeoluwa Adelani
Marius Mosbach
Dietrich Klakow
320
182
0
13 Apr 2022
Towards Explainable Evaluation Metrics for Natural Language Generation
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei Zhao
Yang Gao
Steffen Eger
AAML
ELM
268
22
0
21 Mar 2022
Does Transliteration Help Multilingual Language Modeling?
Findings (Findings), 2022
Ibraheem Muhammad Moosa
Mahmud Elahi Akhter
Ashfia Binte Habib
355
17
0
29 Jan 2022