Rethinking embedding coupling in pre-trained language models

International Conference on Learning Representations (ICLR), 2020

24 October 2020

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Rethinking embedding coupling in pre-trained language models"

50 / 70 papers shown

Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets

Muhammad Muneeb

David B. Ascher

Ahsan Baidar Bakht

120

29 Nov 2025

PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models

Zaur Gouliev

Jennifer Waters

Chengqian Wang

132

12 Sep 2025

Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?

269

27 Jul 2025

POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization

Rudy Alexandro Garrido Veliz

...

Adem Chanie Ali

Martin Semmann

Chris Biemann

Shamsuddeen Hassan Muhammad

Seid Muhie Yimam

247

27 May 2025

Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead

Jesujoba Oluwadara Alabi

Michael A. Hedderich

David Ifeoluwa Adelani

Dietrich Klakow

554

27 May 2025

Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages

218

24 Mar 2025

AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text

Tadesse Destaw Belay

Israel Abebe Azime

Ibrahim Said Ahmad

David Ifeoluwa Adelani

Idris Abdulmumin

Abinew Ali Ayele

Shamsuddeen Hassan Muhammad

Seid Muhie Yimam

538

24 Mar 2025

LuxVeri at GenAI Detection Task 1: Inverse Perplexity Weighted Ensemble for Robust Detection of AI-Generated Text across English and Multilingual Contexts

Md Kamrujjaman Mobin

Md Saiful Islam

DeLMO

174

21 Jan 2025

Beyond Correlation: Interpretable Evaluation of Machine Translation MetricsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Stefano Perrella

Lorenzo Proietti

Pere-Lluís Huguet Cabot

Edoardo Barba

Roberto Navigli

349

07 Oct 2024

Zero-Shot Tokenizer TransferNeural Information Processing Systems (NeurIPS), 2024

375

13 May 2024

Understanding Cross-Lingual Alignment -- A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Katharina Hämmerl

Jindvrich Libovický

Kangyang Luo

358

09 Apr 2024

Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains

Ashok Vardhan Makkuva

492

06 Feb 2024

An Empirical Analysis of Diversity in Argument Summarization

318

02 Feb 2024

Efficient slot labelling

Vladimir Vlasov

252

17 Jan 2024

Using fine-tuning and min lookahead beam search to improve Whisper

Andrea Do

Oscar Brown

Zhengjie Wang

Nikhil Mathew

Zixin Liu

Jawwad Ahmed

Cheng Yu

192

19 Sep 2023

Extending an Event-type Ontology: Adding Verbs and Classes Using Fine-tuned LLMs SuggestionsLaw (LAW), 2023

157

03 Jun 2023

Distilling Efficient Language-Specific Models for Cross-Lingual TransferAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

249

02 Jun 2023

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News TextsComputational Linguistics and Intellectual Technologies (CLIT), 2023

A. Golubev

Nicolay Rusnachenko

Natalia Loukachevitch

161

28 May 2023

MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Cheikh M. Bamba Dione

David Ifeoluwa Adelani

Peter Nabende

Jesujoba Oluwadara Alabi

...

311

23 May 2023

DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuningInternational Workshop on Semantic Evaluation (SemEval), 2023

Daniil Homskiy

Narek Maloyan

167

04 May 2023

ScandEval: A Benchmark for Scandinavian Natural Language ProcessingNordic Conference of Computational Linguistics (NODALIDA), 2023

Dan Saattrup Nielsen

ELM

278

03 Apr 2023

Hitachi at SemEval-2023 Task 3: Exploring Cross-lingual Multi-task Strategies for Genre and Framing Detection in Online NewsInternational Workshop on Semantic Evaluation (SemEval), 2023

225

03 Mar 2023

Enhancing Model Performance in Multilingual Information Retrieval with Comprehensive Data Engineering Techniques

191

14 Feb 2023

Leveraging Semantic Representations Combined with Contextual Word Representations for Recognizing Textual Entailment in VietnameseNational Foundation for Science and Technology Development Conference on Information and Computer Science (TDICS), 2022

Quoc-Loc Duong

Duc-Vu Nguyen

Ngan Luu-Thuy Nguyen

164

01 Jan 2023

Cramming: Training a Language Model on a Single GPU in One DayInternational Conference on Machine Learning (ICML), 2022

Jonas Geiping

Tom Goldstein

MoE

407

108

28 Dec 2022

IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Mitesh M. Khapra

419

20 Dec 2022

SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic MistakesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Lei Li

179

19 Dec 2022

DC-MBR: Distributional Cooling for Minimum Bayesian Risk DecodingInternational Conference on Language Resources and Evaluation (LREC), 2022

Jianhao Yan

Jin Xu

Fandong Meng

Jie Zhou

Yue Zhang

388

08 Dec 2022

Word-Level Representation From Bytes For Language Modeling

Chul Lee

Qipeng Guo

Xipeng Qiu

233

23 Nov 2022

Prompting PaLM for Translation: Assessing Strategies and PerformanceAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Colin Cherry

398

220

16 Nov 2022

Dialect-robust Evaluation of Generated TextAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

292

02 Nov 2022

RuCoLA: Russian Corpus of Linguistic AcceptabilityConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

389

23 Oct 2022

MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity RecognitionConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

David Ifeoluwa Adelani

Graham Neubig

...

325

22 Oct 2022

HashFormers: Towards Vocabulary-independent Pre-trained TransformersConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Huiyin Xue

Nikolaos Aletras

205

14 Oct 2022

BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Tianxiang Sun

Junliang He

Xipeng Qiu

Xuanjing Huang

256

14 Oct 2022

Findings of the Shared Task on Multilingual Coreference Resolution

171

16 Sep 2022

ÚFAL CorPipe at CRAC 2022: Effectivity of Multilingual Models for Coreference Resolution

Milan Straka

Jana Straková

LRM

167

15 Sep 2022

CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared TaskConference on Machine Translation (WMT), 2022

Ricardo Rei

Marcos Vinícius Treviso

...

1.3K

224

13 Sep 2022

5q032e@SMM4H'22: Transformer-based classification of premise in tweets related to COVID-19

Vadim Porvatov

Natalia Semenova

197

08 Sep 2022

Predicting Query-Item Relationship using Adversarial Training and Robust Modeling Techniques

Min Seok Kim

150

23 Aug 2022

Sort by Structure: Language Model Ranking as Dependency ProbingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

Max Müller-Eberstein

Rob van der Goot

Barbara Plank

262

10 Jun 2022

Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages

236

12 May 2022

Lifting the Curse of Multilinguality by Pre-training Modular TransformersNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

Xian Li

281

168

12 May 2022

Quality-Aware Decoding for Neural Machine TranslationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

António Farinhas

Graham Neubig

363

02 May 2022

SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence EmbeddingInternational Workshop on Semantic Evaluation (SemEval), 2022

Harish Tayyar Madabushi

258

21 Apr 2022

mGPT: Few-Shot Learners Go MultilingualTransactions of the Association for Computational Linguistics (TACL), 2022

Alena Fenogenova

465

197

15 Apr 2022

Disentangling Uncertainty in Machine Translation EvaluationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

384

13 Apr 2022

Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-TuningInternational Conference on Computational Linguistics (COLING), 2022

Jesujoba Oluwadara Alabi

David Ifeoluwa Adelani

Marius Mosbach

Dietrich Klakow

320

182

13 Apr 2022

Towards Explainable Evaluation Metrics for Natural Language Generation

Christoph Leiter

Piyawat Lertvittayakumjorn

268

21 Mar 2022

Does Transliteration Help Multilingual Language Modeling?Findings (Findings), 2022

Ibraheem Muhammad Moosa

Mahmud Elahi Akhter

Ashfia Binte Habib

355

29 Jan 2022