Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks

23 January 2018

Yonatan Belinkov

Lluís Màrquez i Villodre

Papers citing "Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks"

50 / 105 papers shown

Different types of syntactic agreement recruit the same units within large language models

106

03 Dec 2025

RoSA: Enhancing Parameter-Efficient Fine-Tuning via RoPE-aware Selective Adaptation in Large Language Models

21 Nov 2025

From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators

171

27 Oct 2025

Towards Transparent AI: A Survey on Explainable Language Models

Avash Palikhe

Sribala Vidyadhari Chinta

185

25 Sep 2025

Dissecting Persona-Driven Reasoning in Language Models via Activation Patching

Ansh Poonia

Maeghal Jain

228

28 Jul 2025

A Representation Level Analysis of NMT Model Robustness to Grammatical Errors

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

262

27 May 2025

Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation

339

13 May 2025

MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling

International Conference on Learning Representations (ICLR), 2025

R. Teo

T. Nguyen

MoE

425

14 Mar 2025

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

446

24 Feb 2025

Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zheng Zhao

Yftah Ziser

Shay B. Cohen

205

25 Oct 2024

Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5

Thao Anh Dang

Limor Raviv

Lukas Galke

337

15 Oct 2024

Mechanistic?

BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2024

Naomi Saphra

Sarah Wiegreffe

AI4CE

263

07 Oct 2024

The representation landscape of few-shot learning and fine-tuning in large language models

Neural Information Processing Systems (NeurIPS), 2024

374

05 Sep 2024

Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERT

286

03 May 2024

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

Wenlin Yao

Ninghao Liu

Dong Yu

LRM

275

30 Sep 2023

Explainability for Large Language Models: A Survey

ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023

Haiyan Zhao

Hanjie Chen

Fan Yang

Ninghao Liu

500

710

02 Sep 2023

Operationalising Representation in Natural Language Processing

British Journal for the Philosophy of Science (BJPS), 2023

J. Harding

354

14 Jun 2023

Morphosyntactic probing of multilingual BERT models

Natural Language Engineering (NLE), 2023

201

09 Jun 2023

Can LLMs facilitate interpretation of pre-trained language models?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Basel Mousi

Nadir Durrani

Fahim Dalvi

303

22 May 2023

NxPlain: Web-based Tool for Discovery of Latent Concepts

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

205

06 Mar 2023

The geometry of hidden representations of large transformer models

Neural Information Processing Systems (NeurIPS), 2023

343

01 Feb 2023

Semantic Tagging with LSTM-CRF

Farshad Noravesh

206

28 Jan 2023

Event knowledge in large language models: the gap between the impossible and the unlikely

Cognitive Sciences (CS), 2022

507

02 Dec 2022

Prompting Language Models for Linguistic Structure

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Terra Blevins

Hila Gonen

Luke Zettlemoyer

LRM

250

15 Nov 2022

On the Transformation of Latent Space in Fine-Tuned NLP Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Nadir Durrani

Hassan Sajjad

Fahim Dalvi

Firoj Alam

268

23 Oct 2022

Understanding Domain Learning in Language Models Through Subpopulation Analysis

BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022

Zheng Zhao

Yftah Ziser

Shay B. Cohen

192

22 Oct 2022

Probing with Noise: Unpicking the Warp and Weft of Embeddings

BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022

Filip Klubicka

John D. Kelleher

192

21 Oct 2022