v1v2v3 (latest)

What do Neural Machine Translation Models Learn about Morphology?

11 April 2017

Papers citing "What do Neural Machine Translation Models Learn about Morphology?"

50 / 251 papers shown

ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models

Emily Chang

Niyati Bafna

ELM

192

19 Oct 2025

Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing

169

23 Sep 2025

Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms

186

10 Sep 2025

Interpreting the Effects of Quantization on LLMs

Manpreet Singh

Hassan Sajjad

MQ MILM

463

22 Aug 2025

Probing Syntax in Large Language Models: Successes and Remaining Challenges

353

05 Aug 2025

On the Performance of Concept Probing: The Influence of the Data (Extended Version)

Manuel de Sousa Ribeiro

Afonso Leote

João Leite

296

24 Jul 2025

Large Language Models Encode Semantics and Alignment in Linearly Separable Representations

259

13 Jul 2025

SAEs Are Good for Steering -- If You Select the Right Features

498

26 May 2025

Designing and Contextualising Probes for African Languages

Wisdom Aduah

Francois Meyer

471

15 May 2025

Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation

420

13 May 2025

Signatures of human-like processing in Transformer forward passes

1.2K

18 Apr 2025

Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry

497

23 Mar 2025

MoLEx: Mixture of Layer Experts for Finetuning with Sparse UpcyclingInternational Conference on Learning Representations (ICLR), 2025

R. Teo

T. Nguyen

MoE

524

14 Mar 2025

AxBERT: An Interpretable Chinese Spelling Correction Method Driven by Associative Knowledge Network

Fanyu Wang

Hangyu Zhu

Zhenping Xie

259

04 Mar 2025

How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal RepresentationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Hyunji Lee

Danni Liu

Supriti Sinhamahapatra

Jan Niehues

565

21 Feb 2025

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of TransformersNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Anton Razzhigaev

Matvey Mikhalchuk

Temurbek Rahmatullaev

300

20 Feb 2025

The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

239

11 Feb 2025

How not to Stitch Representations to Measure Similarity: Task Loss Matching versus Direct MatchingAAAI Conference on Artificial Intelligence (AAAI), 2024

András Balogh

Márk Jelasity

316

15 Dec 2024

Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering

Rumi A. Allbert

James K. Wiles

Vlad Grankovsky

LLMSV AI4CE

440

10 Dec 2024

Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zheng Zhao

Yftah Ziser

Shay B. Cohen

262

25 Oct 2024

Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5

Thao Anh Dang

Limor Raviv

Lukas Galke

428

15 Oct 2024

Mechanistic?BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024

Naomi Saphra

Sarah Wiegreffe

AI4CE

327

07 Oct 2024

The representation landscape of few-shot learning and fine-tuning in large language modelsNeural Information Processing Systems (NeurIPS), 2024

446

05 Sep 2024

Learning Co-Speech Gesture Representations in Dialogue through Contrastive Learning: An Intrinsic EvaluationInternational Conference on Multimodal Interaction (ICMI), 2024

Aslı Özyürek

272

31 Aug 2024

The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation AnalysisComputational Linguistics (CL), 2024

...

606

02 Aug 2024

Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects

David Ifeoluwa Adelani

Daud Abolade

Noah A. Smith

Yulia Tsvetkov

424

27 Jun 2024

In Tree Structure Should Sentence Be Generated

Yaguang Li

Xin Chen

164

20 Jun 2024

Estimating Knowledge in Large Language Models Without Generating a Single TokenConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Daniela Gottesman

Mor Geva

301

18 Jun 2024

What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions

Liyi Zhang

Michael Y. Li

Thomas Griffiths

Theodore R. Sumers

Jian-Qiao Zhu

Thomas L. Griffiths

278

06 Jun 2024

InversionView: A General-Purpose Method for Reading Information from Neural Activations

398

27 May 2024

I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures

Timothee Mickus

Ananda Sreenidhi

Joseph Attieh

396

27 Apr 2024

Locating and Editing Factual Associations in Mamba

254

04 Apr 2024

Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization

282

02 Feb 2024

Deep de Finetti: Recovering Topic Distributions from Large Language Models

280

21 Dec 2023

INSPECT: Intrinsic and Systematic Probing Evaluation for Code TransformersIEEE Transactions on Software Engineering (TSE), 2023

Anjan Karmakar

Romain Robbes

258

08 Dec 2023

Multilingual Nonce Dependency Treebanks: Understanding how Language Models represent and process syntactic structureNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

349

13 Nov 2023

The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based ModelsFindings (Findings), 2023

378

10 Nov 2023

Unlearn What You Want to Forget: Efficient Unlearning for LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Jiaao Chen

Diyi Yang

497

238

31 Oct 2023

Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject NumberConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Sophie Hao

Tal Linzen

213

23 Oct 2023

Understanding the Inner Workings of Language Models Through Representation DissimilarityConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

256

23 Oct 2023

Disentangling the Linguistic Competence of Privacy-Preserving BERTBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023

Stefan Arnold

Nils Kemmerzell

Annika Schreiner

308

17 Oct 2023

Unsupervised Contrast-Consistent Ranking with Language ModelsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Niklas Stoehr

Pengxiang Cheng

Jing Wang

Daniel Preoţiuc-Pietro

Rajarshi Bhowmik

ALM

331

13 Sep 2023

Why do universal adversarial attacks work on large language models?: Geometry might be the answer

Finale Doshi-Velez

253

01 Sep 2023

Scaling up Discovery of Latent Concepts in Deep NLP ModelsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Majd Hawasly

Fahim Dalvi

Nadir Durrani

415

20 Aug 2023