1704.03471
What do Neural Machine Translation Models Learn about Morphology?
11 April 2017
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
Papers citing
"What do Neural Machine Translation Models Learn about Morphology?"
50 / 251 papers shown
Title
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
Emily Chang
Niyati Bafna
ELM
95
0
0
19 Oct 2025
Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
Sabri Boughorbel
Fahim Dalvi
Nadir Durrani
Majd Hawasly
88
0
0
23 Sep 2025
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Minyeong Choe
Haehyun Cho
Changho Seo
Hyunil Kim
KELM
HILM
102
2
0
10 Sep 2025
Interpreting the Effects of Quantization on LLMs
Manpreet Singh
Hassan Sajjad
MQ
MILM
253
0
0
22 Aug 2025
Probing Syntax in Large Language Models: Successes and Remaining Challenges
Pablo Diego-Simón
Emmanuel Chemla
J. King
Yair Lakretz
190
1
0
05 Aug 2025
On the Performance of Concept Probing: The Influence of the Data (Extended Version)
Manuel de Sousa Ribeiro
Afonso Leote
João Leite
118
1
0
24 Jul 2025
Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces
Baturay Saglam
Paul Kassianik
Blaine Nelson
Sajana Weerawardhena
Yaron Singer
Amin Karbasi
107
2
0
13 Jul 2025
SAEs Are Good for Steering -- If You Select the Right Features
Dana Arad
Aaron Mueller
Yonatan Belinkov
LLMSV
171
18
0
26 May 2025
Designing and Contextualising Probes for African Languages
Wisdom Aduah
Francois Meyer
291
0
0
15 May 2025
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Chiara Manna
Afra Alishahi
Frédéric Blain
Eva Vanmassenhove
271
3
0
13 May 2025
Signatures of human-like processing in Transformer forward passes
Jennifer Hu
Michael A. Lepori
Michael Franke
AI4CE
923
0
0
18 Apr 2025
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Chi-Ning Chou
Hang Le
Yichen Wang
SueYeon Chung
324
1
0
23 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
International Conference on Learning Representations (ICLR), 2025
R. Teo
T. Nguyen
MoE
308
4
0
14 Mar 2025
AxBERT: An Interpretable Chinese Spelling Correction Method Driven by Associative Knowledge Network
Fanyu Wang
Hangyu Zhu
Zhenping Xie
185
0
0
04 Mar 2025
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hyunji Lee
Danni Liu
Supriti Sinhamahapatra
Jan Niehues
393
4
0
21 Feb 2025
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Anton Razzhigaev
Matvey Mikhalchuk
Temurbek Rahmatullaev
Elizaveta Goncharova
Polina Druzhinina
Ivan Oseledets
Andrey Kuznetsov
225
7
0
20 Feb 2025
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Artem Kirsanov
Chi-Ning Chou
Kyunghyun Cho
SueYeon Chung
AI4CE
149
10
0
11 Feb 2025
How not to Stitch Representations to Measure Similarity: Task Loss Matching versus Direct Matching
AAAI Conference on Artificial Intelligence (AAAI), 2024
András Balogh
Márk Jelasity
206
1
0
15 Dec 2024
Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering
Rumi A. Allbert
James K. Wiles
Vlad Grankovsky
LLMSV
AI4CE
347
3
0
10 Dec 2024
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zheng Zhao
Yftah Ziser
Shay B. Cohen
151
7
0
25 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
237
8
0
15 Oct 2024
Mechanistic?
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2024
Naomi Saphra
Sarah Wiegreffe
AI4CE
209
31
0
07 Oct 2024
The representation landscape of few-shot learning and fine-tuning in large language models
Neural Information Processing Systems (NeurIPS), 2024
Diego Doimo
Alessandro Serra
A. Ansuini
Alberto Cazzaniga
289
11
0
05 Sep 2024
Learning Co-Speech Gesture Representations in Dialogue through Contrastive Learning: An Intrinsic Evaluation
International Conference on Multimodal Interaction (ICMI), 2024
E. Ghaleb
Bulat Khaertdinov
Wim Pouw
Marlou Rasenberg
Judith Holler
Aslı Özyürek
Raquel Fernández
SSL
171
2
0
31 Aug 2024
The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation Analysis
Computational Linguistics (CL), 2024
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
454
34
0
02 Aug 2024
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Orevaoghene Ahia
Anuoluwapo Aremu
Diana Abagyan
Hila Gonen
David Ifeoluwa Adelani
Daud Abolade
Noah A. Smith
Yulia Tsvetkov
313
13
0
27 Jun 2024
In Tree Structure Should Sentence Be Generated
Yaguang Li
Xin Chen
77
0
0
20 Jun 2024
Estimating Knowledge in Large Language Models Without Generating a Single Token
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Daniela Gottesman
Mor Geva
207
27
0
18 Jun 2024
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
Liyi Zhang
Michael Y. Li
Thomas Griffiths
154
4
0
06 Jun 2024
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
307
9
0
27 May 2024
I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures
Timothee Mickus
Ananda Sreenidhi
Joseph Attieh
229
0
0
27 Apr 2024
Locating and Editing Factual Associations in Mamba
Arnab Sen Sharma
David Atkinson
David Bau
KELM
202
37
0
04 Apr 2024
Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization
Andreas Waldis
Yufang Hou
Iryna Gurevych
ELM
185
9
0
02 Feb 2024
Deep de Finetti: Recovering Topic Distributions from Large Language Models
Liyi Zhang
R. Thomas McCoy
T. Sumers
Jian-Qiao Zhu
Thomas Griffiths
BDL
189
8
0
21 Dec 2023
INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers
IEEE Transactions on Software Engineering (TSE), 2023
Anjan Karmakar
Romain Robbes
185
5
0
08 Dec 2023
Multilingual Nonce Dependency Treebanks: Understanding how Language Models represent and process syntactic structure
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
David Arps
Laura Kallmeyer
Younes Samih
Hassan Sajjad
221
3
0
13 Nov 2023
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models
Findings (Findings), 2023
Anton Razzhigaev
Matvey Mikhalchuk
Elizaveta Goncharova
Ivan Oseledets
Denis Dimitrov
Andrey Kuznetsov
269
21
0
10 Nov 2023
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiaao Chen
Diyi Yang
MU
278
210
0
31 Oct 2023
Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sophie Hao
Tal Linzen
134
6
0
23 Oct 2023
Understanding the Inner Workings of Language Models Through Representation Dissimilarity
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Davis Brown
Charles Godfrey
Nicholas Konz
Jonathan Tu
Henry Kvinge
179
12
0
23 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
216
0
0
17 Oct 2023
Unsupervised Contrast-Consistent Ranking with Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Niklas Stoehr
Pengxiang Cheng
Jing Wang
Daniel Preoţiuc-Pietro
Rajarshi Bhowmik
ALM
181
14
0
13 Sep 2023
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Varshini Subhash
Anna Bialas
Weiwei Pan
Finale Doshi-Velez
AAML
165
16
0
01 Sep 2023
Scaling up Discovery of Latent Concepts in Deep NLP Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Majd Hawasly
Fahim Dalvi
Nadir Durrani
255
6
0
20 Aug 2023
Linearity of Relation Decoding in Transformer Language Models
International Conference on Learning Representations (ICLR), 2023
Evan Hernandez
Arnab Sen Sharma
Tal Haklay
Kevin Meng
Martin Wattenberg
Jacob Andreas
Yonatan Belinkov
David Bau
KELM
303
130
0
17 Aug 2023
Morphosyntactic probing of multilingual BERT models
Natural Language Engineering (NLE), 2023
Judit Ács
Endre Hamerlik
Roy Schwartz
Noah A. Smith
András Kornai
159
15
0
09 Jun 2023
Assessing Word Importance Using Models Trained for Semantic Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Dávid Javorský
Ondrej Bojar
François Yvon
110
3
0
31 May 2023
NeuroX Library for Neuron Analysis of Deep NLP Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Fahim Dalvi
Hassan Sajjad
Nadir Durrani
205
14
0
26 May 2023
On convex decision regions in deep network representations
Nature Communications (Nat. Commun.), 2023
Lenka Tětková
Thea Brüsch
Teresa Scheidt
Fabian Martin Mager
R. Aagaard
Jonathan Foldager
T. S. Alstrøm
Lars Kai Hansen
262
4
0
26 May 2023
Can LLMs facilitate interpretation of pre-trained language models?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Basel Mousi
Nadir Durrani
Fahim Dalvi
260
15
0
22 May 2023