1704.03471
Cited By
What do Neural Machine Translation Models Learn about Morphology?
11 April 2017
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
Papers citing
"What do Neural Machine Translation Models Learn about Morphology?"
50 / 251 papers shown
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
Emily Chang
Niyati Bafna
ELM
192
0
0
19 Oct 2025
Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
Sabri Boughorbel
Fahim Dalvi
Nadir Durrani
Majd Hawasly
169
1
0
23 Sep 2025
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Minyeong Choe
Haehyun Cho
Changho Seo
Hyunil Kim
KELM
HILM
186
3
0
10 Sep 2025
Interpreting the Effects of Quantization on LLMs
Manpreet Singh
Hassan Sajjad
MQ
MILM
463
3
0
22 Aug 2025
Probing Syntax in Large Language Models: Successes and Remaining Challenges
Pablo Diego-Simón
Emmanuel Chemla
J. King
Yair Lakretz
353
2
0
05 Aug 2025
On the Performance of Concept Probing: The Influence of the Data (Extended Version)
Manuel de Sousa Ribeiro
Afonso Leote
João Leite
296
1
0
24 Jul 2025
Large Language Models Encode Semantics and Alignment in Linearly Separable Representations
Baturay Saglam
Paul Kassianik
Blaine Nelson
Sajana Weerawardhena
Yaron Singer
Amin Karbasi
259
3
0
13 Jul 2025
SAEs Are Good for Steering -- If You Select the Right Features
Dana Arad
Aaron Mueller
Yonatan Belinkov
LLMSV
498
29
0
26 May 2025
Designing and Contextualising Probes for African Languages
Wisdom Aduah
Francois Meyer
471
0
0
15 May 2025
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Chiara Manna
Afra Alishahi
Frédéric Blain
Eva Vanmassenhove
420
4
0
13 May 2025
Signatures of human-like processing in Transformer forward passes
Jennifer Hu
Michael A. Lepori
Michael Franke
AI4CE
1.2K
0
0
18 Apr 2025
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Chi-Ning Chou
Hang Le
Yichen Wang
SueYeon Chung
497
5
0
23 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
International Conference on Learning Representations (ICLR), 2025
R. Teo
T. Nguyen
MoE
524
5
0
14 Mar 2025
AxBERT: An Interpretable Chinese Spelling Correction Method Driven by Associative Knowledge Network
Fanyu Wang
Hangyu Zhu
Zhenping Xie
259
0
0
04 Mar 2025
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hyunji Lee
Danni Liu
Supriti Sinhamahapatra
Jan Niehues
565
7
0
21 Feb 2025
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Anton Razzhigaev
Matvey Mikhalchuk
Temurbek Rahmatullaev
Elizaveta Goncharova
Polina Druzhinina
Ivan Oseledets
Andrey Kuznetsov
300
10
0
20 Feb 2025
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Artem Kirsanov
Chi-Ning Chou
Kyunghyun Cho
SueYeon Chung
AI4CE
239
13
0
11 Feb 2025
How not to Stitch Representations to Measure Similarity: Task Loss Matching versus Direct Matching
AAAI Conference on Artificial Intelligence (AAAI), 2024
András Balogh
Márk Jelasity
316
3
0
15 Dec 2024
Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering
Rumi A. Allbert
James K. Wiles
Vlad Grankovsky
LLMSV
AI4CE
440
8
0
10 Dec 2024
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zheng Zhao
Yftah Ziser
Shay B. Cohen
262
9
0
25 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
428
12
0
15 Oct 2024
Mechanistic?
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Naomi Saphra
Sarah Wiegreffe
AI4CE
327
43
0
07 Oct 2024
The representation landscape of few-shot learning and fine-tuning in large language models
Neural Information Processing Systems (NeurIPS), 2024
Diego Doimo
Alessandro Serra
A. Ansuini
Alberto Cazzaniga
446
15
0
05 Sep 2024
Learning Co-Speech Gesture Representations in Dialogue through Contrastive Learning: An Intrinsic Evaluation
International Conference on Multimodal Interaction (ICMI), 2024
E. Ghaleb
Bulat Khaertdinov
Wim Pouw
Marlou Rasenberg
Judith Holler
Aslı Özyürek
Raquel Fernández
SSL
272
2
0
31 Aug 2024
The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation Analysis
Computational Linguistics (CL), 2024
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
606
34
0
02 Aug 2024
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Orevaoghene Ahia
Anuoluwapo Aremu
Diana Abagyan
Hila Gonen
David Ifeoluwa Adelani
Daud Abolade
Noah A. Smith
Yulia Tsvetkov
424
14
0
27 Jun 2024
In Tree Structure Should Sentence Be Generated
Yaguang Li
Xin Chen
164
0
0
20 Jun 2024
Estimating Knowledge in Large Language Models Without Generating a Single Token
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Daniela Gottesman
Mor Geva
301
36
0
18 Jun 2024
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
Liyi Zhang
Michael Y. Li
Thomas Griffiths
Theodore R. Sumers
Jian-Qiao Zhu
Thomas L. Griffiths
278
9
0
06 Jun 2024
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
398
9
0
27 May 2024
I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures
Timothee Mickus
Ananda Sreenidhi
Joseph Attieh
396
0
0
27 Apr 2024
Locating and Editing Factual Associations in Mamba
Arnab Sen Sharma
David Atkinson
David Bau
KELM
254
40
0
04 Apr 2024
Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization
Andreas Waldis
Yufang Hou
Iryna Gurevych
ELM
282
10
0
02 Feb 2024
Deep de Finetti: Recovering Topic Distributions from Large Language Models
Liyi Zhang
R. Thomas McCoy
T. Sumers
Jian-Qiao Zhu
Thomas Griffiths
BDL
280
8
0
21 Dec 2023
INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers
IEEE Transactions on Software Engineering (TSE), 2023
Anjan Karmakar
Romain Robbes
258
7
0
08 Dec 2023
Multilingual Nonce Dependency Treebanks: Understanding how Language Models represent and process syntactic structure
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
David Arps
Laura Kallmeyer
Younes Samih
Hassan Sajjad
349
5
0
13 Nov 2023
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models
Findings (Findings), 2023
Anton Razzhigaev
Matvey Mikhalchuk
Elizaveta Goncharova
Ivan Oseledets
Denis Dimitrov
Andrey Kuznetsov
378
26
0
10 Nov 2023
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiaao Chen
Diyi Yang
MU
497
238
0
31 Oct 2023
Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sophie Hao
Tal Linzen
213
10
0
23 Oct 2023
Understanding the Inner Workings of Language Models Through Representation Dissimilarity
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Davis Brown
Charles Godfrey
Nicholas Konz
Jonathan Tu
Henry Kvinge
256
13
0
23 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
308
0
0
17 Oct 2023
Unsupervised Contrast-Consistent Ranking with Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Niklas Stoehr
Pengxiang Cheng
Jing Wang
Daniel Preoţiuc-Pietro
Rajarshi Bhowmik
ALM
331
16
0
13 Sep 2023
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Varshini Subhash
Anna Bialas
Weiwei Pan
Finale Doshi-Velez
AAML
253
17
0
01 Sep 2023
Scaling up Discovery of Latent Concepts in Deep NLP Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Majd Hawasly
Fahim Dalvi
Nadir Durrani
415
9
0
20 Aug 2023
Linearity of Relation Decoding in Transformer Language Models
International Conference on Learning Representations (ICLR), 2023
Evan Hernandez
Arnab Sen Sharma
Tal Haklay
Kevin Meng
Martin Wattenberg
Jacob Andreas
Yonatan Belinkov
David Bau
KELM
434
152
0
17 Aug 2023
Morphosyntactic probing of multilingual BERT models
Natural Language Engineering (NLE), 2023
Judit Ács
Endre Hamerlik
Roy Schwartz
Noah A. Smith
András Kornai
258
19
0
09 Jun 2023
Assessing Word Importance Using Models Trained for Semantic Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Dávid Javorský
Ondrej Bojar
François Yvon
220
3
0
31 May 2023
NeuroX Library for Neuron Analysis of Deep NLP Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Fahim Dalvi
Hassan Sajjad
Nadir Durrani
312
16
0
26 May 2023
On convex decision regions in deep network representations
Nature Communications (Nat. Commun.), 2023
Lenka Tětková
Thea Brusch
Teresa Scheidt
Fabian Martin Mager
R. Aagaard
Jonathan Foldager
T. S. Alstrøm
Lars Kai Hansen
346
5
0
26 May 2023
Can LLMs facilitate interpretation of pre-trained language models?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Basel Mousi
Nadir Durrani
Fahim Dalvi
365
16
0
22 May 2023