Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.03471
Cited By
What do Neural Machine Translation Models Learn about Morphology?
11 April 2017
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
Re-assign community
ArXiv
PDF
HTML
Papers citing
"What do Neural Machine Translation Models Learn about Morphology?"
50 / 242 papers shown
Title
Designing and Contextualising Probes for African Languages
Wisdom Aduah
Francois Meyer
20
0
0
15 May 2025
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Chiara Manna
Afra Alishahi
Frédéric Blain
Eva Vanmassenhove
22
0
0
13 May 2025
Linking forward-pass dynamics in Transformers and real-time human processing
Jennifer Hu
Michael A. Lepori
Michael Franke
AI4CE
113
0
0
18 Apr 2025
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Chi-Ning Chou
Hang Le
Yichen Wang
SueYeon Chung
48
0
0
23 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
R. Teo
T. Nguyen
MoE
55
2
0
14 Mar 2025
AxBERT: An Interpretable Chinese Spelling Correction Method Driven by Associative Knowledge Network
Fanyu Wang
Hangyu Zhu
Zhenping Xie
38
0
0
04 Mar 2025
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations
Hyunji Lee
Danni Liu
Supriti Sinhamahapatra
Jan Niehues
106
0
0
21 Feb 2025
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
Anton Razzhigaev
Matvey Mikhalchuk
Temurbek Rahmatullaev
Elizaveta Goncharova
Polina Druzhinina
Ivan V. Oseledets
Andrey Kuznetsov
57
2
0
20 Feb 2025
How not to Stitch Representations to Measure Similarity: Task Loss Matching versus Direct Matching
András Balogh
Márk Jelasity
72
0
0
15 Dec 2024
Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering
Rumi A. Allbert
James K. Wiles
Vlad Grankovsky
LLMSV
AI4CE
75
1
0
10 Dec 2024
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models
Zheng Zhao
Yftah Ziser
Shay B. Cohen
20
0
0
25 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
23
1
0
15 Oct 2024
Mechanistic?
Naomi Saphra
Sarah Wiegreffe
AI4CE
21
9
0
07 Oct 2024
The representation landscape of few-shot learning and fine-tuning in large language models
Diego Doimo
Alessandro Serra
A. Ansuini
Alberto Cazzaniga
88
4
0
05 Sep 2024
Learning Co-Speech Gesture Representations in Dialogue through Contrastive Learning: An Intrinsic Evaluation
E. Ghaleb
Bulat Khaertdinov
Wim Pouw
Marlou Rasenberg
Judith Holler
Aslı Özyürek
Raquel Fernández
SSL
25
1
0
31 Aug 2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
42
18
0
02 Aug 2024
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Orevaoghene Ahia
Anuoluwapo Aremu
Diana Abagyan
Hila Gonen
David Ifeoluwa Adelani
Daud Abolade
Noah A. Smith
Yulia Tsvetkov
59
4
0
27 Jun 2024
In Tree Structure Should Sentence Be Generated
Yaguang Li
Xin Chen
18
0
0
20 Jun 2024
Estimating Knowledge in Large Language Models Without Generating a Single Token
Daniela Gottesman
Mor Geva
37
10
0
18 Jun 2024
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
Liyi Zhang
Michael Y. Li
Thomas L. Griffiths
48
2
0
06 Jun 2024
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
26
3
0
27 May 2024
I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures
Timothee Mickus
Raúl Vázquez
Joseph Attieh
19
0
0
27 Apr 2024
Locating and Editing Factual Associations in Mamba
Arnab Sen Sharma
David Atkinson
David Bau
KELM
68
28
0
04 Apr 2024
Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization
Andreas Waldis
Yufang Hou
Iryna Gurevych
ELM
24
7
0
02 Feb 2024
Deep de Finetti: Recovering Topic Distributions from Large Language Models
Liyi Zhang
R. Thomas McCoy
T. Sumers
Jian-Qiao Zhu
Thomas L. Griffiths
BDL
42
6
0
21 Dec 2023
INSPECT: Intrinsic and Systematic Probing Evaluation for Code Transformers
Anjan Karmakar
Romain Robbes
22
4
0
08 Dec 2023
Multilingual Nonce Dependency Treebanks: Understanding how Language Models represent and process syntactic structure
David Arps
Laura Kallmeyer
Younes Samih
Hassan Sajjad
19
1
0
13 Nov 2023
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models
Anton Razzhigaev
Matvey Mikhalchuk
Elizaveta Goncharova
Ivan V. Oseledets
Denis Dimitrov
Andrey Kuznetsov
27
7
0
10 Nov 2023
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Jiaao Chen
Diyi Yang
MU
22
134
0
31 Oct 2023
Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject Number
Sophie Hao
Tal Linzen
14
5
0
23 Oct 2023
Understanding the Inner Workings of Language Models Through Representation Dissimilarity
Davis Brown
Charles Godfrey
N. Konz
Jonathan Tu
Henry Kvinge
32
7
0
23 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
25
0
0
17 Oct 2023
Unsupervised Contrast-Consistent Ranking with Language Models
Niklas Stoehr
Pengxiang Cheng
Jing Wang
Daniel Preotiuc-Pietro
Rajarshi Bhowmik
ALM
18
11
0
13 Sep 2023
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Varshini Subhash
Anna Bialas
Weiwei Pan
Finale Doshi-Velez
AAML
14
10
0
01 Sep 2023
Scaling up Discovery of Latent Concepts in Deep NLP Models
Majd Hawasly
Fahim Dalvi
Nadir Durrani
29
4
0
20 Aug 2023
Linearity of Relation Decoding in Transformer Language Models
Evan Hernandez
Arnab Sen Sharma
Tal Haklay
Kevin Meng
Martin Wattenberg
Jacob Andreas
Yonatan Belinkov
David Bau
KELM
17
82
0
17 Aug 2023
Morphosyntactic probing of multilingual BERT models
Judit Ács
Endre Hamerlik
Roy Schwartz
Noah A. Smith
András Kornai
27
9
0
09 Jun 2023
Assessing Word Importance Using Models Trained for Semantic Tasks
Dávid Javorský
Ondrej Bojar
François Yvon
17
2
0
31 May 2023
NeuroX Library for Neuron Analysis of Deep NLP Models
Fahim Dalvi
Hassan Sajjad
Nadir Durrani
25
9
0
26 May 2023
On convex decision regions in deep network representations
Lenka Tvetková
Thea Brusch
Teresa Scheidt
Fabian Martin Mager
R. Aagaard
Jonathan Foldager
T. S. Alstrøm
Lars Kai Hansen
26
2
0
26 May 2023
Can LLMs facilitate interpretation of pre-trained language models?
Basel Mousi
Nadir Durrani
Fahim Dalvi
36
12
0
22 May 2023
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation
Mutian He
Philip N. Garner
36
4
0
16 May 2023
How Good are Commercial Large Language Models on African Languages?
Jessica Ojo
Kelechi Ogueji
19
5
0
11 May 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
28
41
0
08 Apr 2023
Do Transformers Parse while Predicting the Masked Word?
Haoyu Zhao
A. Panigrahi
Rong Ge
Sanjeev Arora
74
31
0
14 Mar 2023
NxPlain: Web-based Tool for Discovery of Latent Concepts
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
Tamim Jaban
Musab Husaini
Ummar Abbas
13
1
0
06 Mar 2023
On the Role of Morphological Information for Contextual Lemmatization
Olia Toporkov
Rodrigo Agerri
23
8
0
01 Feb 2023
Evaluating Neuron Interpretation Methods of NLP Models
Yimin Fan
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
35
8
0
30 Jan 2023
The Role of Interactive Visualization in Explaining (Large) NLP Models: from Data to Inference
R. Brath
Daniel A. Keim
Johannes Knittel
Shimei Pan
Pia Sommerauer
Hendrik Strobelt
11
11
0
11 Jan 2023
Can Large Language Models Change User Preference Adversarially?
Varshini Subhash
AAML
24
8
0
05 Jan 2023
1
2
3
4
5
Next