ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.03471
  4. Cited By
What do Neural Machine Translation Models Learn about Morphology?
v1v2v3 (latest)

What do Neural Machine Translation Models Learn about Morphology?

11 April 2017
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
ArXiv (abs)PDFHTML

Papers citing "What do Neural Machine Translation Models Learn about Morphology?"

50 / 251 papers shown
Title
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
Emily Chang
Niyati Bafna
ELM
95
0
0
19 Oct 2025
Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
Sabri Boughorbel
Fahim Dalvi
Nadir Durrani
Majd Hawasly
88
0
0
23 Sep 2025
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Do All Autoregressive Transformers Remember Facts the Same Way? A Cross-Architecture Analysis of Recall Mechanisms
Minyeong Choe
Haehyun Cho
Changho Seo
Hyunil Kim
KELMHILM
102
2
0
10 Sep 2025
Interpreting the Effects of Quantization on LLMs
Interpreting the Effects of Quantization on LLMs
Manpreet Singh
Hassan Sajjad
MQMILM
253
0
0
22 Aug 2025
Probing Syntax in Large Language Models: Successes and Remaining Challenges
Probing Syntax in Large Language Models: Successes and Remaining Challenges
Pablo Diego-Simón
Emmanuel Chemla
J. King
Yair Lakretz
190
1
0
05 Aug 2025
On the Performance of Concept Probing: The Influence of the Data (Extended Version)
On the Performance of Concept Probing: The Influence of the Data (Extended Version)
Manuel de Sousa Ribeiro
Afonso Leote
João Leite
118
1
0
24 Jul 2025
Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces
Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces
Baturay Saglam
Paul Kassianik
Blaine Nelson
Sajana Weerawardhena
Yaron Singer
Amin Karbasi
107
2
0
13 Jul 2025
SAEs Are Good for Steering -- If You Select the Right Features
SAEs Are Good for Steering -- If You Select the Right Features
Dana Arad
Aaron Mueller
Yonatan Belinkov
LLMSV
171
18
0
26 May 2025
Designing and Contextualising Probes for African Languages
Designing and Contextualising Probes for African Languages
Wisdom Aduah
Francois Meyer
291
0
0
15 May 2025
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Chiara Manna
Afra Alishahi
Frédéric Blain
Eva Vanmassenhove
271
3
0
13 May 2025
Signatures of human-like processing in Transformer forward passes
Signatures of human-like processing in Transformer forward passes
Jennifer Hu
Michael A. Lepori
Michael Franke
AI4CE
923
0
0
18 Apr 2025
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
Chi-Ning Chou
Hang Le
Yichen Wang
SueYeon Chung
324
1
0
23 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse UpcyclingInternational Conference on Learning Representations (ICLR), 2025
R. Teo
T. Nguyen
MoE
308
4
0
14 Mar 2025
AxBERT: An Interpretable Chinese Spelling Correction Method Driven by Associative Knowledge Network
Fanyu Wang
Hangyu Zhu
Zhenping Xie
185
0
0
04 Mar 2025
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal Representations
How do Multimodal Foundation Models Encode Text and Speech? An Analysis of Cross-Lingual and Cross-Modal RepresentationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hyunji Lee
Danni Liu
Supriti Sinhamahapatra
Jan Niehues
393
4
0
21 Feb 2025
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of TransformersNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Anton Razzhigaev
Matvey Mikhalchuk
Temurbek Rahmatullaev
Elizaveta Goncharova
Polina Druzhinina
Ivan Oseledets
Andrey Kuznetsov
225
7
0
20 Feb 2025
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Artem Kirsanov
Chi-Ning Chou
Kyunghyun Cho
SueYeon Chung
AI4CE
149
10
0
11 Feb 2025
How not to Stitch Representations to Measure Similarity: Task Loss
  Matching versus Direct Matching
How not to Stitch Representations to Measure Similarity: Task Loss Matching versus Direct MatchingAAAI Conference on Artificial Intelligence (AAAI), 2024
András Balogh
Márk Jelasity
206
1
0
15 Dec 2024
Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering
Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering
Rumi A. Allbert
James K. Wiles
Vlad Grankovsky
LLMSVAI4CE
347
3
0
10 Dec 2024
Layer by Layer: Uncovering Where Multi-Task Learning Happens in
  Instruction-Tuned Large Language Models
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zheng Zhao
Yftah Ziser
Shay B. Cohen
151
7
0
25 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A
  Comparative Analysis of mT5 and ByT5
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
237
8
0
15 Oct 2024
Mechanistic?
Mechanistic?BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Naomi Saphra
Sarah Wiegreffe
AI4CE
209
31
0
07 Oct 2024
The representation landscape of few-shot learning and fine-tuning in
  large language models
The representation landscape of few-shot learning and fine-tuning in large language modelsNeural Information Processing Systems (NeurIPS), 2024
Diego Doimo
Alessandro Serra
A. Ansuini
Alberto Cazzaniga
289
11
0
05 Sep 2024
Learning Co-Speech Gesture Representations in Dialogue through
  Contrastive Learning: An Intrinsic Evaluation
Learning Co-Speech Gesture Representations in Dialogue through Contrastive Learning: An Intrinsic EvaluationInternational Conference on Multimodal Interaction (ICMI), 2024
E. Ghaleb
Bulat Khaertdinov
Wim Pouw
Marlou Rasenberg
Judith Holler
Aslı Özyürek
Raquel Fernández
SSL
171
2
0
31 Aug 2024
The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation Analysis
The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation AnalysisComputational Linguistics (CL), 2024
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
454
34
0
02 Aug 2024
Voices Unheard: NLP Resources and Models for Yorùbá Regional
  Dialects
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Orevaoghene Ahia
Anuoluwapo Aremu
Diana Abagyan
Hila Gonen
David Ifeoluwa Adelani
Daud Abolade
Noah A. Smith
Yulia Tsvetkov
313
13
0
27 Jun 2024
In Tree Structure Should Sentence Be Generated
In Tree Structure Should Sentence Be Generated
Yaguang Li
Xin Chen
77
0
0
20 Jun 2024
Estimating Knowledge in Large Language Models Without Generating a
  Single Token
Estimating Knowledge in Large Language Models Without Generating a Single TokenConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Daniela Gottesman
Mor Geva
207
27
0
18 Jun 2024
What Should Embeddings Embed? Autoregressive Models Represent Latent
  Generating Distributions
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
Liyi Zhang
Michael Y. Li
Thomas Griffiths
154
4
0
06 Jun 2024
InversionView: A General-Purpose Method for Reading Information from
  Neural Activations
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
307
9
0
27 May 2024
I Have an Attention Bridge to Sell You: Generalization Capabilities of
  Modular Translation Architectures
I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures
Timothee Mickus
Ananda Sreenidhi
Joseph Attieh
229
0
0
27 Apr 2024
Locating and Editing Factual Associations in Mamba
Locating and Editing Factual Associations in Mamba
Arnab Sen Sharma
David Atkinson
David Bau
KELM
202
37
0
04 Apr 2024
Dive into the Chasm: Probing the Gap between In- and Cross-Topic
  Generalization
Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization
Andreas Waldis
Yufang Hou
Iryna Gurevych
ELM
185
9
0
02 Feb 2024
Deep de Finetti: Recovering Topic Distributions from Large Language
  Models
Deep de Finetti: Recovering Topic Distributions from Large Language Models
Liyi Zhang
R. Thomas McCoy
T. Sumers
Jian-Qiao Zhu
Thomas Griffiths
BDL
189
8
0
21 Dec 2023
INSPECT: Intrinsic and Systematic Probing Evaluation for Code
  Transformers
INSPECT: Intrinsic and Systematic Probing Evaluation for Code TransformersIEEE Transactions on Software Engineering (TSE), 2023
Anjan Karmakar
Romain Robbes
185
5
0
08 Dec 2023
Multilingual Nonce Dependency Treebanks: Understanding how Language
  Models represent and process syntactic structure
Multilingual Nonce Dependency Treebanks: Understanding how Language Models represent and process syntactic structureNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
David Arps
Laura Kallmeyer
Younes Samih
Hassan Sajjad
221
3
0
13 Nov 2023
The Shape of Learning: Anisotropy and Intrinsic Dimensions in
  Transformer-Based Models
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based ModelsFindings (Findings), 2023
Anton Razzhigaev
Matvey Mikhalchuk
Elizaveta Goncharova
Ivan Oseledets
Denis Dimitrov
Andrey Kuznetsov
269
21
0
10 Nov 2023
Unlearn What You Want to Forget: Efficient Unlearning for LLMs
Unlearn What You Want to Forget: Efficient Unlearning for LLMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiaao Chen
Diyi Yang
MU
278
210
0
31 Oct 2023
Verb Conjugation in Transformers Is Determined by Linear Encodings of
  Subject Number
Verb Conjugation in Transformers Is Determined by Linear Encodings of Subject NumberConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sophie Hao
Tal Linzen
134
6
0
23 Oct 2023
Understanding the Inner Workings of Language Models Through
  Representation Dissimilarity
Understanding the Inner Workings of Language Models Through Representation DissimilarityConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Davis Brown
Charles Godfrey
Nicholas Konz
Jonathan Tu
Henry Kvinge
179
12
0
23 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
Disentangling the Linguistic Competence of Privacy-Preserving BERTBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
216
0
0
17 Oct 2023
Unsupervised Contrast-Consistent Ranking with Language Models
Unsupervised Contrast-Consistent Ranking with Language ModelsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Niklas Stoehr
Pengxiang Cheng
Jing Wang
Daniel Preoţiuc-Pietro
Rajarshi Bhowmik
ALM
181
14
0
13 Sep 2023
Why do universal adversarial attacks work on large language models?:
  Geometry might be the answer
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Varshini Subhash
Anna Bialas
Weiwei Pan
Finale Doshi-Velez
AAML
165
16
0
01 Sep 2023
Scaling up Discovery of Latent Concepts in Deep NLP Models
Scaling up Discovery of Latent Concepts in Deep NLP ModelsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Majd Hawasly
Fahim Dalvi
Nadir Durrani
255
6
0
20 Aug 2023
Linearity of Relation Decoding in Transformer Language Models
Linearity of Relation Decoding in Transformer Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Evan Hernandez
Arnab Sen Sharma
Tal Haklay
Kevin Meng
Martin Wattenberg
Jacob Andreas
Yonatan Belinkov
David Bau
KELM
303
130
0
17 Aug 2023
Morphosyntactic probing of multilingual BERT models
Morphosyntactic probing of multilingual BERT modelsNatural Language Engineering (NLE), 2023
Judit Ács
Endre Hamerlik
Roy Schwartz
Noah A. Smith
András Kornai
159
15
0
09 Jun 2023
Assessing Word Importance Using Models Trained for Semantic Tasks
Assessing Word Importance Using Models Trained for Semantic TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Dávid Javorský
Ondrej Bojar
François Yvon
110
3
0
31 May 2023
NeuroX Library for Neuron Analysis of Deep NLP Models
NeuroX Library for Neuron Analysis of Deep NLP ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Fahim Dalvi
Hassan Sajjad
Nadir Durrani
205
14
0
26 May 2023
On convex decision regions in deep network representations
On convex decision regions in deep network representationsNature Communications (Nat. Commun.), 2023
Lenka Tvetková
Thea Brusch
Teresa Scheidt
Fabian Martin Mager
R. Aagaard
Jonathan Foldager
T. S. Alstrøm
Lars Kai Hansen
262
4
0
26 May 2023
Can LLMs facilitate interpretation of pre-trained language models?
Can LLMs facilitate interpretation of pre-trained language models?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Basel Mousi
Nadir Durrani
Fahim Dalvi
260
15
0
22 May 2023
123456
Next