ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.07772
  4. Cited By
Evaluating Layers of Representation in Neural Machine Translation on
  Part-of-Speech and Semantic Tagging Tasks

Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks

23 January 2018
Yonatan Belinkov
Lluís Màrquez i Villodre
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
James R. Glass
ArXiv (abs)PDFHTML

Papers citing "Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks"

50 / 105 papers shown
Different types of syntactic agreement recruit the same units within large language models
Different types of syntactic agreement recruit the same units within large language models
Daria Kryvosheieva
Andrea de Varda
Evelina Fedorenko
Greta Tuckute
106
0
0
03 Dec 2025
RoSA: Enhancing Parameter-Efficient Fine-Tuning via RoPE-aware Selective Adaptation in Large Language Models
RoSA: Enhancing Parameter-Efficient Fine-Tuning via RoPE-aware Selective Adaptation in Large Language Models
Dayan Pan
Jingyuan Wang
Yilong Zhou
Jiawei Cheng
Pengyue Jia
Xiangyu Zhao
61
0
0
21 Nov 2025
From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators
From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators
Lei Liu
Zhongyi Yu
Hong Wang
Huanshuo Dong
Haiyang Xin
Hongwei Zhao
B. Li
171
0
0
27 Oct 2025
Towards Transparent AI: A Survey on Explainable Language Models
Towards Transparent AI: A Survey on Explainable Language Models
Avash Palikhe
Sribala Vidyadhari Chinta
Zhipeng Yin
Rui Guo
Qiang Duan
Jie Yang
Wenbin Zhang
185
2
0
25 Sep 2025
Dissecting Persona-Driven Reasoning in Language Models via Activation Patching
Dissecting Persona-Driven Reasoning in Language Models via Activation Patching
Ansh Poonia
Maeghal Jain
228
0
0
28 Jul 2025
A Representation Level Analysis of NMT Model Robustness to Grammatical Errors
A Representation Level Analysis of NMT Model Robustness to Grammatical ErrorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Abderrahmane Issam
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
262
0
0
27 May 2025
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Are We Paying Attention to Her? Investigating Gender Disambiguation and Attention in Machine Translation
Chiara Manna
Afra Alishahi
Frédéric Blain
Eva Vanmassenhove
339
3
0
13 May 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse UpcyclingInternational Conference on Learning Representations (ICLR), 2025
R. Teo
T. Nguyen
MoE
425
5
0
14 Mar 2025
Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
Xuansheng Wu
Jiayi Yuan
Wenlin Yao
Xiaoming Zhai
Ninghao Liu
LLMSV
446
19
0
24 Feb 2025
Layer by Layer: Uncovering Where Multi-Task Learning Happens in
  Instruction-Tuned Large Language Models
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zheng Zhao
Yftah Ziser
Shay B. Cohen
205
7
0
25 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A
  Comparative Analysis of mT5 and ByT5
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
337
10
0
15 Oct 2024
Mechanistic?
Mechanistic?BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Naomi Saphra
Sarah Wiegreffe
AI4CE
263
34
0
07 Oct 2024
The representation landscape of few-shot learning and fine-tuning in
  large language models
The representation landscape of few-shot learning and fine-tuning in large language modelsNeural Information Processing Systems (NeurIPS), 2024
Diego Doimo
Alessandro Serra
A. Ansuini
Alberto Cazzaniga
374
13
0
05 Sep 2024
Analyzing Narrative Processing in Large Language Models (LLMs): Using
  GPT4 to test BERT
Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERT
Patrick Krauss
Jannik Hösch
C. Metzner
Andreas K. Maier
Peter Uhrig
Achim Schilling
286
3
0
03 May 2024
From Language Modeling to Instruction Following: Understanding the
  Behavior Shift in LLMs after Instruction Tuning
From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction TuningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Xuansheng Wu
Wenlin Yao
Jianshu Chen
Xiaoman Pan
Xiaoyang Wang
Ninghao Liu
Dong Yu
LRM
275
49
0
30 Sep 2023
Explainability for Large Language Models: A Survey
Explainability for Large Language Models: A SurveyACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jundong Li
LRM
500
710
0
02 Sep 2023
Operationalising Representation in Natural Language Processing
Operationalising Representation in Natural Language ProcessingBritish Journal for the Philosophy of Science (BJPS), 2023
J. Harding
354
17
0
14 Jun 2023
Morphosyntactic probing of multilingual BERT models
Morphosyntactic probing of multilingual BERT modelsNatural Language Engineering (NLE), 2023
Judit Ács
Endre Hamerlik
Roy Schwartz
Noah A. Smith
András Kornai
201
18
0
09 Jun 2023
Can LLMs facilitate interpretation of pre-trained language models?
Can LLMs facilitate interpretation of pre-trained language models?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Basel Mousi
Nadir Durrani
Fahim Dalvi
303
16
0
22 May 2023
NxPlain: Web-based Tool for Discovery of Latent Concepts
NxPlain: Web-based Tool for Discovery of Latent ConceptsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
Tamim Jaban
Musab Husaini
Ummar Abbas
205
1
0
06 Mar 2023
The geometry of hidden representations of large transformer models
The geometry of hidden representations of large transformer modelsNeural Information Processing Systems (NeurIPS), 2023
L. Valeriani
Diego Doimo
F. Cuturello
Alessandro Laio
A. Ansuini
Alberto Cazzaniga
MILM
343
84
0
01 Feb 2023
Semantic Tagging with LSTM-CRF
Semantic Tagging with LSTM-CRF
Farshad Noravesh
206
1
0
28 Jan 2023
Event knowledge in large language models: the gap between the impossible
  and the unlikely
Event knowledge in large language models: the gap between the impossible and the unlikelyCognitive Sciences (CS), 2022
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
507
87
0
02 Dec 2022
Prompting Language Models for Linguistic Structure
Prompting Language Models for Linguistic StructureAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Terra Blevins
Hila Gonen
Luke Zettlemoyer
LRM
250
52
0
15 Nov 2022
On the Transformation of Latent Space in Fine-Tuned NLP Models
On the Transformation of Latent Space in Fine-Tuned NLP ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Nadir Durrani
Hassan Sajjad
Fahim Dalvi
Firoj Alam
268
20
0
23 Oct 2022
Understanding Domain Learning in Language Models Through Subpopulation
  Analysis
Understanding Domain Learning in Language Models Through Subpopulation AnalysisBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Zheng Zhao
Yftah Ziser
Shay B. Cohen
192
7
0
22 Oct 2022
Probing with Noise: Unpicking the Warp and Weft of Embeddings
Probing with Noise: Unpicking the Warp and Weft of EmbeddingsBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Filip Klubicka
John D. Kelleher
192
4
0
21 Oct 2022
Log-linear Guardedness and its Implications
Log-linear Guardedness and its ImplicationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Shauli Ravfogel
Yoav Goldberg
Robert Bamler
723
2
0
18 Oct 2022
Probing via Prompting
Probing via PromptingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Jiaoda Li
Robert Bamler
Mrinmaya Sachan
261
14
0
04 Jul 2022
Is neural language acquisition similar to natural? A chronological
  probing study
Is neural language acquisition similar to natural? A chronological probing studyComputational Linguistics and Intellectual Technologies (CLIT), 2022
E. Voloshina
O. Serikov
Tatiana Shavrina
243
4
0
01 Jul 2022
Analyzing Encoded Concepts in Transformer Language Models
Analyzing Encoded Concepts in Transformer Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
Firoj Alam
A. Khan
Jia Xu
184
54
0
27 Jun 2022
Discovering Salient Neurons in Deep NLP Models
Discovering Salient Neurons in Deep NLP ModelsJournal of machine learning research (JMLR), 2022
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
KELMMILM
307
20
0
27 Jun 2022
Discovering Latent Concepts Learned in BERT
Discovering Latent Concepts Learned in BERTInternational Conference on Learning Representations (ICLR), 2022
Fahim Dalvi
A. Khan
Firoj Alam
Nadir Durrani
Jia Xu
Hassan Sajjad
SSL
163
68
0
15 May 2022
Naturalistic Causal Probing for Morpho-Syntax
Naturalistic Causal Probing for Morpho-SyntaxTransactions of the Association for Computational Linguistics (TACL), 2022
Afra Amini
Tiago Pimentel
Clara Meister
Robert Bamler
MILM
279
25
0
14 May 2022
Variation and generality in encoding of syntactic anomaly information in
  sentence embeddings
Variation and generality in encoding of syntactic anomaly information in sentence embeddingsBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2021
Qinxuan Wu
Allyson Ettinger
179
2
0
12 Nov 2021
Micromodels for Efficient, Explainable, and Reusable Systems: A Case
  Study on Mental Health
Micromodels for Efficient, Explainable, and Reusable Systems: A Case Study on Mental Health
Andrew Lee
Jonathan K. Kummerfeld
Lawrence C. An
Amélie Reymond
233
25
0
28 Sep 2021
A Bayesian Framework for Information-Theoretic Probing
A Bayesian Framework for Information-Theoretic ProbingConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Tiago Pimentel
Robert Bamler
230
25
0
08 Sep 2021
Neuron-level Interpretation of Deep NLP Models: A Survey
Neuron-level Interpretation of Deep NLP Models: A SurveyTransactions of the Association for Computational Linguistics (TACL), 2021
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
MILMAI4CE
320
97
0
30 Aug 2021
Meta-Learning to Compositionally Generalize
Meta-Learning to Compositionally GeneralizeAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Henry Conklin
Bailin Wang
Kenny Smith
Ivan Titov
OOD
235
81
0
08 Jun 2021
How Good Is NLP? A Sober Look at NLP Tasks through the Lens of Social
  Impact
How Good Is NLP? A Sober Look at NLP Tasks through the Lens of Social ImpactFindings (Findings), 2021
Zhijing Jin
Geeticka Chauhan
Brian Tse
Mrinmaya Sachan
Amélie Reymond
298
29
0
04 Jun 2021
How transfer learning impacts linguistic knowledge in deep NLP models?
How transfer learning impacts linguistic knowledge in deep NLP models?Findings (Findings), 2021
Nadir Durrani
Hassan Sajjad
Fahim Dalvi
157
53
0
31 May 2021
On Compositional Generalization of Neural Machine Translation
On Compositional Generalization of Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Yafu Li
Yongjing Yin
Yulong Chen
Yue Zhang
351
52
0
31 May 2021
Fine-grained Interpretation and Causation Analysis in Deep NLP Models
Fine-grained Interpretation and Causation Analysis in Deep NLP ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Hassan Sajjad
Narine Kokhlikyan
Fahim Dalvi
Nadir Durrani
MILM
326
8
0
17 May 2021
Bird's Eye: Probing for Linguistic Graph Structures with a Simple
  Information-Theoretic Approach
Bird's Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic ApproachAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Buse Giledereli
Mrinmaya Sachan
276
11
0
06 May 2021
Morph Call: Probing Morphosyntactic Content of Multilingual Transformers
Morph Call: Probing Morphosyntactic Content of Multilingual Transformers
Vladislav Mikhailov
O. Serikov
Ekaterina Artemova
259
10
0
26 Apr 2021
A multilabel approach to morphosyntactic probing
A multilabel approach to morphosyntactic probingConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Naomi Tachikawa Shapiro
Amandalynne Paullada
Shane Steinert-Threlkeld
225
12
0
17 Apr 2021
Effect of Post-processing on Contextualized Word Representations
Effect of Post-processing on Contextualized Word RepresentationsInternational Conference on Computational Linguistics (COLING), 2021
Hassan Sajjad
Firoj Alam
Fahim Dalvi
Nadir Durrani
178
12
0
15 Apr 2021
Local Interpretations for Explainable Natural Language Processing: A
  Survey
Local Interpretations for Explainable Natural Language Processing: A SurveyACM Computing Surveys (CSUR), 2021
Siwen Luo
Michal Guerquin
S. Han
Josiah Poon
MILM
417
64
0
20 Mar 2021
Hierarchical Transformer for Multilingual Machine Translation
Hierarchical Transformer for Multilingual Machine TranslationWorkshop on NLP for Similar Languages, Varieties and Dialects (VarDial), 2021
A. Khusainova
A. Khan
Adín Ramirez Rivera
V. Romanov
MoE
124
3
0
05 Mar 2021
An empirical analysis of phrase-based and neural machine translation
An empirical analysis of phrase-based and neural machine translation
Hamidreza Ghader
120
1
0
04 Mar 2021
123
Next
Page 1 of 3