ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.03368
  4. Cited By
Designing and Interpreting Probes with Control Tasks

Designing and Interpreting Probes with Control Tasks

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
8 September 2019
John Hewitt
Abigail Z. Jacobs
ArXiv (abs)PDFHTML

Papers citing "Designing and Interpreting Probes with Control Tasks"

50 / 381 papers shown
Title
Emergent Stack Representations in Modeling Counter Languages Using Transformers
Emergent Stack Representations in Modeling Counter Languages Using Transformers
Utkarsh Tiwari
Aviral Gupta
Michael Hahn
886
1
0
03 Feb 2025
Toward Neurosymbolic Program Comprehension
Toward Neurosymbolic Program ComprehensionIEEE International Conference on Program Comprehension (ICPC), 2025
Alejandro Velasco
Aya Garryyeva
David Nader-Palacio
Antonio Mastropaolo
Denys Poshyvanyk
251
0
0
03 Feb 2025
Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution
Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference ResolutionWorkshop on Representation Learning for NLP (RepL4NLP), 2025
Tatiana Anikina
Arne Binder
David Harbecke
Stalin Varanasi
Leonhard Hennig
Simon Ostermann
Sebastian Möller
Josef van Genabith
355
0
0
31 Jan 2025
Unraveling Token Prediction Refinement and Identifying Essential Layers in Language Models
Unraveling Token Prediction Refinement and Identifying Essential Layers in Language Models
Jaturong Kongmanee
227
1
0
25 Jan 2025
Explainable and Interpretable Multimodal Large Language Models: A
  Comprehensive Survey
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey
Yunkai Dang
Kaichen Huang
Jiahao Huo
Yibo Yan
Shijie Huang
...
Kun Wang
Yong Liu
Jing Shao
Hui Xiong
Xuming Hu
LRM
409
48
0
03 Dec 2024
Gumbel Counterfactual Generation From Language Models
Gumbel Counterfactual Generation From Language ModelsInternational Conference on Learning Representations (ICLR), 2024
Shauli Ravfogel
Anej Svete
Vésteinn Snæbjarnarson
Robert Bamler
LRMCML
527
1
0
11 Nov 2024
On Memorization of Large Language Models in Logical Reasoning
On Memorization of Large Language Models in Logical Reasoning
Chulin Xie
Yangsibo Huang
Chiyuan Zhang
Da Yu
Xinyun Chen
Bill Yuchen Lin
Bo Li
Badih Ghazi
Ravi Kumar
LRM
424
92
0
30 Oct 2024
The Tug of War Within: Mitigating the Fairness-Privacy Conflicts in Large Language Models
The Tug of War Within: Mitigating the Fairness-Privacy Conflicts in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Chen Qian
Dongrui Liu
Jie Zhang
Yong Liu
Jing Shao
287
1
0
22 Oct 2024
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based
  Language Models
Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kushal Tatariya
Vladimir Araujo
Thomas Bauwens
Miryam de Lhoneux
VLM
216
1
0
15 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A
  Comparative Analysis of mT5 and ByT5
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
261
9
0
15 Oct 2024
Mechanistic?
Mechanistic?BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Naomi Saphra
Sarah Wiegreffe
AI4CE
213
32
0
07 Oct 2024
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
Akhilesh Aravapalli
Mounika Marreddy
R. Mamidi
R. Mamidi
Subba Reddy Oota
241
2
0
03 Oct 2024
Don't Stop Me Now: Embedding Based Scheduling for LLMs
Don't Stop Me Now: Embedding Based Scheduling for LLMsInternational Conference on Learning Representations (ICLR), 2024
Rana Shahout
Eran Malach
Chunwei Liu
Weifan Jiang
Minlan Yu
Michael Mitzenmacher
AI4TS
219
14
0
01 Oct 2024
Probing Omissions and Distortions in Transformer-based RDF-to-Text
  Models
Probing Omissions and Distortions in Transformer-based RDF-to-Text Models
J. Faille
Albert Gatt
Claire Gardent
203
0
0
25 Sep 2024
Probing Context Localization of Polysemous Words in Pre-trained Language
  Model Sub-Layers
Probing Context Localization of Polysemous Words in Pre-trained Language Model Sub-Layers
Soniya Vijayakumar
Josef van Genabith
Simon Ostermann
213
0
0
21 Sep 2024
Exploring syntactic information in sentence embeddings through
  multilingual subject-verb agreement
Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement
Vivi Nastase
Chunyang Jiang
Giuseppe Samo
Paola Merlo
146
6
0
10 Sep 2024
How Reliable are Causal Probing Interventions?
How Reliable are Causal Probing Interventions?
Marc E. Canby
Adam Davies
Chirag Rastogi
Anjali Narayan-Chen
267
0
0
28 Aug 2024
Multilevel Interpretability Of Artificial Neural Networks: Leveraging
  Framework And Methods From Neuroscience
Multilevel Interpretability Of Artificial Neural Networks: Leveraging Framework And Methods From Neuroscience
Zhonghao He
Jascha Achterberg
Katie Collins
Kevin K. Nejad
Danyal Akarca
...
Chole Li
Kai J. Sandbrink
Stephen Casper
Anna Ivanova
Grace W. Lindsay
AI4CE
277
5
0
22 Aug 2024
Training an NLP Scholar at a Small Liberal Arts College: A Backwards
  Designed Course Proposal
Training an NLP Scholar at a Small Liberal Arts College: A Backwards Designed Course Proposal
Grusha Prasad
Forrest Davis
152
1
0
11 Aug 2024
The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation Analysis
The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation AnalysisComputational Linguistics (CL), 2024
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
478
34
0
02 Aug 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
346
23
0
27 Jul 2024
Latent Causal Probing: A Formal Perspective on Probing with Causal
  Models of Data
Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data
Charles Jin
Martin Rinard
188
3
0
18 Jul 2024
Validating Mechanistic Interpretations: An Axiomatic Approach
Validating Mechanistic Interpretations: An Axiomatic Approach
Nils Palumbo
Ravi Mangal
Zifan Wang
Saranya Vijayakumar
Corina S. Pasareanu
Somesh Jha
281
1
0
18 Jul 2024
States Hidden in Hidden States: LLMs Emerge Discrete State
  Representations Implicitly
States Hidden in Hidden States: LLMs Emerge Discrete State Representations Implicitly
Junhao Chen
Shengding Hu
Zhiyuan Liu
Maosong Sun
LRM
159
9
0
16 Jul 2024
Are there identifiable structural parts in the sentence embedding whole?
Are there identifiable structural parts in the sentence embedding whole?
Vivi Nastase
Paola Merlo
166
6
0
24 Jun 2024
A Primal-Dual Framework for Transformers and Neural Networks
A Primal-Dual Framework for Transformers and Neural Networks
Tan M. Nguyen
Tam Nguyen
Nhat Ho
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
ViT
167
16
0
19 Jun 2024
Unveiling the Hidden Structure of Self-Attention via Kernel Principal
  Component Analysis
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
R. Teo
Tan M. Nguyen
321
7
0
19 Jun 2024
When Parts are Greater Than Sums: Individual LLM Components Can
  Outperform Full Models
When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ting-Yun Chang
Jesse Thomason
Robin Jia
392
6
0
19 Jun 2024
A Critical Study of What Code-LLMs (Do Not) Learn
A Critical Study of What Code-LLMs (Do Not) Learn
Abhinav Anand
Shweta Verma
Krishna Narasimhan
Mira Mezini
259
5
0
17 Jun 2024
Evaluating the World Model Implicit in a Generative Model
Evaluating the World Model Implicit in a Generative Model
Keyon Vafa
Justin Y. Chen
Jon M. Kleinberg
S. Mullainathan
Ashesh Rambachan
303
75
0
06 Jun 2024
Probing the Category of Verbal Aspect in Transformer Language Models
Probing the Category of Verbal Aspect in Transformer Language Models
Anisia Katinskaia
R. Yangarber
262
6
0
04 Jun 2024
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
Erik Jenner
Shreyas Kapur
Vasil Georgiev
Cameron Allen
Scott Emmons
Stuart J. Russell
305
20
0
02 Jun 2024
Deep Learning for Assessment of Oral Reading Fluency
Deep Learning for Assessment of Oral Reading Fluency
Mithilesh Vaidya
Binaya Kumar Sahoo
Preeti Rao
142
0
0
29 May 2024
On Fairness of Low-Rank Adaptation of Large Models
On Fairness of Low-Rank Adaptation of Large Models
Zhoujie Ding
Katja Filippova
Pura Peetathawatchai
Berivan Isik
Sanmi Koyejo
190
6
0
27 May 2024
Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Emily Cheng
Diego Doimo
Corentin Kervadec
Iuri Macocco
Jade Yu
Alessandro Laio
Marco Baroni
610
28
0
24 May 2024
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by
  Step
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
Yuntian Deng
Yejin Choi
Stuart M. Shieber
ReLMLRM
231
118
0
23 May 2024
Interpretability Needs a New Paradigm
Interpretability Needs a New Paradigm
Andreas Madsen
Himabindu Lakkaraju
Siva Reddy
Sarath Chandar
181
6
0
08 May 2024
A Philosophical Introduction to Language Models - Part II: The Way
  Forward
A Philosophical Introduction to Language Models - Part II: The Way Forward
Raphael Milliere
Cameron Buckner
LRM
238
24
0
06 May 2024
What does the Knowledge Neuron Thesis Have to do with Knowledge?
What does the Knowledge Neuron Thesis Have to do with Knowledge?International Conference on Learning Representations (ICLR), 2024
Jingcheng Niu
Andrew Liu
Zining Zhu
Gerald Penn
299
46
0
03 May 2024
Let's Think Dot by Dot: Hidden Computation in Transformer Language
  Models
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
Jacob Pfau
William Merrill
Samuel R. Bowman
LRM
252
127
0
24 Apr 2024
What do Transformers Know about Government?
What do Transformers Know about Government?
Jue Hou
Anisia Katinskaia
Lari Kotilainen
Sathianpong Trangcasanchai
Anh Vu
R. Yangarber
235
2
0
22 Apr 2024
Decomposing and Editing Predictions by Modeling Model Computation
Decomposing and Editing Predictions by Modeling Model Computation
Harshay Shah
Andrew Ilyas
Aleksander Madry
KELM
270
24
0
17 Apr 2024
MuLan: A Study of Fact Mutability in Language Models
MuLan: A Study of Fact Mutability in Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Constanza Fierro
Nicolas Garneau
Emanuele Bugliarello
Yova Kementchedjhieva
Anders Søgaard
KELMHILM
161
10
0
03 Apr 2024
Adjusting Interpretable Dimensions in Embedding Space with Human
  Judgments
Adjusting Interpretable Dimensions in Embedding Space with Human JudgmentsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Katrin Erk
Marianna Apidianaki
249
5
0
03 Apr 2024
On Linearizing Structured Data in Encoder-Decoder Language Models:
  Insights from Text-to-SQL
On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQLNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yutong Shao
N. Nakashole
218
3
0
03 Apr 2024
Where does In-context Translation Happen in Large Language Models
Where does In-context Translation Happen in Large Language Models
Suzanna Sia
David Mueller
Kevin Duh
LRM
207
1
0
07 Mar 2024
Topic Aware Probing: From Sentence Length Prediction to Idiom
  Identification how reliant are Neural Language Models on Topic?
Topic Aware Probing: From Sentence Length Prediction to Idiom Identification how reliant are Neural Language Models on Topic?
Vasudevan Nedumpozhimana
John D. Kelleher
171
2
0
04 Mar 2024
How do Large Language Models Handle Multilingualism?
How do Large Language Models Handle Multilingualism?
Yiran Zhao
Wenxuan Zhang
Guizhen Chen
Kenji Kawaguchi
Lidong Bing
LRM
334
127
0
29 Feb 2024
What Do Language Models Hear? Probing for Auditory Representations in
  Language Models
What Do Language Models Hear? Probing for Auditory Representations in Language Models
Jerry Ngo
Yoon Kim
AuLLMMILM
158
13
0
26 Feb 2024
How Large Language Models Encode Context Knowledge? A Layer-Wise Probing
  Study
How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study
Tianjie Ju
Weiwei Sun
Wei Du
Xinwei Yuan
Zhaochun Ren
Gongshen Liu
KELM
169
54
0
25 Feb 2024
Previous
12345678
Next