ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.08855
  4. Cited By
Linguistic Knowledge and Transferability of Contextual Representations
v1v2v3v4v5 (latest)

Linguistic Knowledge and Transferability of Contextual Representations

North American Chapter of the Association for Computational Linguistics (NAACL), 2019
21 March 2019
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
ArXiv (abs)PDFHTML

Papers citing "Linguistic Knowledge and Transferability of Contextual Representations"

50 / 478 papers shown
Gradient Descent with Provably Tuned Learning-rate Schedules
Gradient Descent with Provably Tuned Learning-rate Schedules
Dravyansh Sharma
230
0
0
04 Dec 2025
Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
Yi Yang
Haowen Li
Tianxiang Li
Boyu Cao
Xiaohan Zhang
L. Chen
Qi Liu
316
1
0
11 Nov 2025
On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
On the Analogy between Human Brain and LLMs: Spotting Key Neurons in Grammar Perception
Sanaz Saki Norouzi
Mohammad Masjedi
Pascal Hitzler
169
0
0
09 Nov 2025
SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens
SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens
Yinhan He
Wendy Zheng
Y. Zhu
Zaiyi Zheng
Lin Su
Sriram Vasudevan
Qi Guo
Liangjie Hong
Jundong Li
LRM
266
4
0
28 Oct 2025
Probing Neural Combinatorial Optimization Models
Probing Neural Combinatorial Optimization Models
Zhiqin Zhang
Yining Ma
Zhiguang Cao
Hoong Chuin Lau
148
2
0
25 Oct 2025
Do Prompts Reshape Representations? An Empirical Study of Prompting Effects on Embeddings
Do Prompts Reshape Representations? An Empirical Study of Prompting Effects on Embeddings
Cesar Gonzalez-Gutierrez
Dirk Hovy
194
0
0
22 Oct 2025
Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing
Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing
Rongzhi Zhang
Meghaj Tarte
Yuzhao Heng
Xiang Chen
Tong Yu
Lingkai Kong
Sudheer Chava
Chao Zhang
181
0
0
14 Oct 2025
Decoding Emotion in the Deep: A Systematic Study of How LLMs Represent, Retain, and Express Emotion
Decoding Emotion in the Deep: A Systematic Study of How LLMs Represent, Retain, and Express Emotion
Jingxiang Zhang
Lujia Zhong
291
6
0
05 Oct 2025
Learning to Look at the Other Side: A Semantic Probing Study of Word Embeddings in LLMs with Enabled Bidirectional Attention
Learning to Look at the Other Side: A Semantic Probing Study of Word Embeddings in LLMs with Enabled Bidirectional AttentionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhaoxin Feng
Jianfei Ma
Emmanuele Chersoni
Xiaojing Zhao
Xiaoyi Bao
206
3
0
02 Oct 2025
Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT
Beyond Token Probes: Hallucination Detection via Activation Tensors with ACT-ViT
Guy Bar-Shalom
Fabrizio Frasca
Yaniv Galron
Yftah Ziser
Haggai Maron
MLLM
226
3
0
30 Sep 2025
Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts
Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts
Hanwen Du
Yuxin Dong
Xia Ning
LRMAI4CE
251
5
0
30 Sep 2025
Investigating Multi-layer Representations for Dense Passage Retrieval
Investigating Multi-layer Representations for Dense Passage Retrieval
Zhongbin Xie
Thomas Lukasiewicz
175
1
0
28 Sep 2025
Dual-Space Smoothness for Robust and Balanced LLM Unlearning
Dual-Space Smoothness for Robust and Balanced LLM Unlearning
Han Yan
Zheyuan Liu
Meng Jiang
MUAAML
187
2
0
27 Sep 2025
Evaluating CxG Generalisation in LLMs via Construction-Based NLI Fine Tuning
Evaluating CxG Generalisation in LLMs via Construction-Based NLI Fine Tuning
Tom Mackintosh
Harish Tayyar Madabushi
C. Bonial
ALM
149
1
0
19 Sep 2025
STARE at the Structure: Steering ICL Exemplar Selection with Structural Alignment
STARE at the Structure: Steering ICL Exemplar Selection with Structural Alignment
Jiaqian Li
Qisheng Hu
Jing Li
Wenya Wang
167
1
0
28 Aug 2025
ILRe: Intermediate Layer Retrieval for Context Compression in Causal Language Models
ILRe: Intermediate Layer Retrieval for Context Compression in Causal Language Models
Manlai Liang
Mandi Liu
Jiangzhou Ji
Huaijun Li
Haobo Yang
Yaohan He
Jinlong Li
RALM
340
0
0
25 Aug 2025
How Does Controllability Emerge In Language Models During Pretraining?
How Does Controllability Emerge In Language Models During Pretraining?
Jianshu She
Xinyue Li
Eric Xing
Zhengzhong Liu
Qirong Ho
LLMSV
317
1
0
03 Aug 2025
Explainable Mapper: Charting LLM Embedding Spaces Using Perturbation-Based Explanation and Verification Agents
Explainable Mapper: Charting LLM Embedding Spaces Using Perturbation-Based Explanation and Verification Agents
Xinyuan Yan
Rita Sevastjanova
Sinie van der Ben
Mennatallah El-Assady
Bei Wang
308
3
0
24 Jul 2025
On the Performance of Concept Probing: The Influence of the Data (Extended Version)
On the Performance of Concept Probing: The Influence of the Data (Extended Version)
Manuel de Sousa Ribeiro
Afonso Leote
João Leite
297
1
0
24 Jul 2025
Concept Probing: Where to Find Human-Defined Concepts (Extended Version)
Concept Probing: Where to Find Human-Defined Concepts (Extended Version)
Manuel de Sousa Ribeiro
Afonso Leote
João Leite
308
1
0
24 Jul 2025
Large Language Models Encode Semantics and Alignment in Linearly Separable Representations
Large Language Models Encode Semantics and Alignment in Linearly Separable Representations
Baturay Saglam
Paul Kassianik
Blaine Nelson
Sajana Weerawardhena
Yaron Singer
Amin Karbasi
261
3
0
13 Jul 2025
Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs
Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs
Xiao Xu
L. Qin
Wanxiang Che
Min-Yen Kan
MoEVLM
336
1
0
13 Jun 2025
Superior Molecular Representations from Intermediate Encoder Layers
Superior Molecular Representations from Intermediate Encoder Layers
Luis Pinto
AI4CE
345
0
0
06 Jun 2025
Model Internal Sleuthing: Finding Lexical Identity and Inflectional Features in Modern Language Models
Model Internal Sleuthing: Finding Lexical Identity and Inflectional Features in Modern Language Models
Michael Li
Nishant Subramani
MILMKELM
381
1
0
02 Jun 2025
Domain Pre-training Impact on Representations
Domain Pre-training Impact on Representations
César González-Gutiérrez
A. Quattoni
330
0
0
30 May 2025
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
Hadi Askari
Shivanshu Gupta
Fei Wang
Anshuman Chhabra
Muhao Chen
TDI
486
8
0
27 May 2025
A Representation Level Analysis of NMT Model Robustness to Grammatical Errors
A Representation Level Analysis of NMT Model Robustness to Grammatical ErrorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Abderrahmane Issam
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
294
1
0
27 May 2025
SAEs Are Good for Steering -- If You Select the Right Features
SAEs Are Good for Steering -- If You Select the Right Features
Dana Arad
Aaron Mueller
Yonatan Belinkov
LLMSV
499
31
0
26 May 2025
Multi-Scale Probabilistic Generation Theory: A Unified Information-Theoretic Framework for Hierarchical Structure in Large Language Models
Multi-Scale Probabilistic Generation Theory: A Unified Information-Theoretic Framework for Hierarchical Structure in Large Language Models
Yukin Zhang
Qi Dong
376
0
0
23 May 2025
Linguistic Interpretability of Transformer-based Language Models: a systematic review
Linguistic Interpretability of Transformer-based Language Models: a systematic review
Miguel López-Otal
Jorge Gracia
Jordi Bernad
Carlos Bobed
Lucía Pitarch-Ballesteros
Emma Anglés-Herrero
VLM
482
9
0
09 Apr 2025
Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models
Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models
Guy Kaplan
Michael Toker
Yuval Reif
Yonatan Belinkov
Roy Schwartz
DiffM
509
3
0
01 Apr 2025
Construction Identification and Disambiguation Using BERT: A Case Study of NPN
Construction Identification and Disambiguation Using BERT: A Case Study of NPN
Wesley Scivetti
Nathan Schneider
370
2
0
24 Mar 2025
Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack
Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting AttackAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Murong Yue
Ziyu Yao
SILMAAML
401
0
0
18 Mar 2025
High-entropy Advantage in Neural Networks' Generalizability
High-entropy Advantage in Neural Networks' Generalizability
Entao Yang
Wei Wei
Yue Shang
Ge Zhang
AI4CE
492
2
0
17 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
MoLEx: Mixture of Layer Experts for Finetuning with Sparse UpcyclingInternational Conference on Learning Representations (ICLR), 2025
R. Teo
T. Nguyen
MoE
524
5
0
14 Mar 2025
Evaluating Discourse Cohesion in Pre-trained Language Models
Evaluating Discourse Cohesion in Pre-trained Language Models
Jie He
Wanqiu Long
Deyi Xiong
ELM
457
3
0
08 Mar 2025
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
L. Arras
Bruno Puri
Patrick Kahardipraja
Sebastian Lapuschkin
Wojciech Samek
400
7
0
21 Feb 2025
ExpertLens: Activation steering features are highly interpretable
ExpertLens: Activation steering features are highly interpretable
Masha Fedzechkina
Eleonora Gualdoni
Sinead Williamson
Katherine Metcalf
Skyler Seto
B. Theobald
529
1
0
20 Feb 2025
BERTopic for Topic Modeling of Hindi Short Texts: A Comparative Study
BERTopic for Topic Modeling of Hindi Short Texts: A Comparative Study
Atharva Mutsaddi
Anvi Jamkhande
Aryan Thakre
Yashodhara Haribhakta
259
8
0
08 Jan 2025
Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation
  on Nepali
Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali
Sharad Duwal
Suraj Prasai
Suresh Manandhar
CLL
358
4
0
18 Dec 2024
Does Representation Matter? Exploring Intermediate Layers in Large
  Language Models
Does Representation Matter? Exploring Intermediate Layers in Large Language Models
Oscar Skean
Md Rifat Arefin
Yann LeCun
Ravid Shwartz-Ziv
371
28
0
12 Dec 2024
Can bidirectional encoder become the ultimate winner for downstream
  applications of foundation models?
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
371
1
0
27 Nov 2024
Latent Space Disentanglement in Diffusion Transformers Enables Precise
  Zero-shot Semantic Editing
Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
DiffM
344
1
0
12 Nov 2024
From Tokens to Materials: Leveraging Language Models for Scientific
  Discovery
From Tokens to Materials: Leveraging Language Models for Scientific Discovery
Yuwei Wan
Tong Xie
Nan Wu
Wenjie Zhang
Chunyu Kit
B. Hoex
294
3
0
21 Oct 2024
On the Use of Audio to Improve Dialogue Policies
On the Use of Audio to Improve Dialogue PoliciesIberSPEECH Conference (IberSPEECH), 2024
Daniel Roncel
Federico Costa
Javier Hernando
272
1
0
17 Oct 2024
How much do contextualized representations encode long-range context?
How much do contextualized representations encode long-range context?North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Simeng Sun
Cheng-Ping Hsieh
465
0
0
16 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A
  Comparative Analysis of mT5 and ByT5
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
428
12
0
15 Oct 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
Muhammad Jehanzeb Mirza
Mengjie Zhao
Zhuoyuan Mao
Sivan Doveh
Wei Lin
...
Yuki Mitsufuji
Horst Possegger
Rogerio Feris
Leonid Karlinsky
James Glass
VLM
851
4
0
08 Oct 2024
AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs
  for Astronomy
AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy
Boyao Wang
Tuan Dung Nguyen
Hardik Arora
Alberto Accomazzi
Tirthankar Ghosal
Yuan-Sen Ting
319
6
0
29 Sep 2024
Norm of Mean Contextualized Embeddings Determines their Variance
Norm of Mean Contextualized Embeddings Determines their VarianceInternational Conference on Computational Linguistics (COLING), 2024
Hiroaki Yamagiwa
Hidetoshi Shimodaira
311
0
0
17 Sep 2024
1234...8910
Next
Page 1 of 10