Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1903.08855
Cited By
Linguistic Knowledge and Transferability of Contextual Representations
21 March 2019
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Linguistic Knowledge and Transferability of Contextual Representations"
50 / 454 papers shown
Title
Linguistic Interpretability of Transformer-based Language Models: a systematic review
Miguel López-Otal
Jorge Gracia
Jordi Bernad
Carlos Bobed
Lucía Pitarch-Ballesteros
Emma Anglés-Herrero
VLM
36
0
0
09 Apr 2025
Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models
Guy Kaplan
Michael Toker
Yuval Reif
Yonatan Belinkov
Roy Schwartz
DiffM
48
0
0
01 Apr 2025
Construction Identification and Disambiguation Using BERT: A Case Study of NPN
Wesley Scivetti
Nathan Schneider
44
0
0
24 Mar 2025
Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack
Murong Yue
Ziyu Yao
SILM
AAML
53
0
0
18 Mar 2025
High-entropy Advantage in Neural Networks' Generalizability
Entao Yang
X. Zhang
Yue Shang
Ge Zhang
AI4CE
58
0
0
17 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
R. Teo
T. Nguyen
MoE
55
2
0
14 Mar 2025
Evaluating Discourse Cohesion in Pre-trained Language Models
Jie He
Wanqiu Long
Deyi Xiong
ELM
55
2
0
08 Mar 2025
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
L. Arras
Bruno Puri
Patrick Kahardipraja
Sebastian Lapuschkin
Wojciech Samek
35
0
0
21 Feb 2025
Analyze the Neurons, not the Embeddings: Understanding When and Where LLM Representations Align with Humans
Masha Fedzechkina
Eleonora Gualdoni
Sinead Williamson
Katherine Metcalf
Skyler Seto
B. Theobald
38
1
0
20 Feb 2025
Improving Rule-based Reasoning in LLMs via Neurosymbolic Representations
Varun Dhanraj
Chris Eliasmith
LRM
45
0
0
31 Jan 2025
BERTopic for Topic Modeling of Hindi Short Texts: A Comparative Study
Atharva Mutsaddi
Anvi Jamkhande
Aryan Thakre
Yashodhara Haribhakta
21
0
0
08 Jan 2025
Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali
Sharad Duwal
Suraj Prasai
Suresh Manandhar
CLL
79
1
0
18 Dec 2024
Does Representation Matter? Exploring Intermediate Layers in Large Language Models
Oscar Skean
Md Rifat Arefin
Yann LeCun
Ravid Shwartz-Ziv
79
7
0
12 Dec 2024
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
64
0
0
27 Nov 2024
Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
DiffM
50
0
0
12 Nov 2024
From Tokens to Materials: Leveraging Language Models for Scientific Discovery
Yuwei Wan
Tong Xie
Nan Wu
Wenjie Zhang
Chunyu Kit
B. Hoex
16
0
0
21 Oct 2024
On the Use of Audio to Improve Dialogue Policies
Daniel Roncel
Federico Costa
Javier Hernando
21
0
0
17 Oct 2024
How much do contextualized representations encode long-range context?
Simeng Sun
Cheng-Ping Hsieh
39
0
0
16 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
23
1
0
15 Oct 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
Muhammad Jehanzeb Mirza
Mengjie Zhao
Zhuoyuan Mao
Sivan Doveh
Wei Lin
...
Yuki Mitsufuji
Horst Possegger
Rogerio Feris
Leonid Karlinsky
James Glass
VLM
76
1
0
08 Oct 2024
AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy
Rui Pan
Tuan Dung Nguyen
Hardik Arora
Alberto Accomazzi
Tirthankar Ghosal
Yuan-Sen Ting
24
1
0
29 Sep 2024
Norm of Mean Contextualized Embeddings Determines their Variance
Hiroaki Yamagiwa
Hidetoshi Shimodaira
23
0
0
17 Sep 2024
The representation landscape of few-shot learning and fine-tuning in large language models
Diego Doimo
Alessandro Serra
A. Ansuini
Alberto Cazzaniga
88
4
0
05 Sep 2024
A Law of Next-Token Prediction in Large Language Models
Hangfeng He
Weijie J. Su
27
5
0
24 Aug 2024
Latent Space Disentanglement in Diffusion Transformers Enables Zero-shot Fine-grained Semantic Editing
Zitao Shuai
Chenwei Wu
Zhengxu Tang
Bowen Song
Liyue Shen
33
0
0
23 Aug 2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
42
18
0
02 Aug 2024
Disentangling Dense Embeddings with Sparse Autoencoders
Charles OÑeill
Christine Ye
K. Iyer
John F. Wu
17
4
0
01 Aug 2024
DeepCodeProbe: Towards Understanding What Models Trained on Code Learn
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
38
1
0
11 Jul 2024
Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning
Chenyuan Wu
Gangwei Jiang
Defu Lian
CLL
18
0
0
18 Jun 2024
Self-Regulated Data-Free Knowledge Amalgamation for Text Classification
Prashanth Vijayaraghavan
Hongzhi Wang
Luyao Shi
Tyler Baldwin
David Beymer
Ehsan Degan
27
1
0
16 Jun 2024
What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages
Nadav Borenstein
Anej Svete
R. Chan
Josef Valvoda
Franz Nowak
Isabelle Augenstein
Eleanor Chodroff
Ryan Cotterell
40
11
0
06 Jun 2024
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
Liyi Zhang
Michael Y. Li
Thomas L. Griffiths
40
2
0
06 Jun 2024
Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERT
Patrick Krauss
Jannik Hösch
C. Metzner
Andreas K. Maier
Peter Uhrig
Achim Schilling
31
1
0
03 May 2024
What do Transformers Know about Government?
Jue Hou
Anisia Katinskaia
Lari Kotilainen
Sathianpong Trangcasanchai
Anh Vu
R. Yangarber
24
1
0
22 Apr 2024
More Room for Language: Investigating the Effect of Retrieval on Language Models
David Samuel
Lucas Georges Gabriel Charpentier
Sondre Wold
LRM
RALM
KELM
28
1
0
16 Apr 2024
Bridging Vision and Language Spaces with Assignment Prediction
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
VLM
29
6
0
15 Apr 2024
On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL
Yutong Shao
N. Nakashole
14
1
0
03 Apr 2024
Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models
Stephan Linzbach
Dimitar Dimitrov
Laura Kallmeyer
Kilian Evang
Hajira Jabeen
Stefan Dietze
KELM
31
0
0
02 Apr 2024
Towards Explainability in Legal Outcome Prediction Models
Josef Valvoda
Ryan Cotterell
ELM
AILaw
48
4
0
25 Mar 2024
A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark
Dongjun Jang
Sungjoo Byun
Hyopil Shin
17
1
0
25 Mar 2024
Are Human Conversations Special? A Large Language Model Perspective
Toshish Jawale
Chaitanya Animesh
Sekhar Vallath
Kartik Talamadupula
Larry Heck
24
2
0
08 Mar 2024
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Bingyan Liu
Chengyu Wang
Tingfeng Cao
Kui Jia
Jun Huang
DiffM
35
50
0
06 Mar 2024
Word Importance Explains How Prompts Affect Language Model Outputs
Stefan Hackmann
Haniyeh Mahmoudian
Mark Steadman
Michael Schmidt
AAML
156
5
0
05 Mar 2024
Topic Aware Probing: From Sentence Length Prediction to Idiom Identification how reliant are Neural Language Models on Topic?
Vasudevan Nedumpozhimana
John D. Kelleher
29
1
0
04 Mar 2024
DINER: Debiasing Aspect-based Sentiment Analysis with Multi-variable Causal Inference
Jialong Wu
Linhai Zhang
Deyu Zhou
Guoqiang Xu
CML
19
3
0
02 Mar 2024
Probing Multimodal Large Language Models for Global and Local Semantic Representations
Mingxu Tao
Quzhe Huang
Kun Xu
Liwei Chen
Yansong Feng
Dongyan Zhao
19
5
0
27 Feb 2024
What Do Language Models Hear? Probing for Auditory Representations in Language Models
Jerry Ngo
Yoon Kim
AuLLM
MILM
16
8
0
26 Feb 2024
Semantic change detection for Slovene language: a novel dataset and an approach based on optimal transport
Marko Pranjic
Kaja Dobrovoljc
Senja Pollak
Matej Martinc
28
2
0
26 Feb 2024
When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality
Brielen Madureira
Patrick Kahardipraja
David Schlangen
31
2
0
20 Feb 2024
Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers
Shuzhou Yuan
Ercong Nie
Bolei Ma
Michael Farber
32
3
0
18 Feb 2024
1
2
3
4
...
8
9
10
Next