Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1506.02004
Cited By
Sparse Overcomplete Word Vector Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2015
5 June 2015
Manaal Faruqui
Yulia Tsvetkov
Dani Yogatama
Chris Dyer
Noah A. Smith
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Sparse Overcomplete Word Vector Representations"
50 / 96 papers shown
SAGE: An Agentic Explainer Framework for Interpreting SAE Features in Language Models
Jiaojiao Han
Wujiang Xu
Mingyu Jin
Mengnan Du
LRM
148
2
0
25 Nov 2025
Analysis of Variational Sparse Autoencoders
Zachary Baker
Yuxiao Li
DRL
370
0
0
26 Sep 2025
CorrSteer: Generation-Time LLM Steering via Correlated Sparse Autoencoder Features
Seonglae Cho
Zekun Wu
Adriano Soares Koshiyama
LLMSV
375
0
0
18 Aug 2025
Dense SAE Latents Are Features, Not Bugs
Xiaoqing Sun
Alessandro Stolfo
Joshua Engels
Ben Wu
Senthooran Rajamanoharan
Mrinmaya Sachan
Max Tegmark
435
7
0
18 Jun 2025
Transferring Linear Features Across Language Models With Model Stitching
Alan Chen
Jack Merullo
Alessandro Stolfo
Ellie Pavlick
301
1
0
07 Jun 2025
BehaviorBox: Automated Discovery of Fine-Grained Performance Differences Between Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Lindia Tjuatja
Graham Neubig
306
5
0
02 Jun 2025
Geometry of Semantics in Next-Token Prediction: How Optimization Implicitly Organizes Linguistic Representations
Yize Zhao
Christos Thrampoulidis
331
0
0
13 May 2025
Disentangling Linguistic Features with Dimension-Wise Analysis of Vector Embeddings
Saniya Karwa
Navpreet Singh
CoGe
324
2
0
20 Apr 2025
Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry
Sai Sumedh R. Hindupur
Ekdeep Singh Lubana
Thomas Fel
Demba Ba
380
38
0
03 Mar 2025
Mind the Gap: Bridging the Divide Between AI Aspirations and the Reality of Autonomous Characterization
Grace Guinan
Addison Salvador
Michelle A. Smeaton
Andrew Glaws
Hilary Egan
Brian C. Wyatt
Babak Anasori
K. Fiedler
M. Olszta
Steven Spurgeon
383
9
0
25 Feb 2025
Dictionary Learning: The Complexity of Learning Sparse Superposed Features with Feedback
Akash Kumar
1.1K
0
0
08 Feb 2025
The Geometry of Tokens in Internal Representations of Large Language Models
Karthik Viswanathan
Yuri Gardinazzi
Giada Panerai
Alberto Cazzaniga
Matteo Biagetti
AIFin
620
15
0
17 Jan 2025
Refusal Behavior in Large Language Models: A Nonlinear Perspective
Fabian Hildebrandt
Andreas K. Maier
Patrick Krauss
A. Schilling
287
9
0
14 Jan 2025
The Geometry of Concepts: Sparse Autoencoder Feature Structure
Yuxiao Li
Eric J. Michaud
David D. Baek
Joshua Engels
Xiaoqing Sun
Max Tegmark
423
42
0
10 Oct 2024
Revisiting Cosine Similarity via Normalized ICA-transformed Embeddings
Hiroaki Yamagiwa
Momose Oyama
Hidetoshi Shimodaira
LLMSV
307
6
0
16 Jun 2024
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning
Neural Information Processing Systems (NeurIPS), 2024
Dan Braun
Jordan K. Taylor
Nicholas Goldowsky-Dill
Lee D. Sharkey
388
59
0
17 May 2024
Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control
Aleksandar Makelov
Georg Lange
Neel Nanda
412
66
0
14 May 2024
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)
Usha Bhalla
Alexander X. Oesterling
Suraj Srinivas
Flavio du Pin Calmon
Himabindu Lakkaraju
476
100
0
16 Feb 2024
EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings
Sung Hwan Mun
Mingrui Han
Canyeong Moon
Nam Soo Kim
285
1
0
11 Dec 2023
Measuring Feature Sparsity in Language Models
Mingyang Deng
Lucas Tao
Joe Benton
314
2
0
11 Oct 2023
DINE: Dimensional Interpretability of Node Embeddings
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023
Simone Piaggesi
Megha Khosla
Andre' Panisson
Avishek Anand
275
9
0
02 Oct 2023
Sparse Autoencoders Find Highly Interpretable Features in Language Models
International Conference on Learning Representations (ICLR), 2023
Hoagy Cunningham
Aidan Ewart
Logan Riggs
R. Huben
Lee Sharkey
MILM
765
987
0
15 Sep 2023
Interpretable Neural Embeddings with Sparse Self-Representation
Minxue Xia
Hao Zhu
MILM
223
0
0
25 Jun 2023
Discovering Universal Geometry in Embeddings with ICA
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hiroaki Yamagiwa
Momose Oyama
Hidetoshi Shimodaira
263
20
0
22 May 2023
SensePOLAR: Word sense aware interpretability for pre-trained contextual word embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jan Engler
Sandipan Sikdar
Marlene Lutz
M. Strohmaier
257
10
0
11 Jan 2023
Tsetlin Machine Embedding: Representing Words Using Logical Expressions
Findings (Findings), 2023
Bimal Bhattarai
Ole-Christoffer Granmo
Lei Jiao
Rohan Kumar Yadav
Jivitesh Sharma
NAI
270
21
0
02 Jan 2023
On the Explainability of Natural Language Processing Deep Models
ACM Computing Surveys (ACM CSUR), 2022
Julia El Zini
M. Awad
311
116
0
13 Oct 2022
Emergent organization of receptive fields in networks of excitatory and inhibitory neurons
Leon Lufkin
Ashish Puri
Ganlin Song
Xinyi Zhong
John D. Lafferty
279
1
0
26 May 2022
Simplicial Embeddings in Self-Supervised Learning and Downstream Classification
International Conference on Learning Representations (ICLR), 2022
Samuel Lavoie
Christos Tsirigotis
Max Schwarzer
Ankit Vani
Michael Noukhovitch
Kenji Kawaguchi
Rameswar Panda
SSL
307
26
0
01 Apr 2022
A Survey on Green Deep Learning
Jingjing Xu
Wangchunshu Zhou
Zhiyi Fu
Hao Zhou
Lei Li
VLM
507
102
0
08 Nov 2021
Interpretable contrastive word mover's embedding
Ruijie Jiang
J. Gouvea
Preetish Rath
David M. Hammer
Shuchin Aeron
284
2
0
01 Nov 2021
Neuron-level Interpretation of Deep NLP Models: A Survey
Transactions of the Association for Computational Linguistics (TACL), 2021
Hassan Sajjad
Nadir Durrani
Fahim Dalvi
MILM
AI4CE
440
101
0
30 Aug 2021
Biomedical Interpretable Entity Representations
Findings (Findings), 2021
Diego Garcia-Olano
Yasumasa Onoe
Ioana Baldini
Joydeep Ghosh
Byron C. Wallace
Kush R. Varshney
AI4CE
295
4
0
17 Jun 2021
Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Kyoung-Rok Jang
Junmo Kang
Giwon Hong
Sung-Hyon Myaeng
Joohee Park
Taewon Yoon
Heecheol Seo
272
22
0
15 Apr 2021
Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors
Workshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out (DEELIO), 2021
Zeyu Yun
Yubei Chen
Bruno A. Olshausen
Yann LeCun
348
115
0
29 Mar 2021
Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic Applications
AAAI Conference on Artificial Intelligence (AAAI), 2021
Haw-Shiuan Chang
Amol Agrawal
Andrew McCallum
284
9
0
29 Mar 2021
SEMIE: SEMantically Infused Embeddings with Enhanced Interpretability for Domain-specific Small Corpus
Rishabh Gupta
Rajesh N. Rao
110
0
0
21 Mar 2021
Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings
Findings (Findings), 2020
P. Prakash
Saurabh Kumar Shashidhar
Wenlong Zhao
Subendhu Rongali
Haidar Khan
Michael Kayser
226
5
0
10 Oct 2020
Learning Sparse Sentence Encoding without Supervision: An Exploration of Sparsity in Variational Autoencoders
Victor Prokhorov
Yingzhen Li
Ehsan Shareghi
Nigel Collier
SSL
DRL
175
1
0
25 Sep 2020
Compression of Deep Learning Models for Text: A Survey
ACM Transactions on Knowledge Discovery from Data (TKDD), 2020
Manish Gupta
Puneet Agrawal
VLM
MedIm
AI4CE
685
141
0
12 Aug 2020
Evaluating Sparse Interpretable Word Embeddings for Biomedical Domain
M. Samadi
Mohammad Sadegh Akhondzadeh
Sayed Jalal Zahabi
M. Manshaei
Zeinab Maleki
P. Adibi
209
1
0
11 May 2020
The Explanation Game: Towards Prediction Explainability through Sparse Communication
Marcos Vinícius Treviso
André F. T. Martins
FAtt
242
3
0
28 Apr 2020
Word Equations: Inherently Interpretable Sparse Word Embeddingsthrough Sparse Coding
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2020
Adly Templeton
353
7
0
08 Apr 2020
The Fluidity of Concept Representations in Human Brain Signals
E. Hendrikx
Lisa Beinborn
93
0
0
20 Feb 2020
The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word Embeddings
The Web Conference (WWW), 2020
Binny Mathew
Sandipan Sikdar
Florian Lemmerich
M. Strohmaier
283
39
0
27 Jan 2020
Shared task: Lexical semantic change detection in German (Student Project Report)
Adnan Ahmad
Kiflom Desta
Fabian Lang
Dominik Schlechtweg
226
4
0
21 Jan 2020
Analyzing Structures in the Semantic Vector Space: A Framework for Decomposing Word Embeddings
Andreas Hanselowski
Iryna Gurevych
202
2
0
17 Dec 2019
Improving Interpretability of Word Embeddings by Generating Definition and Usage
Expert systems with applications (ESWA), 2019
Haitong Zhang
Yongping Du
Jiaxin Sun
Qingxia Li
201
15
0
12 Dec 2019
RETRO: Relation Retrofitting For In-Database Machine Learning on Textual Data
International Conference on Extending Database Technology (EDBT), 2019
Michael Günther
Maik Thiele
Wolfgang Lehner
340
7
0
28 Nov 2019
Sparse associative memory based on contextual code learning for disambiguating word senses
M. R. S. Marques
Tales Marra
Deok-Hee Kim-Dufor
C. Berrou
233
1
0
14 Nov 2019
1
2
Next
Page 1 of 2