Sparse Overcomplete Word Vector Representations

Annual Meeting of the Association for Computational Linguistics (ACL), 2015

5 June 2015

Papers citing "Sparse Overcomplete Word Vector Representations"

50 / 96 papers shown

SAGE: An Agentic Explainer Framework for Interpreting SAE Features in Language Models

148

25 Nov 2025

Analysis of Variational Sparse Autoencoders

Zachary Baker

Yuxiao Li

DRL

370

26 Sep 2025

CorrSteer: Generation-Time LLM Steering via Correlated Sparse Autoencoder Features

Seonglae Cho

Zekun Wu

Adriano Soares Koshiyama

LLMSV

375

18 Aug 2025

Dense SAE Latents Are Features, Not Bugs

Senthooran Rajamanoharan

Mrinmaya Sachan

Max Tegmark

435

18 Jun 2025

Transferring Linear Features Across Language Models With Model Stitching

301

07 Jun 2025

BehaviorBox: Automated Discovery of Fine-Grained Performance Differences Between Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Lindia Tjuatja

Graham Neubig

306

02 Jun 2025

Geometry of Semantics in Next-Token Prediction: How Optimization Implicitly Organizes Linguistic Representations

Yize Zhao

Christos Thrampoulidis

331

13 May 2025

Disentangling Linguistic Features with Dimension-Wise Analysis of Vector Embeddings

Saniya Karwa

Navpreet Singh

CoGe

324

20 Apr 2025

Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry

Sai Sumedh R. Hindupur

Ekdeep Singh Lubana

Thomas Fel

Demba Ba

380

03 Mar 2025

Mind the Gap: Bridging the Divide Between AI Aspirations and the Reality of Autonomous Characterization

383

25 Feb 2025

Dictionary Learning: The Complexity of Learning Sparse Superposed Features with Feedback

Akash Kumar

1.1K

08 Feb 2025

The Geometry of Tokens in Internal Representations of Large Language Models

620

17 Jan 2025

Refusal Behavior in Large Language Models: A Nonlinear Perspective

287

14 Jan 2025

The Geometry of Concepts: Sparse Autoencoder Feature Structure

423

10 Oct 2024

Revisiting Cosine Similarity via Normalized ICA-transformed Embeddings

307

16 Jun 2024

Identifying Functionally Important Features with End-to-End Sparse Dictionary LearningNeural Information Processing Systems (NeurIPS), 2024

Dan Braun

Jordan K. Taylor

Nicholas Goldowsky-Dill

Lee D. Sharkey

388

17 May 2024

Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control

Aleksandar Makelov

Georg Lange

Neel Nanda

412

14 May 2024

Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)

Usha Bhalla

Alexander X. Oesterling

Suraj Srinivas

Flavio du Pin Calmon

Himabindu Lakkaraju

476

100

16 Feb 2024

EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings

285

11 Dec 2023

Measuring Feature Sparsity in Language Models

Mingyang Deng

Lucas Tao

Joe Benton

314

11 Oct 2023

DINE: Dimensional Interpretability of Node EmbeddingsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2023

275

02 Oct 2023

Sparse Autoencoders Find Highly Interpretable Features in Language ModelsInternational Conference on Learning Representations (ICLR), 2023

765

987

15 Sep 2023

Interpretable Neural Embeddings with Sparse Self-Representation

Minxue Xia

Hao Zhu

MILM

223

25 Jun 2023

Discovering Universal Geometry in Embeddings with ICAConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Hiroaki Yamagiwa

Momose Oyama

Hidetoshi Shimodaira

263

22 May 2023

SensePOLAR: Word sense aware interpretability for pre-trained contextual word embeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

257

11 Jan 2023

Tsetlin Machine Embedding: Representing Words Using Logical ExpressionsFindings (Findings), 2023

Bimal Bhattarai

Ole-Christoffer Granmo

270

02 Jan 2023

On the Explainability of Natural Language Processing Deep ModelsACM Computing Surveys (ACM CSUR), 2022

Julia El Zini

M. Awad

311

116

13 Oct 2022

Emergent organization of receptive fields in networks of excitatory and inhibitory neurons

279

26 May 2022

Simplicial Embeddings in Self-Supervised Learning and Downstream ClassificationInternational Conference on Learning Representations (ICLR), 2022

307

01 Apr 2022

A Survey on Green Deep Learning

Lei Li

507

102

08 Nov 2021

Interpretable contrastive word mover's embedding

284

01 Nov 2021

Neuron-level Interpretation of Deep NLP Models: A SurveyTransactions of the Association for Computational Linguistics (TACL), 2021

440

101

30 Aug 2021

Biomedical Interpretable Entity RepresentationsFindings (Findings), 2021

295

17 Jun 2021

Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text RetrievalConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

272

15 Apr 2021

Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factorsWorkshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out (DEELIO), 2021

348

115

29 Mar 2021

Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic ApplicationsAAAI Conference on Artificial Intelligence (AAAI), 2021

Haw-Shiuan Chang

Amol Agrawal

Andrew McCallum

284

29 Mar 2021

SEMIE: SEMantically Infused Embeddings with Enhanced Interpretability for Domain-specific Small Corpus

Rishabh Gupta

Rajesh N. Rao

110

21 Mar 2021

Compressing Transformer-Based Semantic Parsing Models using Compositional Code EmbeddingsFindings (Findings), 2020

P. Prakash

Saurabh Kumar Shashidhar

226

10 Oct 2020

Learning Sparse Sentence Encoding without Supervision: An Exploration of Sparsity in Variational Autoencoders

Ehsan Shareghi

175

25 Sep 2020

Compression of Deep Learning Models for Text: A SurveyACM Transactions on Knowledge Discovery from Data (TKDD), 2020

Manish Gupta

Puneet Agrawal

VLM MedIm AI4CE

685

141

12 Aug 2020

Evaluating Sparse Interpretable Word Embeddings for Biomedical Domain

M. Samadi

Mohammad Sadegh Akhondzadeh

209

11 May 2020

The Explanation Game: Towards Prediction Explainability through Sparse Communication

Marcos Vinícius Treviso

André F. T. Martins

FAtt

242

28 Apr 2020

Word Equations: Inherently Interpretable Sparse Word Embeddingsthrough Sparse CodingBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2020

Adly Templeton

353

08 Apr 2020

The Fluidity of Concept Representations in Human Brain Signals

E. Hendrikx

Lisa Beinborn

20 Feb 2020

The POLAR Framework: Polar Opposites Enable Interpretability of Pre-Trained Word EmbeddingsThe Web Conference (WWW), 2020

283

27 Jan 2020

Shared task: Lexical semantic change detection in German (Student Project Report)

226

21 Jan 2020

Analyzing Structures in the Semantic Vector Space: A Framework for Decomposing Word Embeddings

Andreas Hanselowski

Iryna Gurevych

202

17 Dec 2019

Improving Interpretability of Word Embeddings by Generating Definition and UsageExpert systems with applications (ESWA), 2019

201

12 Dec 2019

RETRO: Relation Retrofitting For In-Database Machine Learning on Textual DataInternational Conference on Extending Database Technology (EDBT), 2019

Michael Günther

Maik Thiele

Wolfgang Lehner

340

28 Nov 2019

Sparse associative memory based on contextual code learning for disambiguating word senses

233

14 Nov 2019