A Primer in BERTology: What we know about how BERT works

Transactions of the Association for Computational Linguistics (TACL), 2020

27 February 2020

Papers citing "A Primer in BERTology: What we know about how BERT works"

50 / 780 papers shown

Search-R3: Unifying Reasoning and Embedding in Large Language Models

Yuntao Gui

James Cheng

KELM LRM

260

10 Apr 2026

What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models

...

159

03 Dec 2025

Layer Probing Improves Kinase Functional Prediction with Protein Language Models

Ajit Kumar

IndraPrakash Jha

29 Nov 2025

Standard Occupation Classifier -- A Natural Language Processing Approach

Sidharth Rony

Jack Patman

163

28 Nov 2025

Generation, Evaluation, and Explanation of Novelists' Styles with Single-Token Prompts

170

25 Nov 2025

A Hybrid Classical-Quantum Fine Tuned BERT for Text Classification

Abu Kaisar Mohammad Masum

Naveed Mahmud

M. H. Najafi

Sercan Aygün

141

21 Nov 2025

N-GLARE: An Non-Generative Latent Representation-Efficient LLM Safety Evaluator

191

18 Nov 2025

SPEAR-MM: Selective Parameter Evaluation and Restoration via Model Merging for Efficient Financial LLM Adaptation

278

11 Nov 2025

Catching Contamination Before Generation: Spectral Kill Switches for Agents

Valentin Noël

144

08 Nov 2025

Quantitative Bounds for Length Generalization in Transformers

Zachary Izzo

Eshaan Nichani

Jason D. Lee

300

30 Oct 2025

Enhancing Sentiment Classification with Machine Learning and Combinatorial Fusion

132

30 Oct 2025

Decomposition-Enhanced Training for Post-Hoc Attributions In Language Models

Sriram Balasubramaniam

421

29 Oct 2025

Forging GEMs: Advancing Greek NLP through Quality-Based Corpus Curation

Alexandra Apostolopoulou

221

22 Oct 2025

That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation

Mark Hasegawa-Johnson

Heng Ji

145

21 Oct 2025

Training-Free Spectral Fingerprints of Voice Processing in Transformers

Valentin Noël

216

21 Oct 2025

Attention Is All You Need for KV Cache in Diffusion LLMs

Quan Nguyen-Tri

Mukul Ranjan

Zhiqiang Shen

228

16 Oct 2025

CAST: Compositional Analysis via Spectral Tracking for Understanding Transformer Layer Functions

162

16 Oct 2025

Ethic-BERT: An Enhanced Deep Learning Model for Ethical and Non-Ethical Content Classification

Mahamodul Hasan Mahadi

123

14 Oct 2025

Fairness Metric Design Exploration in Multi-Domain Moral Sentiment Classification using Transformer-Based Models

Battemuulen Naranbat

Seyed Sahand Mohammadi Ziabari

Yousuf Nasser Al Husaini

Ali Mohammed Mansoor Alsahag

102

13 Oct 2025

Entropy Meets Importance: A Unified Head Importance-Entropy Score for Stable and Efficient Transformer Pruning

184

10 Oct 2025

Mapping Semantic & Syntactic Relationships with Geometric Rotation

Michael Freenor

Lauren Alvarez

LLMSV

238

10 Oct 2025

SkipSR: Faster Super Resolution with Token Skipping

279

09 Oct 2025

Reasoning for Hierarchical Text Classification: The Case of Patents

208

08 Oct 2025

Mechanistic Interpretability of Socio-Political Frames in Language Models

Hadi Asghari

Sami Nenno

128

04 Oct 2025

Allocation of Parameters in Transformers

197

04 Oct 2025

A Hierarchical Error Framework for Reliable Automated Coding in Communication Research: Applications to Health and Political Communication

Zhilong Zhao

Yindi Liu

AILaw

257

29 Sep 2025

Investigating Multi-layer Representations for Dense Passage Retrieval

Zhongbin Xie

Thomas Lukasiewicz

170

28 Sep 2025

Uncovering Graph Reasoning in Decoder-only Transformers with Circuit Tracing

188

24 Sep 2025

A Novel Differential Feature Learning for Effective Hallucination Detection and Classification

Wenkai Wang

Vincent C. S. Lee

Yizhen Zheng

130

20 Sep 2025

Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect

Alina Klerings

Jannik Brinkmann

Daniel Ruffinelli

Simone Paolo Ponzetto

LLMSV

208

15 Sep 2025

Documents Are People and Words Are Items: A Psychometric Approach to Textual Data with Contextual Embeddings

Jinsong Chen

10 Sep 2025

Mask-GCG: Are All Tokens in Adversarial Suffixes Necessary for Jailbreak Attacks?

214

08 Sep 2025

Comparative Analysis of Transformer Models in Disaster Tweet Classification for Public Safety

Sharif Noor Zisad

N. M. Istiak Chowdhury

Ragib Hasan

273

04 Sep 2025

Learning Mechanism Underlying NLP Pre-Training and Fine-Tuning

184

03 Sep 2025

Towards Fundamental Language Models: Does Linguistic Competence Scale with Model Size?

Jaime Collado-Montañez

L. Alfonso Ureña-López

Arturo Montejo-Ráez

HILM ELM LRM

140

02 Sep 2025

MindGuard: Intrinsic Decision Inspection for Securing LLM Agents Against Metadata Poisoning

415

28 Aug 2025

Transplant Then Regenerate: A New Paradigm for Text Data Augmentation

330

20 Aug 2025

Semantic Anchoring in Agentic Memory: Leveraging Linguistic Structures for Persistent Conversational Context

Maitreyi Chatterjee

Devansh Agarwal

RALM KELM

191

18 Aug 2025

Cognitive Decision Routing in Large Language Models: When to Think Fast, When to Think Slow

152

17 Aug 2025

Streamlining Admission with LOR Insights: AI-Based Leadership Assessment in Online Master's Program

234

07 Aug 2025

I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating Linguistic Shibboleth Detection in LLM Hiring Evaluations

152

06 Aug 2025

When Truth Is Overridden: Uncovering the Internal Origins of Sycophancy in Large Language Models

583

04 Aug 2025

Length Representations in Large Language Models

263

27 Jul 2025

Explainable Mapper: Charting LLM Embedding Spaces Using Perturbation-Based Explanation and Verification Agents

Xinyuan Yan

Rita Sevastjanova

Sinie van der Ben

Mennatallah El-Assady

Bei Wang

308

24 Jul 2025

Discourse Heuristics For Paradoxically Moral Self-Correction

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

220

01 Jul 2025

Can structural correspondences ground real world representational content in Large Language Models?

Iwan Williams

241

19 Jun 2025

A Vietnamese Dataset for Text Segmentation and Multiple Choices Reading Comprehension

193

19 Jun 2025

Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning

Stanley Ngugi

233

18 Jun 2025

From Raw Corpora to Domain Benchmarks: Automated Evaluation of LLM Domain Expertise

251

09 Jun 2025

Towards an Explainable Comparison and Alignment of Feature Embeddings

Mohammad Jalali

Bahar Dibaei Nia

Farzan Farnia

438

06 Jun 2025