
Open Sesame: Getting Inside BERT's Linguistic Knowledge

4 June 2019
Yongjie Lin
Y. Tan
Robert Frank

Papers citing "Open Sesame: Getting Inside BERT's Linguistic Knowledge"

Showing 50 of 166 citing papers
Exploring the Role of BERT Token Representations to Explain Sentence Probing Results
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Hosein Mohebbi
Ali Modarressi
Mohammad Taher Pilehvar
MILM
226
35
0
03 Apr 2021
Explaining the Road Not Taken
Hua Shen
Ting-Hao 'Kenneth' Huang
FAttXAI
200
9
0
27 Mar 2021
Bertinho: Galician BERT Representations
David Vilares
Marcos Garcia
Carlos Gómez-Rodríguez
167
24
0
25 Mar 2021
Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Mikael Brunila
Rosie Zhao
Andrei Mircea
Sam Lumley
R. Sieber
110
0
0
22 Mar 2021
Local Interpretations for Explainable Natural Language Processing: A Survey
ACM Computing Surveys (CSUR), 2021
Siwen Luo
Michal Guerquin
S. Han
Josiah Poon
MILM
414
64
0
20 Mar 2021
Large Pre-trained Language Models Contain Human-like Biases of What is Right and Wrong to Do
Nature Machine Intelligence (Nat. Mach. Intell.), 2021
P. Schramowski
Cigdem Turan
Nico Andersen
Constantin Rothkopf
Kristian Kersting
317
359
0
08 Mar 2021
Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages
Rajaswa Patil
Jasleen Dhillon
Siddhant Mahurkar
Saumitra Kulkarni
M. Malhotra
V. Baths
176
2
0
01 Mar 2021
On the Evolution of Syntactic Information Encoded by BERT's Contextualized Representations
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Laura Pérez-Mayos
Roberto Carlini
Miguel Ballesteros
Leo Wanner
203
10
0
27 Jan 2021
Regulatory Compliance through Doc2Doc Information Retrieval: A case study in EU/UK legislation where text similarity has limitations
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Ilias Chalkidis
Manos Fergadiotis
Nikolaos Manginas
Eva Katakalou
Prodromos Malakasiotis
AILaw
154
30
0
26 Jan 2021
The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
AAAI Conference on Artificial Intelligence (AAAI), 2021
Madhura Pande
Aakriti Budhraja
Preksha Nema
Pratyush Kumar
Mitesh M. Khapra
193
20
0
22 Jan 2021
Of Non-Linearity and Commutativity in BERT
IEEE International Joint Conference on Neural Networks (IJCNN), 2021
Sumu Zhao
Damian Pascual
Gino Brunner
Roger Wattenhofer
316
18
0
12 Jan 2021
Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
494
79
0
02 Jan 2021
FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Kushal Lakhotia
Bhargavi Paranjape
Asish Ghoshal
Anuj Kumar
Yashar Mehdad
Srini Iyer
133
32
0
31 Dec 2020
Out of Order: How Important Is The Sequential Order of Words in a Sentence in Natural Language Understanding Tasks?
Findings (Findings), 2020
Thang M. Pham
Trung Bui
Long Mai
Anh Totti Nguyen
551
127
0
30 Dec 2020
Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters
Marta R. Costa-jussá
Carlos Escolano
Christine Basta
Javier Ferrando
Roser Batlle-Roca
Ksenia Kharitonova
161
20
0
24 Dec 2020
Pre-Training a Language Model Without Human Language
Cheng-Han Chiang
Hung-yi Lee
160
13
0
22 Dec 2020
Enhancing deep neural networks with morphological information
Natural Language Engineering (NLE), 2020
Matej Klemen
Luka Krsnik
Marko Robnik-Šikonja
239
16
0
24 Nov 2020
Picking BERT's Brain: Probing for Linguistic Dependencies in Contextualized Embeddings Using Representational Similarity Analysis
International Conference on Computational Linguistics (COLING), 2020
Michael A. Lepori
R. Thomas McCoy
131
26
0
24 Nov 2020
Positional Artefacts Propagate Through Masked Language Model Embeddings
Ziyang Luo
Artur Kulmizev
Xiaoxi Mao
305
41
0
09 Nov 2020
Influence Patterns for Explaining Information Flow in BERT
Neural Information Processing Systems (NeurIPS), 2020
Kaiji Lu
Zifan Wang
Piotr (Peter) Mardziel
Anupam Datta
GNN
274
19
0
02 Nov 2020
Dynamic Contextualized Word Embeddings
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
417
57
0
23 Oct 2020
A Benchmark for Lease Contract Review
Spyretta Leivaditi
Julien Rossi
Evangelos Kanoulas
AILaw
294
43
0
20 Oct 2020
Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations
Nikolaos Manginas
Ilias Chalkidis
Prodromos Malakasiotis
114
5
0
12 Oct 2020
Unsupervised Distillation of Syntactic Information from Contextualized Word Representations
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2020
Shauli Ravfogel
Yanai Elazar
Jacob Goldberger
Yoav Goldberg
221
13
0
11 Oct 2020
Recurrent babbling: evaluating the acquisition of grammar from limited input data
Conference on Computational Natural Language Learning (CoNLL), 2020
Ludovica Pannitto
Aurélie Herbelot
146
16
0
09 Oct 2020
Intrinsic Probing through Dimension Selection
Lucas Torroba Hennigen
Adina Williams
Robert Bamler
212
61
0
06 Oct 2020
On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2020
Marius Mosbach
A. Khokhlova
Michael A. Hedderich
Dietrich Klakow
169
51
0
06 Oct 2020
Guiding Attention for Self-Supervised Learning with Transformers
Findings (Findings), 2020
Ameet Deshpande
Karthik Narasimhan
162
22
0
06 Oct 2020
Linguistic Profiling of a Neural Language Model
International Conference on Computational Linguistics (COLING), 2020
Alessio Miaschi
D. Brunato
F. Dell’Orletta
Giulia Venturi
279
49
0
05 Oct 2020
Which *BERT? A Survey Organizing Contextualized Encoders
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Patrick Xia
Shijie Wu
Benjamin Van Durme
227
53
0
02 Oct 2020
An information theoretic view on selecting linguistic probes
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Zining Zhu
Frank Rudzicz
169
22
0
15 Sep 2020
Bio-inspired Structure Identification in Language Embeddings
Hongwei Zhou
Oskar Elek
P. Anand
A. Forbes
166
2
0
05 Sep 2020
Attention Flows: Analyzing and Comparing Attention Mechanisms in Language Models
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2020
Joseph F DeRose
Jiayao Wang
M. Berger
141
108
0
03 Sep 2020
Is Supervised Syntactic Parsing Beneficial for Language Understanding? An Empirical Investigation
Goran Glavaš
Ivan Vulić
264
71
0
15 Aug 2020
Deep Contextual Clinical Prediction with Reverse Distillation
AAAI Conference on Artificial Intelligence (AAAI), 2020
Rohan Kodialam
Rebecca Boiarsky
Justin Lim
Neil Dixit
Aditya Sai
David Sontag
240
26
0
10 Jul 2020
BERTology Meets Biology: Interpreting Attention in Protein Language Models
Jesse Vig
Ali Madani
Lav Varshney
Caiming Xiong
R. Socher
Nazneen Rajani
409
336
0
26 Jun 2020
A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading Comprehension
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020
Jie Cai
Zhengzhou Zhu
Ping Nie
Qian Liu
AAML
111
7
0
02 Jun 2020
Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals
Yanai Elazar
Shauli Ravfogel
Alon Jacovi
Yoav Goldberg
385
25
0
01 Jun 2020
Query Resolution for Conversational Search with Limited Supervision
Nikos Voskarides
Dan Li
Sudipta Singha Roy
Evangelos Kanoulas
Maarten de Rijke
186
140
0
24 May 2020
Weakly-Supervised Neural Response Selection from an Ensemble of Task-Specialised Dialogue Agents
Asir Saeed
Khai Mai
Pham Quang Nhat Minh
Nguyen Tuan Duc
Danushka Bollegala
124
0
0
06 May 2020
The Sensitivity of Language Models and Humans to Winograd Schema Perturbations
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Mostafa Abdou
Vinit Ravishankar
Maria Barrett
Yonatan Belinkov
Desmond Elliott
Anders Søgaard
ReLMLRM
209
36
0
04 May 2020
DagoBERT: Generating Derivational Morphology with a Pretrained Language Model
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
254
2
0
02 May 2020
Intermediate-Task Transfer Learning with Pretrained Models for Natural Language Understanding: When and Why Does It Work?
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Yada Pruksachatkun
Jason Phang
Haokun Liu
Phu Mon Htut
Xiaoyi Zhang
Richard Yuanzhe Pang
Clara Vania
Katharina Kann
Samuel R. Bowman
CLLLRM
229
204
0
01 May 2020
When BERT Plays the Lottery, All Tickets Are Winning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Sai Prasanna
Anna Rogers
Anna Rumshisky
MILM
309
200
0
01 May 2020
Attribution Analysis of Grammatical Dependencies in LSTMs
Sophie Hao
250
3
0
30 Apr 2020
Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Michael A. Lepori
Tal Linzen
R. Thomas McCoy
NAI
209
11
0
30 Apr 2020
A Matter of Framing: The Impact of Linguistic Formalism on Probing Results
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ilia Kuznetsov
Iryna Gurevych
114
28
0
30 Apr 2020
Logic2Text: High-Fidelity Natural Language Generation from Logical Forms
Findings (Findings), 2020
Zhiyu Zoey Chen
Wenhu Chen
Hanwen Zha
Xiyou Zhou
Yunkai Zhang
Sairam Sundaresan
William Yang Wang
NAI
204
72
0
30 Apr 2020
Quantifying the Contextualization of Word Representations with Semantic Class Probing
Findings (Findings), 2020
Mengjie Zhao
Philipp Dufter
Yadollah Yaghoobzadeh
Hinrich Schütze
276
28
0
25 Apr 2020
Attention is Not Only a Weight: Analyzing Transformers with Vector Norms
Goro Kobayashi
Tatsuki Kuribayashi
Sho Yokoi
Kentaro Inui
189
15
0
21 Apr 2020
Page 3 of 4