arXiv:1805.12471
Neural Network Acceptability Judgments
31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
Papers citing "Neural Network Acceptability Judgments" (50 of 950 papers shown)
Towards Debiasing Sentence Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Paul Pu Liang
Irene Li
Emily Zheng
Y. Lim
Ruslan Salakhutdinov
Louis-Philippe Morency
16 Jul 2020
Can neural networks acquire a structural bias from raw linguistic data?
Annual Meeting of the Cognitive Science Society (CogSci), 2020
Alex Warstadt
Samuel R. Bowman
14 Jul 2020
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Yi Tay
Zhe Zhao
Dara Bahri
Donald Metzler
Da-Cheng Juan
12 Jul 2020
Unsupervised Paraphrasing via Deep Reinforcement Learning
A.B. Siddique
Samet Oymak
Vagelis Hristidis
05 Jul 2020
Building Interpretable Interaction Trees for Deep NLP Models
Die Zhang
Huilin Zhou
Hao Zhang
Xiaoyi Bao
Da Huo
Ruizhao Chen
Feng He
Mengyue Wu
Quanshi Zhang
29 Jun 2020
MaxVA: Fast Adaptation of Step Sizes by Maximizing Observed Variance of Gradients
Chenfei Zhu
Yu Cheng
Zhe Gan
Furong Huang
Jingjing Liu
Tom Goldstein
21 Jun 2020
SqueezeBERT: What can computer vision teach NLP about efficient neural networks?
F. Iandola
Albert Eaton Shaw
Ravi Krishna
Kurt Keutzer
19 Jun 2020
Revisiting Few-sample BERT Fine-tuning
International Conference on Learning Representations (ICLR), 2020
Tianyi Zhang
Felix Wu
Arzoo Katiyar
Kilian Q. Weinberger
Yoav Artzi
10 Jun 2020
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
Marius Mosbach
Maksym Andriushchenko
Dietrich Klakow
08 Jun 2020
BERT Loses Patience: Fast and Robust Inference with Early Exit
Wangchunshu Zhou
Canwen Xu
Tao Ge
Julian McAuley
Ke Xu
Furu Wei
07 Jun 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
05 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
Zihang Dai
Guokun Lai
Yiming Yang
Quoc V. Le
05 Jun 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Transactions of the Association for Computational Linguistics (TACL), 2020
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
27 May 2020
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Anne Lauscher
Olga Majewska
Leonardo F. R. Ribeiro
Iryna Gurevych
Nikolai Rozanov
Goran Glavaš
24 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao Fang
Sicheng Wang
Meng Zhou
Jiayuan Ding
P. Xie
16 May 2020
A Systematic Assessment of Syntactic Generalization in Neural Language Models
Jennifer Hu
Jon Gauthier
Peng Qian
Ethan Gotlieb Wilcox
R. Levy
07 May 2020
Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Josef Klafka
Allyson Ettinger
04 May 2020
Intermediate-Task Transfer Learning with Pretrained Models for Natural Language Understanding: When and Why Does It Work?
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Yada Pruksachatkun
Jason Phang
Haokun Liu
Phu Mon Htut
Xiaoyi Zhang
Richard Yuanzhe Pang
Clara Vania
Katharina Kann
Samuel R. Bowman
01 May 2020
When BERT Plays the Lottery, All Tickets Are Winning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Sai Prasanna
Anna Rogers
Anna Rumshisky
01 May 2020
Cross-Linguistic Syntactic Evaluation of Word Prediction Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Aaron Mueller
Garrett Nicolai
Panayiota Petrou-Zeniou
N. Talmina
Tal Linzen
01 May 2020
Segatron: Segment-Aware Transformer for Language Modeling and Understanding
Richard He Bai
Peng Shi
Jimmy J. Lin
Yuqing Xie
Luchen Tan
Kun Xiong
Wen Gao
Ming Li
30 Apr 2020
Investigating Transferability in Pretrained Language Models
Findings (Findings), 2020
Alex Tamkin
Trisha Singh
D. Giovanardi
Noah D. Goodman
30 Apr 2020
TAVAT: Token-Aware Virtual Adversarial Training for Language Understanding
Linyang Li
Xipeng Qiu
30 Apr 2020
Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Sanyuan Chen
Yutai Hou
Yiming Cui
Wanxiang Che
Ting Liu
Xiangzhan Yu
27 Apr 2020
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Mengjie Zhao
Tao Lin
Fei Mi
Martin Jaggi
Hinrich Schütze
26 Apr 2020
Reevaluating Adversarial Examples in Natural Language
Findings (Findings), 2020
John X. Morris
Eli Lifland
Jack Lanchantin
Yangfeng Ji
Yanjun Qi
25 Apr 2020
How fine can fine-tuning be? Learning efficient language models
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Evani Radiya-Dixit
Xin Wang
24 Apr 2020
Considering Likelihood in NLP Classification Explanations with Occlusion and Language Modeling
David Harbecke
Christoph Alt
21 Apr 2020
MPNet: Masked and Permuted Pre-training for Language Understanding
Neural Information Processing Systems (NeurIPS), 2020
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
20 Apr 2020
Adversarial Training for Large Neural Language Models
Xiaodong Liu
Hao Cheng
Pengcheng He
Weizhu Chen
Yu Wang
Hoifung Poon
Jianfeng Gao
20 Apr 2020
Coreferential Reasoning Learning for Language Representation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Deming Ye
Yankai Lin
Jiaju Du
Zhenghao Liu
Peng Li
Maosong Sun
Zhiyuan Liu
15 Apr 2020
VGCN-BERT: Augmenting BERT with Graph Embedding for Text Classification
European Conference on Information Retrieval (ECIR), 2020
Zhibin Lu
Pan Du
J. Nie
12 Apr 2020
Frequency, Acceptability, and Selection: A case study of clause-embedding
Glossa (Glossa), 2020
Aaron Steven White
Kyle Rawlins
08 Apr 2020
Improving BERT with Self-Supervised Attention
IEEE Access (IEEE Access), 2020
Yiren Chen
Xiaoyu Kou
Jiangang Bai
Yunhai Tong
08 Apr 2020
CALM: Continuous Adaptive Learning for Language Modeling
Kristjan Arumae
Parminder Bhatia
08 Apr 2020
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Zhiqing Sun
Hongkun Yu
Xiaodan Song
Renjie Liu
Yiming Yang
Denny Zhou
06 Apr 2020
How Furiously Can Colourless Green Ideas Sleep? Sentence Acceptability in Context
Transactions of the Association for Computational Linguistics (TACL), 2020
Jey Han Lau
C. S. Armendariz
Shalom Lappin
Matthew Purver
Chang Shu
02 Apr 2020
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training
International Conference on Machine Learning (ICML), 2020
Hangbo Bao
Li Dong
Furu Wei
Wenhui Wang
Nan Yang
...
Yu Wang
Songhao Piao
Jianfeng Gao
Ming Zhou
H. Hon
28 Feb 2020
MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers
Neural Information Processing Systems (NeurIPS), 2020
Wenhui Wang
Furu Wei
Li Dong
Hangbo Bao
Nan Yang
Ming Zhou
25 Feb 2020
Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation
Journal of Computational Science and Technology (JCST), 2020
Yige Xu
Xipeng Qiu
L. Zhou
Xuanjing Huang
24 Feb 2020
The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Xiaodong Liu
Yu Wang
Jianshu Ji
Hao Cheng
Xueyun Zhu
...
Pengcheng He
Weizhu Chen
Hoifung Poon
Guihong Cao
Jianfeng Gao
19 Feb 2020
Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping
Jesse Dodge
Gabriel Ilharco
Roy Schwartz
Ali Farhadi
Hannaneh Hajishirzi
Noah A. Smith
15 Feb 2020
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Canwen Xu
Wangchunshu Zhou
Tao Ge
Furu Wei
Ming Zhou
07 Feb 2020
BLiMP: The Benchmark of Linguistic Minimal Pairs for English
Transactions of the Association for Computational Linguistics (TACL), 2019
Alex Warstadt
Alicia Parrish
Haokun Liu
Anhad Mohananey
Wei Peng
Sheng-Fu Wang
Samuel R. Bowman
02 Dec 2019
Neural language modeling of free word order argument structure
Charlotte Rochereau
Benoît Sagot
Emmanuel Dupoux
30 Nov 2019
Do Attention Heads in BERT Track Syntactic Dependencies?
Phu Mon Htut
Jason Phang
Shikha Bordia
Samuel R. Bowman
27 Nov 2019
Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks
International Conference on Computational Linguistics (COLING), 2019
Trapit Bansal
Rishikesh Jha
Andrew McCallum
10 Nov 2019
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
T. Zhao
08 Nov 2019
What Would Elsa Do? Freezing Layers During Transformer Fine-Tuning
Jaejun Lee
Raphael Tang
Jimmy J. Lin
08 Nov 2019
MML: Maximal Multiverse Learning for Robust Fine-Tuning of Language Models
Itzik Malkiel
Lior Wolf
05 Nov 2019