v1v2v3 (latest)

Neural Network Acceptability Judgments

31 May 2018

Alex Warstadt

Amanpreet Singh

Samuel R. Bowman

ArXiv (abs)PDF HTML

Papers citing "Neural Network Acceptability Judgments"

50 / 950 papers shown

Generating Training Data with Language Models: Towards Zero-Shot Language UnderstandingNeural Information Processing Systems (NeurIPS), 2022

Yu Zhang

230

274

09 Feb 2022

What are the best systems? New perspectives on NLP Benchmarking

468

08 Feb 2022

Nonparametric Uncertainty Quantification for Single Deterministic Neural NetworkNeural Information Processing Systems (NeurIPS), 2022

180

07 Feb 2022

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer ModelsInternational Conference on Learning Representations (ICLR), 2022

Xiaodong Liu

176

06 Feb 2022

AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models

Dongkuan Xu

Xiaodong Liu

Ahmed Hassan Awadallah

Jianfeng Gao

198

29 Jan 2022

Describing Differences between Text Distributions with Natural LanguageInternational Conference on Machine Learning (ICML), 2022

Ruiqi Zhong

300

28 Jan 2022

Black-box Prompt Learning for Pre-trained Language Models

Yong Lin

Tong Zhang

290

21 Jan 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment AnalysisInternational Conference on Language Resources and Evaluation (LREC), 2022

Shamsuddeen Hassan Muhammad

David Ifeoluwa Adelani

...

305

119

20 Jan 2022

The Dark Side of the Language: Pre-trained Transformers in the DarkNetRecent Advances in Natural Language Processing (RANLP), 2022

Fabio Massimo Zanzotto

VLM

249

14 Jan 2022

How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets

Marianna Apidianaki

149

12 Jan 2022

Latency Adjustable Transformer Encoder for Language UnderstandingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

Sajjad Kachuee

M. Sharifkhani

579

10 Jan 2022

Transformer Uncertainty Estimation with Hierarchical Stochastic AttentionAAAI Conference on Artificial Intelligence (AAAI), 2021

Jiahuan Pei

Cheng-Yu Wang

Gyuri Szarvas

171

27 Dec 2021

An Empirical Investigation of the Role of Pre-training in Lifelong Learning

362

167

16 Dec 2021

LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework

Xin Jiang

Qun Liu

Hinrich Schütze

RALM

278

14 Dec 2021

Pruning Pretrained Encoders with a Multitask Objective

Patrick Xia

Richard Shin

129

10 Dec 2021

FLAVA: A Foundational Language And Vision Alignment Model

Amanpreet Singh

Douwe Kiela

355

863

08 Dec 2021

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning

...

304

230

22 Nov 2021

Can depth-adaptive BERT perform better on binary classification tasks

177

22 Nov 2021

Merging Models with Fisher-Weighted Averaging

Michael Matena

Colin Raffel

FedML MoMe

543

523

18 Nov 2021

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

Pengcheng He

Jianfeng Gao

Weizhu Chen

826

1,585

18 Nov 2021

Few-Shot Self-Rationalization with Natural Language Prompts

267

115

16 Nov 2021

Variation and generality in encoding of syntactic anomaly information in sentence embeddingsBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2021

Qinxuan Wu

Allyson Ettinger

177

12 Nov 2021

Defining and Quantifying the Emergence of Sparse Concepts in DNNsComputer Vision and Pattern Recognition (CVPR), 2021

599

11 Nov 2021

A Survey on Green Deep Learning

Lei Li

457

08 Nov 2021

MetaICL: Learning to Learn In ContextNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Luke Zettlemoyer

681

575

29 Oct 2021

Alignment Attention by Matching Key and Query DistributionsNeural Information Processing Systems (NeurIPS), 2021

Shujian Zhang

Xinjie Fan

Huangjie Zheng

Korawat Tanwisuth

Mingyuan Zhou

OOD

220

25 Oct 2021

Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality TransferConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Eleftheria Briakou

Sweta Agrawal

Joel R. Tetreault

Marine Carpuat

202

20 Oct 2021

The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding

421

19 Oct 2021

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm

Dongkuan Xu

...

Sanguthevar Rajasekaran

Hang Liu

Caiwen Ding

CLL VLM

214

15 Oct 2021

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

470

315

15 Oct 2021

Exploring Universal Intrinsic Task Subspace via Prompt Tuning

Yujia Qin

Xiaozhi Wang

Yusheng Su

Yankai Lin

Ning Ding

...

Juanzi Li

Lei Hou

Peng Li

Maosong Sun

Jie Zhou

VLM VPVLM

313

15 Oct 2021

bert2BERT: Towards Reusable Pretrained Language Models

Cheng Chen

Yichun Yin

Lifeng Shang

Xin Jiang

Zhiyuan Liu

Qun Liu

VLM

215

14 Oct 2021

A Survey On Neural Word Embeddings

Erhan Sezerer

Selma Tekir

AI4TS

258

05 Oct 2021

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts

Zhengyan Zhang

Yankai Lin

Zhiyuan Liu

Peng Li

Maosong Sun

Jie Zhou

MoE

420

162

05 Oct 2021

Focused Contrastive Training for Test-based Constituency Analysis

Benjamin Roth

Erion cCano

106

30 Sep 2021

Shaking Syntactic Trees on the Sesame Street: Multilingual Probing with Controllable Perturbations

Ekaterina Taktasheva

Vladislav Mikhailov

Ekaterina Artemova

223

28 Sep 2021

Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpusConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

215

24 Sep 2021

Revisiting the Uniform Information Density HypothesisConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Lena Jäger

210

23 Sep 2021

Dynamic Knowledge Distillation for Pre-trained Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Lei Li

Yankai Lin

Shuhuai Ren

Peng Li

Jie Zhou

Xu Sun

249

23 Sep 2021

Survey: Transformer based Video-Language Pre-training

Ludan Ruan

Qin Jin

VLM ViT

205

21 Sep 2021

Training Dynamic based data filtering may not work for NLP datasets

121

19 Sep 2021

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation

David Ifeoluwa Adelani

Xiaoyu Shen

161

19 Sep 2021

Text Detoxification using Large Pre-trained Neural Models

298

18 Sep 2021

Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers

Jason Phang

Haokun Liu

Samuel R. Bowman

244

17 Sep 2021

SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations

15 Sep 2021

EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation

Hang Xu

Xiaodan Liang

177

15 Sep 2021

ARCH: Efficient Adversarial Regularized Training with Caching

Xiaodong Liu

172

15 Sep 2021

Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension

Naoya Inoue

H. Trivedi

Steven K. Sinha

Niranjan Balasubramanian

Kentaro Inui

134

14 Sep 2021

LM-Critic: Language Models for Unsupervised Grammatical Error Correction

Michihiro Yasunaga

J. Leskovec

Abigail Z. Jacobs

185

14 Sep 2021

Not All Models Localize Linguistic Knowledge in the Same Place: A Layer-wise Probing on BERToids' Representations

Mohammad Taher Pilehvar

167

13 Sep 2021