ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.12471
  4. Cited By
Neural Network Acceptability Judgments
v1v2v3 (latest)

Neural Network Acceptability Judgments

31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
ArXiv (abs)PDFHTML

Papers citing "Neural Network Acceptability Judgments"

50 / 950 papers shown
Benchmarking down-scaled (not so large) pre-trained language models
Benchmarking down-scaled (not so large) pre-trained language modelsConference on Natural Language Processing (NLP), 2021
Yi Men
P. Schulze
C. Heumann
115
1
0
11 May 2021
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction
  from Language Models
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Anne Beyer
Sharid Loáiciga
David Schlangen
179
19
0
07 May 2021
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing
  Regressions In NLP Model Updates
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model UpdatesAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Yuqing Xie
Yi-An Lai
Yuanjun Xiong
Yi Zhang
Stefano Soatto
UQCV
152
19
0
07 May 2021
Entailment as Few-Shot Learner
Entailment as Few-Shot Learner
Sinong Wang
Han Fang
Madian Khabsa
Hanzi Mao
Hao Ma
210
192
0
29 Apr 2021
Morph Call: Probing Morphosyntactic Content of Multilingual Transformers
Morph Call: Probing Morphosyntactic Content of Multilingual Transformers
Vladislav Mikhailov
O. Serikov
Ekaterina Artemova
246
10
0
26 Apr 2021
On Geodesic Distances and Contextual Embedding Compression for Text
  Classification
On Geodesic Distances and Contextual Embedding Compression for Text Classification
Rishi Jha
Kai Mihata
93
7
0
22 Apr 2021
SoT: Delving Deeper into Classification Head for Transformer
SoT: Delving Deeper into Classification Head for Transformer
Jiangtao Xie
Rui Zeng
Qilong Wang
Ziqi Zhou
P. Li
ViT
219
12
0
22 Apr 2021
Sensitivity as a Complexity Measure for Sequence Classification Tasks
Sensitivity as a Complexity Measure for Sequence Classification TasksTransactions of the Association for Computational Linguistics (TACL), 2021
Michael Hahn
Dan Jurafsky
Richard Futrell
322
24
0
21 Apr 2021
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in
  NLP
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Qinyuan Ye
Bill Yuchen Lin
Xiang Ren
635
195
0
18 Apr 2021
GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation
GPT3Mix: Leveraging Large-scale Language Models for Text AugmentationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Kang Min Yoo
Dongju Park
Jaewook Kang
Sang-Woo Lee
Woomyeong Park
392
274
0
18 Apr 2021
Contrastive Out-of-Distribution Detection for Pretrained Transformers
Contrastive Out-of-Distribution Detection for Pretrained TransformersConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Wenxuan Zhou
Fangyu Liu
Muhao Chen
229
111
0
18 Apr 2021
Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation
  for Few-shot Learning
Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xisen Jin
Bill Yuchen Lin
Mohammad Rostami
Xiang Ren
BDLCLL
298
44
0
18 Apr 2021
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean
  Crawled Corpus
Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled CorpusConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Jesse Dodge
Maarten Sap
Ana Marasović
William Agnew
Gabriel Ilharco
Dirk Groeneveld
Margaret Mitchell
Matt Gardner
AILaw
309
562
0
18 Apr 2021
What to Pre-Train on? Efficient Intermediate Task Selection
What to Pre-Train on? Efficient Intermediate Task SelectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Clifton A. Poth
Jonas Pfeiffer
Andreas Rucklé
Iryna Gurevych
253
106
0
16 Apr 2021
Effect of Visual Extensions on Natural Language Understanding in
  Vision-and-Language Models
Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Taichi Iki
Akiko Aizawa
VLM
232
21
0
16 Apr 2021
Probing Across Time: What Does RoBERTa Know and When?
Probing Across Time: What Does RoBERTa Know and When?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Leo Z. Liu
Yizhong Wang
Jungo Kasai
Hannaneh Hajishirzi
Noah A. Smith
KELM
330
96
0
16 Apr 2021
How to Train BERT with an Academic Budget
How to Train BERT with an Academic BudgetConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Peter Izsak
Moshe Berchansky
Omer Levy
338
128
0
15 Apr 2021
Annealing Knowledge Distillation
Annealing Knowledge DistillationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
A. Jafari
Mehdi Rezagholizadeh
Pranav Sharma
A. Ghodsi
186
90
0
14 Apr 2021
Masked Language Modeling and the Distributional Hypothesis: Order Word
  Matters Pre-training for Little
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for LittleConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Koustuv Sinha
Robin Jia
Dieuwke Hupkes
J. Pineau
Adina Williams
Douwe Kiela
260
275
0
14 Apr 2021
On the Use of Linguistic Features for the Evaluation of Generative
  Dialogue Systems
On the Use of Linguistic Features for the Evaluation of Generative Dialogue Systems
Ian Berlot-Attwell
Frank Rudzicz
98
2
0
13 Apr 2021
Understanding Transformers for Bot Detection in Twitter
Understanding Transformers for Bot Detection in Twitter
Andrés García-Silva
Cristian Berrío
José Manuél Gómez-Pérez
88
4
0
13 Apr 2021
Targeted Adversarial Training for Natural Language Understanding
Targeted Adversarial Training for Natural Language UnderstandingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
L. Pereira
Xiaodong Liu
Hao Cheng
Hoifung Poon
Jianfeng Gao
Ichiro Kobayashi
151
12
0
12 Apr 2021
FUDGE: Controlled Text Generation With Future Discriminators
FUDGE: Controlled Text Generation With Future DiscriminatorsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Kevin Kaichuang Yang
Dan Klein
308
386
0
12 Apr 2021
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization
  Approach
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization ApproachConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Simiao Zuo
Chen Liang
Haoming Jiang
Xiaodong Liu
Pengcheng He
Jianfeng Gao
Weizhu Chen
T. Zhao
242
10
0
11 Apr 2021
Adapting Language Models for Zero-shot Learning by Meta-tuning on
  Dataset and Prompt Collections
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt CollectionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Ruiqi Zhong
Kristy Lee
Zheng Zhang
Dan Klein
461
181
0
10 Apr 2021
EXPATS: A Toolkit for Explainable Automated Text Scoring
EXPATS: A Toolkit for Explainable Automated Text Scoring
Hitoshi Manabe
Masato Hagiwara
97
5
0
07 Apr 2021
Exploring the Role of BERT Token Representations to Explain Sentence
  Probing Results
Exploring the Role of BERT Token Representations to Explain Sentence Probing ResultsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Hosein Mohebbi
Ali Modarressi
Mohammad Taher Pilehvar
MILM
223
33
0
03 Apr 2021
Evaluating the Morphosyntactic Well-formedness of Generated Texts
Evaluating the Morphosyntactic Well-formedness of Generated TextsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Adithya Pratapa
Antonios Anastasopoulos
Shruti Rijhwani
Aditi Chaudhary
David R. Mortensen
Graham Neubig
Yulia Tsvetkov
149
9
0
30 Mar 2021
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal
  Dependencies
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal Dependencies
P. Jayarao
Arpit Sharma
102
4
0
29 Mar 2021
A Practical Survey on Faster and Lighter Transformers
A Practical Survey on Faster and Lighter TransformersACM Computing Surveys (CSUR), 2021
Quentin Fournier
G. Caron
Daniel Aloise
378
136
0
26 Mar 2021
Approximating Instance-Dependent Noise via Instance-Confidence Embedding
Approximating Instance-Dependent Noise via Instance-Confidence Embedding
Yivan Zhang
Masashi Sugiyama
132
10
0
25 Mar 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New
  Multitask Benchmark
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask BenchmarkAAAI Conference on Artificial Intelligence (AAAI), 2021
Nicholas Lourie
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
LRM
264
146
0
24 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning
  Architectures
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning ArchitecturesIEEE Access (IEEE Access), 2021
Sushant Singh
A. Mahmood
AI4TS
325
120
0
23 Mar 2021
Unsupervised Contextual Paraphrase Generation using Lexical Control and
  Reinforcement Learning
Unsupervised Contextual Paraphrase Generation using Lexical Control and Reinforcement Learning
Sonal Garg
Sumanth Prabhu
Hemant Misra
G. Srinivasaraghavan
129
15
0
23 Mar 2021
TAG: Gradient Attack on Transformer-based Language Models
TAG: Gradient Attack on Transformer-based Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Jieren Deng
Yijue Wang
Ji Li
Chao Shang
Hang Liu
Sanguthevar Rajasekaran
Caiwen Ding
FedMLPILM
198
94
0
11 Mar 2021
FairFil: Contrastive Neural Debiasing Method for Pretrained Text
  Encoders
FairFil: Contrastive Neural Debiasing Method for Pretrained Text EncodersInternational Conference on Learning Representations (ICLR), 2021
Pengyu Cheng
Weituo Hao
Siyang Yuan
Shijing Si
Lawrence Carin
225
117
0
11 Mar 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey
  and Research Challenges
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research ChallengesACM Computing Surveys (CSUR), 2021
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
404
274
0
08 Mar 2021
Rissanen Data Analysis: Examining Dataset Characteristics via
  Description Length
Rissanen Data Analysis: Examining Dataset Characteristics via Description LengthInternational Conference on Machine Learning (ICML), 2021
Ethan Perez
Douwe Kiela
Dong Wang
202
25
0
05 Mar 2021
Token-Modification Adversarial Attacks for Natural Language Processing:
  A Survey
Token-Modification Adversarial Attacks for Natural Language Processing: A SurveyAI Communications (AI Commun.), 2021
Tom Roth
Yansong Gao
A. Abuadbba
Surya Nepal
Wei Liu
AAML
243
17
0
01 Mar 2021
SparseBERT: Rethinking the Importance Analysis in Self-attention
SparseBERT: Rethinking the Importance Analysis in Self-attentionInternational Conference on Machine Learning (ICML), 2021
Han Shi
Jiahui Gao
Xiaozhe Ren
Hang Xu
Xiaodan Liang
Zhenguo Li
James T. Kwok
194
59
0
25 Feb 2021
Analyzing Curriculum Learning for Sentiment Analysis along Task
  Difficulty, Pacing and Visualization Axes
Analyzing Curriculum Learning for Sentiment Analysis along Task Difficulty, Pacing and Visualization AxesWorkshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2021
Anvesh Rao Vijjini
Kaveri Anuranjana
R. Mamidi
241
4
0
19 Feb 2021
COCO-LM: Correcting and Contrasting Text Sequences for Language Model
  Pretraining
COCO-LM: Correcting and Contrasting Text Sequences for Language Model PretrainingNeural Information Processing Systems (NeurIPS), 2021
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
814
221
0
16 Feb 2021
AutoFreeze: Automatically Freezing Model Blocks to Accelerate
  Fine-tuning
AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning
Yuhan Liu
Saurabh Agarwal
Shivaram Venkataraman
OffRL
243
71
0
02 Feb 2021
Explaining Natural Language Processing Classifiers with Occlusion and
  Language Modeling
Explaining Natural Language Processing Classifiers with Occlusion and Language Modeling
David Harbecke
AAML
193
2
0
28 Jan 2021
CLiMP: A Benchmark for Chinese Language Model Evaluation
CLiMP: A Benchmark for Chinese Language Model EvaluationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Beilei Xiang
Changbing Yang
Yu Li
Alex Warstadt
Katharina Kann
ALM
163
56
0
26 Jan 2021
Muppet: Massive Multi-task Representations with Pre-Finetuning
Muppet: Massive Multi-task Representations with Pre-FinetuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Armen Aghajanyan
Anchit Gupta
Akshat Shrivastava
Xilun Chen
Luke Zettlemoyer
Sonal Gupta
194
288
0
26 Jan 2021
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal
  Classification Paradigm
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification ParadigmConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Akshay Krishna Sheshadri
Anvesh Rao Vijjini
S. Kharbanda
121
9
0
14 Jan 2021
I-BERT: Integer-only BERT Quantization
I-BERT: Integer-only BERT QuantizationInternational Conference on Machine Learning (ICML), 2021
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
467
370
0
05 Jan 2021
WARP: Word-level Adversarial ReProgramming
WARP: Word-level Adversarial ReProgrammingAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Karen Hambardzumyan
Hrant Khachatrian
Jonathan May
AAML
662
368
0
01 Jan 2021
MiniLMv2: Multi-Head Self-Attention Relation Distillation for
  Compressing Pretrained Transformers
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained TransformersFindings (Findings), 2020
Wenhui Wang
Hangbo Bao
Shaohan Huang
Li Dong
Furu Wei
MQ
413
346
0
31 Dec 2020
Previous
123...1516171819
Next