Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1805.12471
Cited By
v1
v2
v3 (latest)
Neural Network Acceptability Judgments
31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Network Acceptability Judgments"
50 / 950 papers shown
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
Yu Meng
Jiaxin Huang
Yu Zhang
Jiawei Han
SyDa
230
274
0
09 Feb 2022
What are the best systems? New perspectives on NLP Benchmarking
Pierre Colombo
Nathan Noiry
Ekhine Irurozki
Nathan Huet
468
43
0
08 Feb 2022
Nonparametric Uncertainty Quantification for Single Deterministic Neural Network
Neural Information Processing Systems (NeurIPS), 2022
Nikita Kotelevskii
A. Artemenkov
Kirill Fedyanin
Fedor Noskov
Alexander Fishkov
Artem Shelmanov
Artem Vazhentsev
Aleksandr Petiushko
Maxim Panov
UQCV
BDL
180
42
0
07 Feb 2022
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
International Conference on Learning Representations (ICLR), 2022
Chen Liang
Haoming Jiang
Simiao Zuo
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
T. Zhao
176
17
0
06 Feb 2022
AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models
Dongkuan Xu
Subhabrata Mukherjee
Xiaodong Liu
Debadeepta Dey
Wenhui Wang
Xiang Zhang
Ahmed Hassan Awadallah
Jianfeng Gao
198
5
0
29 Jan 2022
Describing Differences between Text Distributions with Natural Language
International Conference on Machine Learning (ICML), 2022
Ruiqi Zhong
Charles Burton Snell
Dan Klein
Jacob Steinhardt
VLM
300
55
0
28 Jan 2022
Black-box Prompt Learning for Pre-trained Language Models
Shizhe Diao
Zhichao Huang
Ruijia Xu
Xuechun Li
Yong Lin
Xiao Zhou
Tong Zhang
VLM
AAML
290
83
0
21 Jan 2022
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis
International Conference on Language Resources and Evaluation (LREC), 2022
Shamsuddeen Hassan Muhammad
David Ifeoluwa Adelani
Sebastian Ruder
Ibrahim Said Ahmad
Idris Abdulmumin
...
Chris C. Emezue
Saheed Abdul
Anuoluwapo Aremu
Alipio Jeorge
P. Brazdil
305
119
0
20 Jan 2022
The Dark Side of the Language: Pre-trained Transformers in the DarkNet
Recent Advances in Natural Language Processing (RANLP), 2022
Leonardo Ranaldi
Aria Nourbakhsh
Arianna Patrizi
Elena Sofia Ruzzetti
Dario Onorati
Francesca Fallucchi
Fabio Massimo Zanzotto
VLM
249
21
0
14 Jan 2022
How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets
Aarne Talman
Marianna Apidianaki
S. Chatzikyriakidis
Jörg Tiedemann
ELM
149
1
0
12 Jan 2022
Latency Adjustable Transformer Encoder for Language Understanding
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Sajjad Kachuee
M. Sharifkhani
579
1
0
10 Jan 2022
Transformer Uncertainty Estimation with Hierarchical Stochastic Attention
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jiahuan Pei
Cheng-Yu Wang
Gyuri Szarvas
171
31
0
27 Dec 2021
An Empirical Investigation of the Role of Pre-training in Lifelong Learning
Sanket Vaibhav Mehta
Darshan Patil
Sarath Chandar
Emma Strubell
CLL
362
167
0
16 Dec 2021
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework
Mengjie Zhao
Fei Mi
Yasheng Wang
Minglei Li
Xin Jiang
Qun Liu
Hinrich Schütze
RALM
278
12
0
14 Dec 2021
Pruning Pretrained Encoders with a Multitask Objective
Patrick Xia
Richard Shin
129
0
0
10 Dec 2021
FLAVA: A Foundational Language And Vision Alignment Model
Amanpreet Singh
Ronghang Hu
Vedanuj Goswami
Guillaume Couairon
Wojciech Galuba
Marcus Rohrbach
Douwe Kiela
CLIP
VLM
355
863
0
08 Dec 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
V. Aribandi
Yi Tay
Tal Schuster
J. Rao
H. Zheng
...
Jianmo Ni
Jai Gupta
Kai Hui
Sebastian Ruder
Donald Metzler
MoE
304
230
0
22 Nov 2021
Can depth-adaptive BERT perform better on binary classification tasks
Jing Fan
Xin Zhang
Sheng Zhang
Yan Pan
Lixiang Guo
MQ
177
0
0
22 Nov 2021
Merging Models with Fisher-Weighted Averaging
Michael Matena
Colin Raffel
FedML
MoMe
543
523
0
18 Nov 2021
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Pengcheng He
Jianfeng Gao
Weizhu Chen
826
1,585
0
18 Nov 2021
Few-Shot Self-Rationalization with Natural Language Prompts
Ana Marasović
Iz Beltagy
Doug Downey
Matthew E. Peters
LRM
267
115
0
16 Nov 2021
Variation and generality in encoding of syntactic anomaly information in sentence embeddings
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2021
Qinxuan Wu
Allyson Ettinger
177
2
0
12 Nov 2021
Defining and Quantifying the Emergence of Sparse Concepts in DNNs
Computer Vision and Pattern Recognition (CVPR), 2021
Jie Ren
Mingjie Li
Qirui Chen
Huiqi Deng
Quanshi Zhang
599
41
0
11 Nov 2021
A Survey on Green Deep Learning
Jingjing Xu
Wangchunshu Zhou
Zhiyi Fu
Hao Zhou
Lei Li
VLM
457
93
0
08 Nov 2021
MetaICL: Learning to Learn In Context
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Sewon Min
M. Lewis
Luke Zettlemoyer
Hannaneh Hajishirzi
LRM
681
575
0
29 Oct 2021
Alignment Attention by Matching Key and Query Distributions
Neural Information Processing Systems (NeurIPS), 2021
Shujian Zhang
Xinjie Fan
Huangjie Zheng
Korawat Tanwisuth
Mingyuan Zhou
OOD
220
16
0
25 Oct 2021
Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Eleftheria Briakou
Sweta Agrawal
Joel R. Tetreault
Marine Carpuat
202
36
0
20 Oct 2021
The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding
Pratik Fegade
Tianqi Chen
Phillip B. Gibbons
T. Mowry
421
34
0
19 Oct 2021
Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm
Shaoyi Huang
Dongkuan Xu
Ian En-Hsu Yen
Yijue Wang
Sung-En Chang
...
Shiyang Chen
Mimi Xie
Sanguthevar Rajasekaran
Hang Liu
Caiwen Ding
CLL
VLM
214
36
0
15 Oct 2021
SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer
Tu Vu
Brian Lester
Noah Constant
Rami Al-Rfou
Daniel Cer
VLM
LRM
470
315
0
15 Oct 2021
Exploring Universal Intrinsic Task Subspace via Prompt Tuning
Yujia Qin
Xiaozhi Wang
Yusheng Su
Yankai Lin
Ning Ding
...
Juanzi Li
Lei Hou
Peng Li
Maosong Sun
Jie Zhou
VLM
VPVLM
313
31
0
15 Oct 2021
bert2BERT: Towards Reusable Pretrained Language Models
Cheng Chen
Yichun Yin
Lifeng Shang
Xin Jiang
Yujia Qin
Fengyu Wang
Zhi Wang
Xiao Chen
Zhiyuan Liu
Qun Liu
VLM
215
73
0
14 Oct 2021
A Survey On Neural Word Embeddings
Erhan Sezerer
Selma Tekir
AI4TS
258
20
0
05 Oct 2021
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
MoE
420
162
0
05 Oct 2021
Focused Contrastive Training for Test-based Constituency Analysis
Benjamin Roth
Erion cCano
106
0
0
30 Sep 2021
Shaking Syntactic Trees on the Sesame Street: Multilingual Probing with Controllable Perturbations
Ekaterina Taktasheva
Vladislav Mikhailov
Ekaterina Artemova
223
14
0
28 Sep 2021
Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Daniela Trotta
R. Guarasci
Elisa Leonardelli
Sara Tonelli
215
37
0
24 Sep 2021
Revisiting the Uniform Information Density Hypothesis
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Clara Meister
Tiago Pimentel
Patrick Haller
Lena Jäger
Robert Bamler
R. Levy
210
92
0
23 Sep 2021
Dynamic Knowledge Distillation for Pre-trained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Lei Li
Yankai Lin
Shuhuai Ren
Peng Li
Jie Zhou
Xu Sun
249
58
0
23 Sep 2021
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLM
ViT
205
49
0
21 Sep 2021
Training Dynamic based data filtering may not work for NLP datasets
Arka Talukdar
Monika Dagar
Prachi Gupta
Varun G. Menon
NoLa
121
3
0
19 Sep 2021
Preventing Author Profiling through Zero-Shot Multilingual Back-Translation
David Ifeoluwa Adelani
Miaoran Zhang
Xiaoyu Shen
A. Davody
Thomas Kleinbauer
Dietrich Klakow
161
7
0
19 Sep 2021
Text Detoxification using Large Pre-trained Neural Models
David Dale
Anton Voronov
Daryna Dementieva
V. Logacheva
Olga Kozlova
Nikita Semenov
Sergey Petrakov
298
93
0
18 Sep 2021
Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers
Jason Phang
Haokun Liu
Samuel R. Bowman
244
36
0
17 Sep 2021
SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations
Hooman Sedghamiz
Shivam Raval
Enrico Santus
Tuka Alhanai
M. Ghassemi
SSL
80
19
0
15 Sep 2021
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation
Chenhe Dong
Guangrun Wang
Hang Xu
Jiefeng Peng
Xiaozhe Ren
Xiaodan Liang
177
28
0
15 Sep 2021
ARCH: Efficient Adversarial Regularized Training with Caching
Simiao Zuo
Chen Liang
Haoming Jiang
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
T. Zhao
AAML
172
3
0
15 Sep 2021
Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension
Naoya Inoue
H. Trivedi
Steven K. Sinha
Niranjan Balasubramanian
Kentaro Inui
134
19
0
14 Sep 2021
LM-Critic: Language Models for Unsupervised Grammatical Error Correction
Michihiro Yasunaga
J. Leskovec
Abigail Z. Jacobs
185
54
0
14 Sep 2021
Not All Models Localize Linguistic Knowledge in the Same Place: A Layer-wise Probing on BERToids' Representations
Mohsen Fayyaz
Ehsan Aghazadeh
Ali Modarressi
Hosein Mohebbi
Mohammad Taher Pilehvar
167
22
0
13 Sep 2021
Previous
1
2
3
...
13
14
15
...
17
18
19
Next