Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.12471
Cited By
Neural Network Acceptability Judgments
31 May 2018
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Network Acceptability Judgments"
50 / 878 papers shown
Title
AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes
Juhwan Choi
Kyohoon Jin
Junho Lee
Sangmin Song
Youngbin Kim
22
1
0
08 Feb 2024
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
Michael Zhang
Kush S. Bhatia
Hermann Kumbong
Christopher Ré
27
47
0
06 Feb 2024
Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes
Yingyi Chen
Qinghua Tao
F. Tonin
Johan A. K. Suykens
22
1
0
02 Feb 2024
SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing
Sheng R. Li
Geng Yuan
Yuezhen Dai
Youtao Zhang
Yanzhi Wang
Xulong Tang
31
18
0
30 Jan 2024
EdgeOL: Efficient in-situ Online Learning on Edge Devices
Sheng R. Li
Geng Yuan
Yawen Wu
Yuezhen Dai
Chao Wu
Alex K. Jones
Jingtong Hu
Yanzhi Wang
Xulong Tang
35
1
0
30 Jan 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
49
23
0
27 Jan 2024
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy
Yongkang Liu
Yiqun Zhang
Qian Li
Tong Liu
Shi Feng
Daling Wang
Yifei Zhang
Hinrich Schütze
38
6
0
26 Jan 2024
Instructional Fingerprinting of Large Language Models
Jiashu Xu
Fei Wang
Mingyu Derek Ma
Pang Wei Koh
Chaowei Xiao
Muhao Chen
WaLM
22
29
0
21 Jan 2024
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Aly M. Kassem
Sherif Saad
AAML
25
1
0
21 Jan 2024
Quantum Transfer Learning for Acceptability Judgements
Giuseppe Buonaiuto
Raffaele Guarasci
Aniello Minutolo
G. De Pietro
M. Esposito
29
7
0
15 Jan 2024
Model Editing at Scale leads to Gradual and Catastrophic Forgetting
Akshat Gupta
Anurag Rao
Gopala Anumanchipalli
KELM
CLL
20
48
0
15 Jan 2024
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models
Zhengxin Zhang
Dan Zhao
Xupeng Miao
Gabriele Oliaro
Qing Li
Yong-jia Jiang
Zhihao Jia
MQ
36
7
0
13 Jan 2024
The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance
A. Salinas
Fred Morstatter
39
49
0
08 Jan 2024
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining
Jacob P. Portes
Alex Trott
Sam Havens
Daniel King
Abhinav Venigalla
Moin Nadeem
Nikhil Sardana
D. Khudia
Jonathan Frankle
18
16
0
29 Dec 2023
Can persistent homology whiten Transformer-based black-box models? A case study on BERT compression
Luis Balderas
Miguel Lastra
José M. Benítez
24
1
0
17 Dec 2023
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets
Dirk Groeneveld
Anas Awadalla
Iz Beltagy
Akshita Bhagia
Ian H. Magnusson
Hao Peng
Oyvind Tafjord
Pete Walsh
Kyle Richardson
Jesse Dodge
114
1
0
15 Dec 2023
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Collin Burns
Pavel Izmailov
Jan Hendrik Kirchner
Bowen Baker
Leo Gao
...
Adrien Ecoffet
Manas Joglekar
Jan Leike
Ilya Sutskever
Jeff Wu
ELM
39
258
0
14 Dec 2023
GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction
Jiacheng Ruan
Jingsheng Gao
Mingye Xie
Suncheng Xiang
Zefang Yu
Ting Liu
Yuzhuo Fu
MoE
50
4
0
12 Dec 2023
Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks
Mohammad-Javad Davari
Eugene Belilovsky
MoMe
40
56
0
11 Dec 2023
GTA: Gated Toxicity Avoidance for LM Performance Preservation
Heegyu Kim
Hyunsouk Cho
19
1
0
11 Dec 2023
Beyond Gradient and Priors in Privacy Attacks: Leveraging Pooler Layer Inputs of Language Models in Federated Learning
Jianwei Li
Sheng Liu
Qi Lei
PILM
SILM
AAML
25
4
0
10 Dec 2023
Graph Convolutions Enrich the Self-Attention in Transformers!
Jeongwhan Choi
Hyowon Wi
Jayoung Kim
Yehjin Shin
Kookjin Lee
Nathaniel Trask
Noseong Park
27
4
0
07 Dec 2023
LayerCollapse: Adaptive compression of neural networks
Soheil Zibakhsh Shabgahi
Mohammad Soheil Shariff
F. Koushanfar
AI4CE
18
1
0
29 Nov 2023
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification
Daryna Dementieva
Daniil Moskovskiy
David Dale
Alexander Panchenko
36
16
0
23 Nov 2023
Sparse Low-rank Adaptation of Pre-trained Language Models
Ning Ding
Xingtai Lv
Qiaosen Wang
Yulin Chen
Bowen Zhou
Zhiyuan Liu
Maosong Sun
22
55
0
20 Nov 2023
Tensor-Aware Energy Accounting
Timur Babakol
Yu David Liu
16
3
0
19 Nov 2023
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text
Yanzhu Guo
Guokan Shang
Michalis Vazirgiannis
Chloé Clavel
31
48
0
16 Nov 2023
GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks
Shivanshu Gupta
Clemens Rosenbaum
Ethan R. Elenberg
LRM
32
6
0
16 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei Huang
Junyu Mao
Rui Wang
Hai Hu
22
3
0
15 Nov 2023
DALA: A Distribution-Aware LoRA-Based Adversarial Attack against Language Models
Yibo Wang
Xiangjue Dong
James Caverlee
Philip S. Yu
23
2
0
14 Nov 2023
How Well Do Large Language Models Understand Syntax? An Evaluation by Asking Natural Language Questions
Houquan Zhou
Yang Hou
Zhenghua Li
Xuebin Wang
Zhefeng Wang
Xinyu Duan
Min Zhang
ELM
8
5
0
14 Nov 2023
How are Prompts Different in Terms of Sensitivity?
Sheng Lu
Hendrik Schuff
Iryna Gurevych
34
18
0
13 Nov 2023
STEER: Unified Style Transfer with Expert Reinforcement
Skyler Hallinan
Faeze Brahman
Ximing Lu
Jaehun Jung
Sean Welleck
Yejin Choi
OffRL
13
14
0
13 Nov 2023
Mirror: A Universal Framework for Various Information Extraction Tasks
Tong Zhu
Junfei Ren
Zijian Yu
Mengsong Wu
Guoliang Zhang
Xiaoye Qu
Wenliang Chen
Zhefeng Wang
Baoxing Huai
Min Zhang
29
14
0
09 Nov 2023
Large GPT-like Models are Bad Babies: A Closer Look at the Relationship between Linguistic Competence and Psycholinguistic Measures
Julius Steuer
Marius Mosbach
Dietrich Klakow
22
10
0
08 Nov 2023
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
Le Yu
Yu Bowen
Haiyang Yu
Fei Huang
Yongbin Li
MoMe
28
272
0
06 Nov 2023
Not all layers are equally as important: Every Layer Counts BERT
Lucas Georges Gabriel Charpentier
David Samuel
20
15
0
03 Nov 2023
Ling-CL: Understanding NLP Models through Linguistic Curricula
Mohamed Elgaar
Hadi Amiri
21
2
0
31 Oct 2023
Evaluating Neural Language Models as Cognitive Models of Language Acquisition
Héctor Javier Vázquez Martínez
Annika Lea Heuser
Charles D. Yang
Jordan Kodner
17
8
0
31 Oct 2023
Mean BERTs make erratic language teachers: the effectiveness of latent bootstrapping in low-resource settings
David Samuel
16
2
0
30 Oct 2023
Outlier Dimensions Encode Task-Specific Knowledge
William Rudman
Catherine Chen
Carsten Eickhoff
11
4
0
26 Oct 2023
torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP
Yoshitomo Matsubara
VLM
26
1
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
19
1
0
26 Oct 2023
How well can machine-generated texts be identified and can language models be trained to avoid identification?
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
23
1
0
25 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
R. Wang
Rui Yan
19
4
0
24 Oct 2023
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization
Tianshi Che
Ji Liu
Yang Zhou
Jiaxiang Ren
Jiwen Zhou
Victor S. Sheng
H. Dai
Dejing Dou
25
50
0
23 Oct 2023
Statistical Depth for Ranking and Characterizing Transformer-Based Text Embeddings
Parker Seegmiller
S. Preum
31
3
0
23 Oct 2023
Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives
Mario Giulianelli
Sarenne Wallbridge
Raquel Fernández
25
13
0
20 Oct 2023
Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models
Miaoxi Zhu
Qihuang Zhong
Li Shen
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
MQ
VLM
29
1
0
20 Oct 2023
Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection
Jianwei Li
Weizhi Gao
Qi Lei
Dongkuan Xu
22
2
0
19 Oct 2023
Previous
1
2
3
4
5
6
...
16
17
18
Next