Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.02338
Cited By
CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
5 October 2020
Tianlu Wang
Xuezhi Wang
Yao Qin
Ben Packer
Kang Li
Jilin Chen
Alex Beutel
Ed H. Chi
SILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation"
43 / 43 papers shown
Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal
Yang Wang
Chenghao Xiao
Yi Zhou
Stuart E. Middleton
Noura Al Moubayed
C. D. Lin
AAML
303
1
0
29 Jul 2025
Coordinated Robustness Evaluation Framework for Vision-Language Models
Ashwin Ramesh Babu
Sajad Mousavi
Vineet Gundecha
Sahand Ghorbanpour
Avisek Naug
Antonio Guillen
Ricardo Luna Gutierrez
Soumyendu Sarkar
AAML
183
0
0
05 Jun 2025
Model Hemorrhage and the Robustness Limits of Large Language Models
Ziyang Ma
Hui Yuan
Guang Dai
Gui-Song Xia
Bo Du
Liangpei Zhang
Dacheng Tao
317
1
0
31 Mar 2025
Confidence Elicitation: A New Attack Vector for Large Language Models
International Conference on Learning Representations (ICLR), 2025
Brian Formento
Chuan-Sheng Foo
See-Kiong Ng
AAML
583
2
0
07 Feb 2025
Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vatsal Gupta
Pranshu Pandya
Tushar Kataria
Vivek Gupta
Dan Roth
AAML
566
2
0
03 Jan 2025
CERT-ED: Certifiably Robust Text Classification for Edit Distance
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Zhuoqun Huang
Yipeng Wang
Seunghee Shin
Benjamin I. P. Rubinstein
AAML
280
1
0
01 Aug 2024
Automated Adversarial Discovery for Safety Classifiers
Yash Kumar Lal
Preethi Lahoti
Aradhana Sinha
Yao Qin
Ananth Balashankar
290
1
0
24 Jun 2024
LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models
Qin Yang
Meisam Mohammady
Han Wang
Ali Payani
Ashish Kundu
Kai Shu
Yan Yan
Yuan Hong
268
2
0
29 May 2024
Enhance Robustness of Language Models Against Variation Attack through Graph Integration
Ziteng Xiong
Lizhi Qing
Yangyang Kang
Jiawei Liu
Hongsong Li
Changlong Sun
Xiaozhong Liu
Wei Lu
204
2
0
18 Apr 2024
Towards Robust Domain Generation Algorithm Classification
ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2024
Arthur Drichel
Marc Meyer
Ulrike Meyer
AAML
197
4
0
09 Apr 2024
PID Control-Based Self-Healing to Improve the Robustness of Large Language Models
Zhuotong Chen
Zihu Wang
Yifan Yang
Qianxiao Li
Zheng Zhang
AAML
245
3
0
31 Mar 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
Rui Zheng
Yuhao Zhou
Zhiheng Xi
Tao Gui
Tao Gui
Xuanjing Huang
AAML
221
2
0
24 Mar 2024
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Aly M. Kassem
Sherif Saad
AAML
299
3
0
21 Jan 2024
ROIC-DM: Robust Text Inference and Classification via Diffusion Model
Shilong Yuan
Wei Yuan
Hongzhi Yin
Tieke He
DiffM
370
5
0
07 Jan 2024
SenTest: Evaluating Robustness of Sentence Encoders
Tanmay Chavan
Shantanu Patankar
Aditya Kane
Omkar Gokhale
Geetanjali Kale
Raviraj Joshi
199
1
0
29 Nov 2023
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks
Aradhana Sinha
Ananth Balashankar
Ahmad Beirami
Thi Avrahami
Jilin Chen
Alex Beutel
AAML
231
6
0
25 Oct 2023
LEAP: Efficient and Automated Test Method for NLP Software
International Conference on Automated Software Engineering (ASE), 2023
Ming-Ming Xiao
Yan Xiao
Hai Dong
Shunhui Ji
Pengcheng Zhang
AAML
194
14
0
22 Aug 2023
From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Lifan Yuan
Dehan Kong
...
Longtao Huang
H. Xue
Zhiyuan Liu
Maosong Sun
Heng Ji
AAML
ELM
223
6
0
29 May 2023
Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals
Ananth Balashankar
Xuezhi Wang
Yao Qin
Ben Packer
Nithum Thain
Jilin Chen
Ed H. Chi
Alex Beutel
153
1
0
22 May 2023
Consistent Text Categorization using Data Augmentation in e-Commerce
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
G. Horowitz
Stav Yanovsky Daye
Noa Avigdor-Elgrabli
Ariel Raviv
158
6
0
09 May 2023
Masked Language Model Based Textual Adversarial Example Detection
ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2023
Xiaomei Zhang
Zhaoxi Zhang
Qi Zhong
Xufei Zheng
Yanjun Zhang
Shengshan Hu
L. Zhang
AAML
341
8
0
18 Apr 2023
Backdoor Learning for NLP: Recent Advances, Challenges, and Future Research Directions
Marwan Omar
SILM
AAML
208
21
0
14 Feb 2023
Robustness of Learning from Task Instructions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jiasheng Gu
Hongyu Zhao
Hanzi Xu
Liang Nie
Hongyuan Mei
Wenpeng Yin
OOD
390
41
0
07 Dec 2022
Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Fanchao Qi
Longtao Huang
Zhiyuan Liu
Maosong Sun
SILM
220
94
0
19 Oct 2022
Flexible text generation for counterfactual fairness probing
Zee Fryer
Vera Axelrod
Ben Packer
Alex Beutel
Jilin Chen
Kellie Webster
122
22
0
28 Jun 2022
Plug and Play Counterfactual Text Generation for Model Robustness
Nishtha Madaan
Srikanta J. Bedathur
Diptikalyan Saha
184
4
0
21 Jun 2022
ER-Test: Evaluating Explanation Regularization Methods for Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Brihi Joshi
Aaron Chan
Ziyi Liu
Shaoliang Nie
Maziar Sanjabi
Hamed Firooz
Xiang Ren
AAML
375
7
0
25 May 2022
Low Resource Style Transfer via Domain Adaptive Meta Learning
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Xiangyang Li
Xiang Long
Yu Xia
Sujian Li
186
11
0
25 May 2022
Phrase-level Textual Adversarial Attack with Label Preservation
Yibin Lei
Yu Cao
Dianqi Li
Wanrong Zhu
Meng Fang
Mykola Pechenizkiy
AAML
202
28
0
22 May 2022
Exploring the Universal Vulnerability of Prompt-based Learning Paradigm
Lei Xu
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Zhiyuan Liu
SILM
VPVLM
194
88
0
11 Apr 2022
Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Hanjie Chen
Yangfeng Ji
OOD
AAML
VLM
223
26
0
23 Mar 2022
A Survey of Adversarial Defences and Robustness in NLP
Shreyansh Goyal
Sumanth Doddapaneni
Mitesh M.Khapra
B. Ravindran
AAML
490
35
0
12 Mar 2022
Survey on Automated Short Answer Grading with Deep Learning: from Word Embeddings to Transformers
Stefan Haller
Adina Aldea
C. Seifert
N. Strisciuglio
145
56
0
11 Mar 2022
Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
IEEE Access (IEEE Access), 2022
Marwan Omar
Soohyeon Choi
Daehun Nyang
David A. Mohaisen
241
76
0
03 Jan 2022
Measure and Improve Robustness in NLP Models: A Survey
Xuezhi Wang
Haohan Wang
Diyi Yang
535
158
0
15 Dec 2021
Are Vision Transformers Robust to Patch Perturbations?
European Conference on Computer Vision (ECCV), 2021
Jindong Gu
Volker Tresp
Yao Qin
AAML
ViT
245
78
0
20 Nov 2021
Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer
Fanchao Qi
Yangyi Chen
Xurui Zhang
Mukai Li
Zhiyuan Liu
Maosong Sun
AAML
SILM
352
228
0
14 Oct 2021
SEPP: Similarity Estimation of Predicted Probabilities for Defending and Detecting Adversarial Text
Pacific Asia Conference on Language, Information and Computation (PACLIC), 2021
Hoang-Quoc Nguyen-Son
Seira Hidano
Kazuhide Fukushima
S. Kiyomoto
AAML
193
0
0
12 Oct 2021
TREATED:Towards Universal Defense against Textual Adversarial Attacks
Bin Zhu
Zhaoquan Gu
Le Wang
Zhihong Tian
AAML
106
8
0
13 Sep 2021
Multi-granularity Textual Adversarial Attack with Behavior Cloning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yangyi Chen
Jingtong Su
Wei Wei
AAML
125
35
0
09 Sep 2021
Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning
Findings (Findings), 2020
Chenglei Si
Zhengyan Zhang
Fanchao Qi
Zhiyuan Liu
Yasheng Wang
Qun Liu
Maosong Sun
AAML
SILM
272
73
0
31 Dec 2020
Generating Long Financial Report using Conditional Variational Autoencoders with Knowledge Distillation
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2020
Yunpeng Ren
Ziao Wang
Yiyuan Wang
Xiaofeng Zhang
162
13
0
23 Oct 2020
Generating Natural Adversarial Examples
Zhengli Zhao
Dheeru Dua
Sameer Singh
GAN
AAML
580
643
0
31 Oct 2017
1