Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11932
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment
27 July 2019
Di Jin
Zhijing Jin
Qiufeng Wang
Peter Szolovits
SILM
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Github (511★)
Papers citing
"Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment"
50 / 567 papers shown
Title
Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges
Jonas Becker
Jan Philip Wahle
Bela Gipp
Terry Ruas
120
11
0
24 May 2024
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models
Yimeng Zhang
Xin Chen
Jinghan Jia
Yihua Zhang
Chongyu Fan
Jiancheng Liu
Mingyi Hong
Ke Ding
Sijia Liu
DiffM
116
68
0
24 May 2024
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models
Yiming Chen
Chen Zhang
Danqing Luo
L. F. D’Haro
R. Tan
Haizhou Li
AAML
ELM
84
3
0
23 May 2024
Data Contamination Calibration for Black-box LLMs
Wen-song Ye
Jiaqi Hu
Liyao Li
Haobo Wang
Gang Chen
Junbo Zhao
64
9
0
20 May 2024
A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers
Tom Roth
Inigo Jauregi Unanue
A. Abuadbba
Massimo Piccardi
AAML
SILM
55
1
0
20 May 2024
UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual Checkers
Duo Peng
Qi Ke
Jun Liu
83
4
0
18 May 2024
Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting
Xinzhe Li
Ming Liu
87
0
0
17 May 2024
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
165
27
0
15 May 2024
Revisiting character-level adversarial attacks
Elias Abad Rocamora
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
Volkan Cevher
AAML
96
4
0
07 May 2024
On Adversarial Examples for Text Classification by Perturbing Latent Representations
Korn Sooksatra
Bikram Khanal
Pablo Rivas
SILM
AAML
62
3
0
06 May 2024
CEval: A Benchmark for Evaluating Counterfactual Text Generation
Van Bach Nguyen
Jorg Schlotterer
Christin Seifert
99
7
0
26 Apr 2024
Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations
Sukmin Cho
Soyeong Jeong
Jeongyeon Seo
Taeho Hwang
Jong C. Park
SILM
AAML
107
33
0
22 Apr 2024
Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
Jiabao Ji
Bairu Hou
Zhen Zhang
Guanhua Zhang
Wenqi Fan
Qing Li
Yang Zhang
Gaowen Liu
Sijia Liu
Shiyu Chang
AAML
73
8
0
18 Apr 2024
GenFighter: A Generative and Evolutive Textual Attack Removal
Md Athikul Islam
Edoardo Serra
Sushil Jajodia
AAML
36
0
0
17 Apr 2024
Explainable Generative AI (GenXAI): A Survey, Conceptualization, and Research Agenda
Johannes Schneider
131
35
0
15 Apr 2024
VertAttack: Taking advantage of Text Classifiers' horizontal vision
Jonathan Rusert
AAML
105
1
0
12 Apr 2024
Towards Robust Domain Generation Algorithm Classification
Arthur Drichel
Marc Meyer
Ulrike Meyer
AAML
67
3
0
09 Apr 2024
Goal-guided Generative Prompt Injection Attack on Large Language Models
Chong Zhang
Mingyu Jin
Qinkai Yu
Chengzhi Liu
Haochen Xue
Xiaobo Jin
AAML
SILM
94
16
0
06 Apr 2024
Adversarial Attacks and Dimensionality in Text Classifiers
Nandish Chattopadhyay
Atreya Goswami
Anupam Chattopadhyay
SILM
AAML
50
1
0
03 Apr 2024
READ: Improving Relation Extraction from an ADversarial Perspective
Dawei Li
William Hogan
Jingbo Shang
AAML
88
0
0
02 Apr 2024
Machine Learning Robustness: A Primer
Houssem Ben Braiek
Foutse Khomh
AAML
OOD
103
8
0
01 Apr 2024
PID Control-Based Self-Healing to Improve the Robustness of Large Language Models
Zhuotong Chen
Zihu Wang
Yifan Yang
Qianxiao Li
Zheng Zhang
AAML
84
1
0
31 Mar 2024
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks
Brian Formento
Wenjie Feng
Chuan-Sheng Foo
Anh Tuan Luu
See-Kiong Ng
AAML
101
7
0
27 Mar 2024
Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence
Hsiu-Wei Yang
Abhinav Agrawal
Pavlos Fragkogiannis
Shubham Nitin Mulay
79
1
0
27 Mar 2024
Targeted Visualization of the Backbone of Encoder LLMs
Isaac Roberts
Alexander Schulz
L. Hermes
Barbara Hammer
49
0
0
26 Mar 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
Rui Zheng
Yuhao Zhou
Zhiheng Xi
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
77
0
0
24 Mar 2024
Monotonic Paraphrasing Improves Generalization of Language Model Prompting
Qin Liu
Fei Wang
Nan Xu
Tianyi Yan
Tao Meng
Muhao Chen
LRM
89
8
0
24 Mar 2024
Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation
Kyohoon Jin
Junho Lee
Juhwan Choi
Sangmin Song
Youngbin Kim
69
0
0
22 Mar 2024
Reversible Jump Attack to Textual Classifiers with Modification Reduction
Mingze Ni
Zhensu Sun
Wei Liu
AAML
56
0
0
21 Mar 2024
Don't be a Fool: Pooling Strategies in Offensive Language Detection from User-Intended Adversarial Attacks
Seunguk Yu
Juhwan Choi
Youngbin Kim
AAML
51
0
0
20 Mar 2024
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
J. Asl
Mohammad H. Rafiei
Manar Alohaly
Daniel Takabi
AAML
SILM
33
3
0
18 Mar 2024
A Modified Word Saliency-Based Adversarial Attack on Text Classification Models
Hetvi Waghela
Sneha Rakshit
Jaydip Sen
AAML
68
7
0
17 Mar 2024
RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning
J. Asl
Prajwal Panzade
Eduardo Blanco
Daniel Takabi
Zhipeng Cai
SSL
41
2
0
17 Mar 2024
Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction
Jiyuan Fu
Zhaoyu Chen
Kaixun Jiang
Haijing Guo
Jiafeng Wang
Shuyong Gao
Wenqiang Zhang
VLM
AAML
81
4
0
16 Mar 2024
Introducing Adaptive Continuous Adversarial Training (ACAT) to Enhance ML Robustness
Mohamed el Shehaby
Aditya Kotha
Ashraf Matrawy
AAML
72
0
0
15 Mar 2024
Token Alignment via Character Matching for Subword Completion
Ben Athiwaratkun
Shiqi Wang
Mingyue Shang
Yuchen Tian
Zijian Wang
Sujan Kumar Gonugondla
Sanjay Krishna Gouda
Rob Kwiatowski
Ramesh Nallapati
Bing Xiang
94
6
0
13 Mar 2024
The Impact of Quantization on the Robustness of Transformer-based Text Classifiers
Seyed Parsa Neshaei
Yasaman Boreshban
Gholamreza Ghassem-Sani
Seyed Abolghasem Mirroshandel
MQ
58
0
0
08 Mar 2024
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Qintong Li
Leyang Cui
Xueliang Zhao
Lingpeng Kong
Wei Bi
LRM
107
62
0
29 Feb 2024
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Mahdi Karami
Ali Ghodsi
VLM
112
6
0
28 Feb 2024
Unveiling Vulnerability of Self-Attention
Khai Jiet Liong
Hongqiu Wu
Haizhen Zhao
64
0
0
26 Feb 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Hai-Tao Zheng
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAML
ELM
81
4
0
25 Feb 2024
PIDformer: Transformer Meets Control Theory
Tam Nguyen
César A. Uribe
Tan-Minh Nguyen
Richard G. Baraniuk
133
9
0
25 Feb 2024
Prompt Perturbation Consistency Learning for Robust Language Models
Yao Qiang
Subhrangshu Nandi
Ninareh Mehrabi
Greg Ver Steeg
Anoop Kumar
Anna Rumshisky
Aram Galstyan
135
10
0
24 Feb 2024
On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Yihao Zhang
Hangzhou He
Jingyu Zhu
Huanran Chen
Yifei Wang
Zeming Wei
AAML
120
15
0
23 Feb 2024
RITFIS: Robust input testing framework for LLMs-based intelligent software
Ming-Ming Xiao
Yan Xiao
Hai Dong
Shunhui Ji
Pengcheng Zhang
AAML
90
5
0
21 Feb 2024
Investigating the Impact of Model Instability on Explanations and Uncertainty
Sara Vera Marjanović
Isabelle Augenstein
Christina Lioma
AAML
80
0
0
20 Feb 2024
Groot: Adversarial Testing for Generative Text-to-Image Models with Tree-based Semantic Transformation
Yi Liu
Guowei Yang
Gelei Deng
Feiyue Chen
Yuqi Chen
Ling Shi
Tianwei Zhang
Yang Liu
VLM
48
9
0
19 Feb 2024
Stealthy Attack on Large Language Model based Recommendation
Jinghao Zhang
Yuting Liu
Qiang Liu
Shu Wu
Guibing Guo
Liang Wang
85
14
0
18 Feb 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
113
20
0
18 Feb 2024
A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models
Cuong Dang
Dung D. Le
Thai Le
AAML
61
2
0
18 Feb 2024
Previous
1
2
3
4
5
6
...
10
11
12
Next