2012.14956
Generating Natural Language Attacks in a Hard Label Black Box Setting
29 December 2020
Rishabh Maheshwary
Saket Maheshwary
Vikram Pudi
AAML
Papers citing
"Generating Natural Language Attacks in a Hard Label Black Box Setting"
41 / 41 papers shown
Title
Confidence Elicitation: A New Attack Vector for Large Language Models
Brian Formento
Chuan-Sheng Foo
See-Kiong Ng
AAML
99
0
0
07 Feb 2025
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
Yang Wang
Chenghua Lin
ELM
35
0
0
05 Jan 2025
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh
César Descalzo
Ljiljana Dolamic
P. Frossard
AAML
67
0
0
19 Nov 2024
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks
Brian Formento
Wenjie Feng
Chuan-Sheng Foo
Anh Tuan Luu
See-Kiong Ng
AAML
32
6
0
27 Mar 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
Rui Zheng
Yuhao Zhou
Zhiheng Xi
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
35
0
0
24 Mar 2024
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
J. Asl
Mohammad H. Rafiei
Manar Alohaly
Daniel Takabi
AAML
SILM
25
3
0
18 Mar 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Yinghui Li
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAML
ELM
24
5
0
25 Feb 2024
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
Han Liu
Zhi Xu
Xiaotong Zhang
Feng Zhang
Fenglong Ma
Hongyang Chen
Hong Yu
Xianchao Zhang
AAML
14
7
0
02 Feb 2024
Fooling the Textual Fooler via Randomizing Latent Representations
Duy C. Hoang
Quang H. Nguyen
Saurav Manchanda
MinLong Peng
Kok-Seng Wong
Khoa D. Doan
SILM
AAML
15
0
0
02 Oct 2023
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
Bochuan Cao
Yu Cao
Lu Lin
Jinghui Chen
AAML
28
133
0
18 Sep 2023
A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation
Sahar Sadrizadeh
Ljiljana Dolamic
P. Frossard
AAML
SILM
29
2
0
29 Aug 2023
LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack
HaiXiang Zhu
Zhaoqing Yang
Weiwei Shang
Yuren Wu
AAML
FAtt
10
3
0
01 Aug 2023
Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks
Xinyu Zhang
Hanbin Hong
Yuan Hong
Peng Huang
Binghui Wang
Zhongjie Ba
Kui Ren
SILM
29
18
0
31 Jul 2023
On Evaluating Adversarial Robustness of Large Vision-Language Models
Yunqing Zhao
Tianyu Pang
Chao Du
Xiao Yang
Chongxuan Li
Ngai-man Cheung
Min-Bin Lin
VLM
AAML
MLLM
19
166
0
26 May 2023
Masked Language Model Based Textual Adversarial Example Detection
Xiaomei Zhang
Zhaoxi Zhang
Qi Zhong
Xufei Zheng
Yanjun Zhang
Shengshan Hu
L. Zhang
AAML
26
2
0
18 Apr 2023
AdvCat: Domain-Agnostic Robustness Assessment for Cybersecurity-Critical Applications with Categorical Inputs
Helene Orsini
Hongyan Bao
Yujun Zhou
Xiangrui Xu
Yufei Han
Longyang Yi
Wei Wang
Xin Gao
Xiangliang Zhang
AAML
21
1
0
13 Dec 2022
On the Security Vulnerabilities of Text-to-SQL Models
Xutan Peng
Yipeng Zhang
Jingfeng Yang
Mark Stevenson
SILM
23
10
0
28 Nov 2022
Universal Evasion Attacks on Summarization Scoring
Wenchuan Mu
Kwan Hui Lim
AAML
30
1
0
25 Oct 2022
Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Fanchao Qi
Longtao Huang
Zhiyuan Liu
Maosong Sun
SILM
12
45
0
19 Oct 2022
Rethinking Textual Adversarial Defense for Pre-trained Language Models
Jiayi Wang
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
AAML
SILM
15
11
0
21 Jul 2022
RAF: Recursive Adversarial Attacks on Face Recognition Using Extremely Limited Queries
Keshav Kasichainula
Hadi Mansourifar
W. Shi
AAML
21
1
0
04 Jul 2022
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
Vivek Kumar
Rishabh Maheshwary
Vikram Pudi
AIMat
20
14
0
30 Apr 2022
Understanding, Detecting, and Separating Out-of-Distribution Samples and Adversarial Samples in Text Classification
Cheng-Han Chiang
Hung-yi Lee
OODD
23
1
0
09 Apr 2022
Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language Model
Jiayi Wang
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
AAML
19
4
0
19 Mar 2022
Robust Textual Embedding against Word-level Adversarial Attacks
Yichen Yang
Xiaosen Wang
Kun He
AAML
14
16
0
28 Feb 2022
Threats to Pre-trained Language Models: Survey and Taxonomy
Shangwei Guo
Chunlong Xie
Jiwei Li
Lingjuan Lyu
Tianwei Zhang
PILM
27
29
0
14 Feb 2022
TextHacker: Learning based Hybrid Local Search Algorithm for Text Hard-label Adversarial Attack
Zhen Yu
Xiaosen Wang
Wanxiang Che
Kun He
AAML
23
14
0
20 Jan 2022
Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
Marwan Omar
Soohyeon Choi
Daehun Nyang
David A. Mohaisen
24
57
0
03 Jan 2022
Effective and Imperceptible Adversarial Textual Attack via Multi-objectivization
Shengcai Liu
Ning Lu
W. Hong
Chao Qian
Ke Tang
AAML
14
14
0
02 Nov 2021
Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework
Lifan Yuan
Yichi Zhang
Yangyi Chen
Wei Wei
AAML
17
32
0
28 Oct 2021
Adversarial Examples for Evaluating Math Word Problem Solvers
Vivek Kumar
Rishabh Maheshwary
Vikram Pudi
AAML
22
32
0
13 Sep 2021
Detecting Textual Adversarial Examples through Randomized Substitution and Vote
Xiaosen Wang
Yifeng Xiong
Kun He
AAML
17
11
0
13 Sep 2021
A Strong Baseline for Query Efficient Attacks in a Black Box Setting
Rishabh Maheshwary
Saket Maheshwary
Vikram Pudi
AAML
24
30
0
10 Sep 2021
Multi-granularity Textual Adversarial Attack with Behavior Cloning
Yangyi Chen
Jingtong Su
Wei Wei
AAML
17
32
0
09 Sep 2021
Efficient Combinatorial Optimization for Word-level Adversarial Textual Attack
Shengcai Liu
Ning Lu
Cheng Chen
Ke Tang
AAML
10
31
0
06 Sep 2021
Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
Jiehang Zeng
Xiaoqing Zheng
Jianhan Xu
Linyang Li
Liping Yuan
Xuanjing Huang
AAML
13
66
0
08 May 2021
Gradient-based Adversarial Attacks against Text Transformers
Chuan Guo
Alexandre Sablayrolles
Hervé Jégou
Douwe Kiela
SILM
98
227
0
15 Apr 2021
Token-Modification Adversarial Attacks for Natural Language Processing: A Survey
Tom Roth
Yansong Gao
A. Abuadbba
Surya Nepal
Wei Liu
AAML
23
12
0
01 Mar 2021
Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks
Huiying Li
Shawn Shan
Emily Wenger
Jiayun Zhang
Haitao Zheng
Ben Y. Zhao
AAML
18
42
0
24 Jun 2020
Generating Natural Language Adversarial Examples
M. Alzantot
Yash Sharma
Ahmed Elgohary
Bo-Jhang Ho
Mani B. Srivastava
Kai-Wei Chang
AAML
245
914
0
21 Apr 2018
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
255
13,364
0
25 Aug 2014