Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.07328
Cited By
Adversarial Examples for Evaluating Reading Comprehension Systems
23 July 2017
Robin Jia
Percy Liang
AAML
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adversarial Examples for Evaluating Reading Comprehension Systems"
50 / 890 papers shown
Title
Machine Translation Testing via Syntactic Tree Pruning
Quanjun Zhang
Juan Zhai
Chunrong Fang
Jiawei Liu
Dongrui Liu
Haichuan Hu
Qingyu Wang
23
3
0
01 Jan 2024
From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems
Gulsum Yigit
M. Amasyalı
AAML
25
0
0
26 Dec 2023
Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion
Nishtha Madaan
Srikanta J. Bedathur
DiffM
38
0
0
21 Dec 2023
Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off
Yuansan Liu
Ruqing Zhang
Mingkun Zhang
Wei Chen
Maarten de Rijke
J. Guo
Xueqi Cheng
AAML
22
6
0
16 Dec 2023
The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation
Rongwu Xu
Brian S. Lin
Shujian Yang
Tianqi Zhang
Weiyan Shi
Lei Bai
Zhixuan Fang
Wei Xu
Han Qiu
44
51
0
14 Dec 2023
Towards Equipping Transformer with the Ability of Systematic Compositionality
Chen Huang
Peixin Qin
Wenqiang Lei
Jiancheng Lv
27
1
0
12 Dec 2023
METAL: Metamorphic Testing Framework for Analyzing Large-Language Model Qualities
Sangwon Hyun
Mingyu Guo
Muhammad Ali Babar
31
8
0
11 Dec 2023
On the Robustness of Large Multimodal Models Against Image Adversarial Attacks
Xuanimng Cui
Alejandro Aparcedo
Young Kyun Jang
Ser-Nam Lim
AAML
VLM
19
38
0
06 Dec 2023
System 2 Attention (is something you might need too)
Jason Weston
Sainbayar Sukhbaatar
RALM
OffRL
LRM
22
57
0
20 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
31
3
0
16 Nov 2023
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification
Yuhang Zhou
Paiheng Xu
Xiaoyu Liu
Bang An
Wei Ai
Furong Huang
LRM
71
20
0
15 Nov 2023
DALA: A Distribution-Aware LoRA-Based Adversarial Attack against Language Models
Yibo Wang
Xiangjue Dong
James Caverlee
Philip S. Yu
23
2
0
14 Nov 2023
Measuring Adversarial Datasets
Yuanchen Bai
Raoyi Huang
Vijay Viswanathan
Tzu-Sheng Kuo
Tongshuang Wu
39
1
0
06 Nov 2023
Perturbation-based Active Learning for Question Answering
Fan Luo
Mihai Surdeanu
16
0
0
04 Nov 2023
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis
Hongyi Zheng
Abulhair Saparov
AAML
LRM
11
7
0
01 Nov 2023
A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
9
1
0
30 Oct 2023
Poisoning Retrieval Corpora by Injecting Adversarial Passages
Zexuan Zhong
Ziqing Huang
Alexander Wettig
Danqi Chen
AAML
13
59
0
29 Oct 2023
Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering
Sukmin Cho
Jeongyeon Seo
Soyeong Jeong
Jong C. Park
RALM
21
2
0
26 Oct 2023
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks
Aradhana Sinha
Ananth Balashankar
Ahmad Beirami
Thi Avrahami
Jilin Chen
Alex Beutel
AAML
27
4
0
25 Oct 2023
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers
Mosh Levy
Shauli Ravfogel
Yoav Goldberg
38
5
0
24 Oct 2023
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding
Xiao-Yu Guo
Yuan-Fang Li
Gholamreza Haffari
25
5
0
24 Oct 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
Minh Nguyen
Nancy F. Chen
22
0
0
21 Oct 2023
Implications of Annotation Artifacts in Edge Probing Test Datasets
Sagnik Ray Choudhury
Jushaan Kalra
16
0
0
20 Oct 2023
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning
Lucas Weber
Elia Bruni
Dieuwke Hupkes
30
24
0
20 Oct 2023
Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting
Zecheng Tang
Kaiqi Feng
Juntao Li
Min Zhang
28
2
0
20 Oct 2023
No offence, Bert -- I insult only humans! Multiple addressees sentence-level attack on toxicity detection neural network
Sergey Berezin
R. Farahbakhsh
Noel Crespi
11
0
0
19 Oct 2023
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
Xiaodong Yu
Hao Cheng
Xiaodong Liu
Dan Roth
Jianfeng Gao
HILM
AAML
17
15
0
19 Oct 2023
Pseudointelligence: A Unifying Framework for Language Model Evaluation
Shikhar Murty
Orr Paradise
Pratyusha Sharma
13
0
0
18 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
147
146
0
16 Oct 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
Linyang Li
Ke Ren
Yunfan Shao
Pengyu Wang
Xipeng Qiu
15
4
0
13 Oct 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
Yue Zhang
Leyang Cui
Enbo Zhao
Wei Bi
Shuming Shi
38
6
0
11 Oct 2023
Low-Resource Languages Jailbreak GPT-4
Zheng-Xin Yong
Cristina Menghini
Stephen H. Bach
SILM
31
170
0
03 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALM
LRM
19
180
0
02 Oct 2023
The Trickle-down Impact of Reward (In-)consistency on RLHF
Lingfeng Shen
Sihao Chen
Linfeng Song
Lifeng Jin
Baolin Peng
Haitao Mi
Daniel Khashabi
Dong Yu
32
21
0
28 Sep 2023
On the Relationship between Skill Neurons and Robustness in Prompt Tuning
Leon Ackermann
Xenia Ohmer
AAML
21
0
0
21 Sep 2023
Inferring Capabilities from Task Performance with Bayesian Triangulation
John Burden
Konstantinos Voudouris
Ryan Burnell
Danaja Rutar
Lucy G. Cheke
José Hernández-Orallo
25
7
0
21 Sep 2023
Model Leeching: An Extraction Attack Targeting LLMs
Lewis Birch
William Hackett
Stefan Trawicki
N. Suri
Peter Garraghan
24
13
0
19 Sep 2023
Context-aware Adversarial Attack on Named Entity Recognition
Shuguang Chen
Leonardo Neves
Thamar Solorio
AAML
48
0
0
16 Sep 2023
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration
Rachneet Sachdeva
Martin Tutek
Iryna Gurevych
OODD
27
10
0
14 Sep 2023
AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions
Son Quoc Tran
Gia-Huy Do
Phong Nguyen-Thuan Do
Matt Kretchmar
Xinya Du
21
0
0
10 Sep 2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Yanrui Du
Sendong Zhao
Yuhan Chen
Rai Bai
Jing Liu
Huaqin Wu
Haifeng Wang
Bing Qin
42
2
0
08 Sep 2023
Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Raz Lapid
Ron Langberg
Moshe Sipper
AAML
16
103
0
04 Sep 2023
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content
Charles OÑeill
Jack Miller
I. Ciucă
Y. Ting 丁
Thang Bui
25
3
0
26 Aug 2023
Adversarial Illusions in Multi-Modal Embeddings
Tingwei Zhang
Rishi Jha
Eugene Bagdasaryan
Vitaly Shmatikov
AAML
34
8
0
22 Aug 2023
On the Adversarial Robustness of Multi-Modal Foundation Models
Christian Schlarmann
Matthias Hein
AAML
116
85
0
21 Aug 2023
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection
Zekun Li
Baolin Peng
Pengcheng He
Xifeng Yan
ELM
SILM
AAML
41
23
0
17 Aug 2023
Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models
Yugeng Liu
Tianshuo Cong
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
AAML
41
6
0
15 Aug 2023
Automated Testing and Improvement of Named Entity Recognition Systems
Boxi Yu
Yi-Nuo Hu
Qiuyang Mang
Wen-Ying Hu
Pinjia He
23
6
0
14 Aug 2023
Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias
Son Quoc Tran
Matt Kretchmar
19
0
0
08 Aug 2023
Universal and Transferable Adversarial Attacks on Aligned Language Models
Andy Zou
Zifan Wang
Nicholas Carlini
Milad Nasr
J. Zico Kolter
Matt Fredrikson
89
1,251
0
27 Jul 2023
Previous
1
2
3
4
5
6
...
16
17
18
Next