Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1707.07328
Cited By
Adversarial Examples for Evaluating Reading Comprehension Systems
23 July 2017
Robin Jia
Abigail Z. Jacobs
AAML
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adversarial Examples for Evaluating Reading Comprehension Systems"
50 / 926 papers shown
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongyi Zheng
Abulhair Saparov
AAML
LRM
257
11
0
01 Nov 2023
A Lightweight Method to Generate Unanswerable Questions in English
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
206
2
0
30 Oct 2023
Poisoning Retrieval Corpora by Injecting Adversarial Passages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zexuan Zhong
Ziqing Huang
Alexander Wettig
Danqi Chen
AAML
297
111
0
29 Oct 2023
Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sukmin Cho
Jeongyeon Seo
Soyeong Jeong
Jong C. Park
RALM
291
2
0
26 Oct 2023
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks
Aradhana Sinha
Ananth Balashankar
Ahmad Beirami
Thi Avrahami
Jilin Chen
Alex Beutel
AAML
231
6
0
25 Oct 2023
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mosh Levy
Haiqin Yang
Yoav Goldberg
277
10
0
24 Oct 2023
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiao-Yu Guo
Yuan-Fang Li
Gholamreza Haffari
236
7
0
24 Oct 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Minh Nguyen
Nancy F. Chen
241
0
0
21 Oct 2023
Implications of Annotation Artifacts in Edge Probing Test Datasets
Conference on Computational Natural Language Learning (CoNLL), 2023
Sagnik Ray Choudhury
Jushaan Kalra
140
1
0
20 Oct 2023
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning
Lucas Weber
Elia Bruni
Dieuwke Hupkes
276
35
0
20 Oct 2023
Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting
Zecheng Tang
Kaiqi Feng
Juntao Li
Min Zhang
243
2
0
20 Oct 2023
No offence, Bert -- I insult only humans! Multiple addressees sentence-level attack on toxicity detection neural network
Sergey Berezin
R. Farahbakhsh
Noel Crespi
99
0
0
19 Oct 2023
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
Xiaodong Yu
Hao Cheng
Xiaodong Liu
Dan Roth
Jianfeng Gao
HILM
AAML
219
30
0
19 Oct 2023
Pseudointelligence: A Unifying Framework for Language Model Evaluation
Shikhar Murty
Orr Paradise
Pratyusha Sharma
145
0
0
18 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
461
228
0
16 Oct 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Linyang Li
Ke Ren
Yunfan Shao
Pengyu Wang
Xipeng Qiu
176
7
0
13 Oct 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yue Zhang
Leyang Cui
Enbo Zhao
Wei Bi
Shuming Shi
256
7
0
11 Oct 2023
Low-Resource Languages Jailbreak GPT-4
Zheng-Xin Yong
Cristina Menghini
Stephen H. Bach
SILM
436
267
0
03 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
International Conference on Learning Representations (ICLR), 2023
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALM
LRM
512
303
0
02 Oct 2023
The Trickle-down Impact of Reward (In-)consistency on RLHF
Lingfeng Shen
Sihao Chen
Linfeng Song
Lifeng Jin
Baolin Peng
Haitao Mi
Daniel Khashabi
Dong Yu
251
28
0
28 Sep 2023
On the Relationship between Skill Neurons and Robustness in Prompt Tuning
International Conference on Language Resources and Evaluation (LREC), 2023
Leon Ackermann
Xenia Ohmer
AAML
164
0
0
21 Sep 2023
Inferring Capabilities from Task Performance with Bayesian Triangulation
John Burden
Konstantinos Voudouris
Ryan Burnell
Danaja Rutar
Lucy G. Cheke
José Hernández-Orallo
166
10
0
21 Sep 2023
Model Leeching: An Extraction Attack Targeting LLMs
Lewis Birch
William Hackett
Stefan Trawicki
N. Suri
Peter Garraghan
189
25
0
19 Sep 2023
Context-aware Adversarial Attack on Named Entity Recognition
Shuguang Chen
Leonardo Neves
Thamar Solorio
AAML
237
0
0
16 Sep 2023
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Rachneet Sachdeva
Martin Tutek
Iryna Gurevych
OODD
312
16
0
14 Sep 2023
AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions
Son Quoc Tran
Gia-Huy Do
Phong Nguyen-Thuan Do
Matt Kretchmar
Xinya Du
289
0
0
10 Sep 2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Yanrui Du
Sendong Zhao
Yuhan Chen
Rai Bai
Jing Liu
Huaqin Wu
Haifeng Wang
Bing Qin
209
2
0
08 Sep 2023
Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Applied Sciences (Appl. Sci.), 2023
Raz Lapid
Ron Langberg
Moshe Sipper
AAML
343
150
0
04 Sep 2023
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content
Charles OÑeill
Jack Miller
I. Ciucă
Y. Ting 丁
Thang Bui
183
10
0
26 Aug 2023
Adversarial Illusions in Multi-Modal Embeddings
USENIX Security Symposium (USENIX Security), 2023
Tingwei Zhang
Rishi Jha
Eugene Bagdasaryan
Vitaly Shmatikov
AAML
790
27
0
22 Aug 2023
On the Adversarial Robustness of Multi-Modal Foundation Models
Christian Schlarmann
Matthias Hein
AAML
376
139
0
21 Aug 2023
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zekun Li
Baolin Peng
Pengcheng He
Xifeng Yan
ELM
SILM
AAML
272
47
0
17 Aug 2023
Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models
Yugeng Liu
Tianshuo Cong
Subrat Kishore Dutta
Michael Backes
Yun Shen
Yang Zhang
AAML
249
10
0
15 Aug 2023
Automated Testing and Improvement of Named Entity Recognition Systems
Boxi Yu
Yi-Nuo Hu
Qiuyang Mang
Wen-Ying Hu
Pinjia He
229
11
0
14 Aug 2023
Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias
Son Quoc Tran
Matt Kretchmar
273
0
0
08 Aug 2023
Universal and Transferable Adversarial Attacks on Aligned Language Models
Andy Zou
Zifan Wang
Nicholas Carlini
Milad Nasr
J. Zico Kolter
Matt Fredrikson
623
2,304
0
27 Jul 2023
Explaining Math Word Problem Solvers
International Conference on Natural Language Processing and Information Retrieval (ICNLPIR), 2022
Abby Newcomb
Jugal Kalita
124
1
0
24 Jul 2023
Gradient-Based Word Substitution for Obstinate Adversarial Examples Generation in Language Models
Yimu Wang
Peng Shi
Hongyang Zhang
SILM
173
4
0
24 Jul 2023
NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zióu Zheng
Xiao-Dan Zhu
AAML
LRM
282
6
0
06 Jul 2023
Evade ChatGPT Detectors via A Single Space
Shuyang Cai
Wanyun Cui
DeLMO
221
23
0
05 Jul 2023
SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification
J. Wu
Dit-Yan Yeung
SILM
275
0
0
04 Jul 2023
Analyzing Multiple-Choice Reading and Listening Comprehension Tests
Vatsal Raina
Adian Liusie
Mark Gales
ELM
217
4
0
03 Jul 2023
Evaluating Paraphrastic Robustness in Textual Entailment Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Dhruv Verma
Yash Kumar Lal
Shreyashee Sinha
Benjamin Van Durme
Adam Poliak
285
7
0
29 Jun 2023
A Survey on Out-of-Distribution Evaluation of Neural NLP Models
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Xinzhe Li
Ming Liu
Shang Gao
Wray Buntine
222
24
0
27 Jun 2023
Are aligned neural networks adversarially aligned?
Neural Information Processing Systems (NeurIPS), 2023
Nicholas Carlini
Milad Nasr
Christopher A. Choquette-Choo
Matthew Jagielski
Irena Gao
...
Pang Wei Koh
Daphne Ippolito
Katherine Lee
Florian Tramèr
Ludwig Schmidt
AAML
286
313
0
26 Jun 2023
Which Spurious Correlations Impact Reasoning in NLI Models? A Visual Interactive Diagnosis through Data-Constrained Counterfactuals
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Robin Shing Moon Chan
Afra Amini
Mennatallah El-Assady
LRM
AAML
232
4
0
21 Jun 2023
Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation Extraction
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Haotian Chen
Bingsheng Chen
Xiangdong Zhou
254
8
0
20 Jun 2023
Jamp: Controlled Japanese Temporal Inference Dataset for Evaluating Generalization Capacity of Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tomoki Sugimoto
Yasumasa Onoe
Hitomi Yanaka
208
7
0
19 Jun 2023
Evaluating Superhuman Models with Consistency Checks
Lukas Fluri
Daniel Paleka
Florian Tramèr
ELM
322
49
0
16 Jun 2023
PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Xiangjue Dong
Yun He
Ziwei Zhu
James Caverlee
AAML
181
7
0
07 Jun 2023
Previous
1
2
3
4
5
...
17
18
19
Next