ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.07328
  4. Cited By
Adversarial Examples for Evaluating Reading Comprehension Systems

Adversarial Examples for Evaluating Reading Comprehension Systems

23 July 2017
Robin Jia
Abigail Z. Jacobs
    AAMLELM
ArXiv (abs)PDFHTML

Papers citing "Adversarial Examples for Evaluating Reading Comprehension Systems"

50 / 926 papers shown
Noisy Exemplars Make Large Language Models More Robust: A
  Domain-Agnostic Behavioral Analysis
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral AnalysisConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hongyi Zheng
Abulhair Saparov
AAMLLRM
257
11
0
01 Nov 2023
A Lightweight Method to Generate Unanswerable Questions in English
A Lightweight Method to Generate Unanswerable Questions in EnglishConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
206
2
0
30 Oct 2023
Poisoning Retrieval Corpora by Injecting Adversarial Passages
Poisoning Retrieval Corpora by Injecting Adversarial PassagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zexuan Zhong
Ziqing Huang
Alexander Wettig
Danqi Chen
AAML
297
111
0
29 Oct 2023
Improving Zero-shot Reader by Reducing Distractions from Irrelevant
  Documents in Open-Domain Question Answering
Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sukmin Cho
Jeongyeon Seo
Soyeong Jeong
Jong C. Park
RALM
291
2
0
26 Oct 2023
Break it, Imitate it, Fix it: Robustness by Generating Human-Like
  Attacks
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks
Aradhana Sinha
Ananth Balashankar
Ahmad Beirami
Thi Avrahami
Jilin Chen
Alex Beutel
AAML
231
6
0
25 Oct 2023
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading
  Comprehension Shortcut Triggers
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut TriggersConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Mosh Levy
Haiqin Yang
Yoav Goldberg
277
10
0
24 Oct 2023
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social
  Intelligence Understanding
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiao-Yu Guo
Yuan-Fang Li
Gholamreza Haffari
236
7
0
24 Oct 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing
  Noisy Input
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy InputInternational Joint Conference on Natural Language Processing (IJCNLP), 2023
Minh Nguyen
Nancy F. Chen
241
0
0
21 Oct 2023
Implications of Annotation Artifacts in Edge Probing Test Datasets
Implications of Annotation Artifacts in Edge Probing Test DatasetsConference on Computational Natural Language Learning (CoNLL), 2023
Sagnik Ray Choudhury
Jushaan Kalra
140
1
0
20 Oct 2023
Mind the instructions: a holistic evaluation of consistency and
  interactions in prompt-based learning
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning
Lucas Weber
Elia Bruni
Dieuwke Hupkes
276
35
0
20 Oct 2023
Beyond Hard Samples: Robust and Effective Grammatical Error Correction
  with Cycle Self-Augmenting
Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting
Zecheng Tang
Kaiqi Feng
Juntao Li
Min Zhang
243
2
0
20 Oct 2023
No offence, Bert -- I insult only humans! Multiple addressees
  sentence-level attack on toxicity detection neural network
No offence, Bert -- I insult only humans! Multiple addressees sentence-level attack on toxicity detection neural network
Sergey Berezin
R. Farahbakhsh
Noel Crespi
99
0
0
19 Oct 2023
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large
  Language Models via Transferable Adversarial Attacks
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
Xiaodong Yu
Hao Cheng
Xiaodong Liu
Dan Roth
Jianfeng Gao
HILMAAML
219
30
0
19 Oct 2023
Pseudointelligence: A Unifying Framework for Language Model Evaluation
Pseudointelligence: A Unifying Framework for Language Model Evaluation
Shikhar Murty
Orr Paradise
Pratyusha Sharma
145
0
0
18 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by
  Adversarial Attacks
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
461
228
0
16 Oct 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
PerturbScore: Connecting Discrete and Continuous Perturbations in NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Linyang Li
Ke Ren
Yunfan Shao
Pengyu Wang
Xipeng Qiu
176
7
0
13 Oct 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context
  Perturbation
RobustGEC: Robust Grammatical Error Correction Against Subtle Context PerturbationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yue Zhang
Leyang Cui
Enbo Zhao
Wei Bi
Shuming Shi
256
7
0
11 Oct 2023
Low-Resource Languages Jailbreak GPT-4
Low-Resource Languages Jailbreak GPT-4
Zheng-Xin Yong
Cristina Menghini
Stephen H. Bach
SILM
436
267
0
03 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Making Retrieval-Augmented Language Models Robust to Irrelevant ContextInternational Conference on Learning Representations (ICLR), 2023
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALMLRM
512
303
0
02 Oct 2023
The Trickle-down Impact of Reward (In-)consistency on RLHF
The Trickle-down Impact of Reward (In-)consistency on RLHF
Lingfeng Shen
Sihao Chen
Linfeng Song
Lifeng Jin
Baolin Peng
Haitao Mi
Daniel Khashabi
Dong Yu
251
28
0
28 Sep 2023
On the Relationship between Skill Neurons and Robustness in Prompt
  Tuning
On the Relationship between Skill Neurons and Robustness in Prompt TuningInternational Conference on Language Resources and Evaluation (LREC), 2023
Leon Ackermann
Xenia Ohmer
AAML
164
0
0
21 Sep 2023
Inferring Capabilities from Task Performance with Bayesian Triangulation
Inferring Capabilities from Task Performance with Bayesian Triangulation
John Burden
Konstantinos Voudouris
Ryan Burnell
Danaja Rutar
Lucy G. Cheke
José Hernández-Orallo
166
10
0
21 Sep 2023
Model Leeching: An Extraction Attack Targeting LLMs
Model Leeching: An Extraction Attack Targeting LLMs
Lewis Birch
William Hackett
Stefan Trawicki
N. Suri
Peter Garraghan
189
25
0
19 Sep 2023
Context-aware Adversarial Attack on Named Entity Recognition
Context-aware Adversarial Attack on Named Entity Recognition
Shuguang Chen
Leonardo Neves
Thamar Solorio
AAML
237
0
0
16 Sep 2023
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain
  Performance and Calibration
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and CalibrationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Rachneet Sachdeva
Martin Tutek
Iryna Gurevych
OODD
312
16
0
14 Sep 2023
AGent: A Novel Pipeline for Automatically Creating Unanswerable
  Questions
AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions
Son Quoc Tran
Gia-Huy Do
Phong Nguyen-Thuan Do
Matt Kretchmar
Xinya Du
289
0
0
10 Sep 2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models'
  Over-Reliance on Superficial Clue
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Yanrui Du
Sendong Zhao
Yuhan Chen
Rai Bai
Jing Liu
Huaqin Wu
Haifeng Wang
Bing Qin
209
2
0
08 Sep 2023
Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Open Sesame! Universal Black Box Jailbreaking of Large Language ModelsApplied Sciences (Appl. Sci.), 2023
Raz Lapid
Ron Langberg
Moshe Sipper
AAML
343
150
0
04 Sep 2023
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation
  Approach for the Generation and Detection of Problematic Content
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content
Charles OÑeill
Jack Miller
I. Ciucă
Y. Ting 丁
Thang Bui
183
10
0
26 Aug 2023
Adversarial Illusions in Multi-Modal Embeddings
Adversarial Illusions in Multi-Modal EmbeddingsUSENIX Security Symposium (USENIX Security), 2023
Tingwei Zhang
Rishi Jha
Eugene Bagdasaryan
Vitaly Shmatikov
AAML
790
27
0
22 Aug 2023
On the Adversarial Robustness of Multi-Modal Foundation Models
On the Adversarial Robustness of Multi-Modal Foundation Models
Christian Schlarmann
Matthias Hein
AAML
376
139
0
21 Aug 2023
Evaluating the Instruction-Following Robustness of Large Language Models
  to Prompt Injection
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt InjectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zekun Li
Baolin Peng
Pengcheng He
Xifeng Yan
ELMSILMAAML
272
47
0
17 Aug 2023
Robustness Over Time: Understanding Adversarial Examples' Effectiveness
  on Longitudinal Versions of Large Language Models
Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models
Yugeng Liu
Tianshuo Cong
Subrat Kishore Dutta
Michael Backes
Yun Shen
Yang Zhang
AAML
249
10
0
15 Aug 2023
Automated Testing and Improvement of Named Entity Recognition Systems
Automated Testing and Improvement of Named Entity Recognition Systems
Boxi Yu
Yi-Nuo Hu
Qiuyang Mang
Wen-Ying Hu
Pinjia He
229
11
0
14 Aug 2023
Single-Sentence Reader: A Novel Approach for Addressing Answer Position
  Bias
Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias
Son Quoc Tran
Matt Kretchmar
273
0
0
08 Aug 2023
Universal and Transferable Adversarial Attacks on Aligned Language
  Models
Universal and Transferable Adversarial Attacks on Aligned Language Models
Andy Zou
Zifan Wang
Nicholas Carlini
Milad Nasr
J. Zico Kolter
Matt Fredrikson
623
2,304
0
27 Jul 2023
Explaining Math Word Problem Solvers
Explaining Math Word Problem SolversInternational Conference on Natural Language Processing and Information Retrieval (ICNLPIR), 2022
Abby Newcomb
Jugal Kalita
124
1
0
24 Jul 2023
Gradient-Based Word Substitution for Obstinate Adversarial Examples
  Generation in Language Models
Gradient-Based Word Substitution for Obstinate Adversarial Examples Generation in Language Models
Yimu Wang
Peng Shi
Hongyang Zhang
SILM
173
4
0
24 Jul 2023
NatLogAttack: A Framework for Attacking Natural Language Inference
  Models with Natural Logic
NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural LogicAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zióu Zheng
Xiao-Dan Zhu
AAMLLRM
282
6
0
06 Jul 2023
Evade ChatGPT Detectors via A Single Space
Evade ChatGPT Detectors via A Single Space
Shuyang Cai
Wanyun Cui
DeLMO
221
23
0
05 Jul 2023
SCAT: Robust Self-supervised Contrastive Learning via Adversarial
  Training for Text Classification
SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification
J. Wu
Dit-Yan Yeung
SILM
275
0
0
04 Jul 2023
Analyzing Multiple-Choice Reading and Listening Comprehension Tests
Analyzing Multiple-Choice Reading and Listening Comprehension Tests
Vatsal Raina
Adian Liusie
Mark Gales
ELM
217
4
0
03 Jul 2023
Evaluating Paraphrastic Robustness in Textual Entailment Models
Evaluating Paraphrastic Robustness in Textual Entailment ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Dhruv Verma
Yash Kumar Lal
Shreyashee Sinha
Benjamin Van Durme
Adam Poliak
285
7
0
29 Jun 2023
A Survey on Out-of-Distribution Evaluation of Neural NLP Models
A Survey on Out-of-Distribution Evaluation of Neural NLP ModelsInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Xinzhe Li
Ming Liu
Shang Gao
Wray Buntine
222
24
0
27 Jun 2023
Are aligned neural networks adversarially aligned?
Are aligned neural networks adversarially aligned?Neural Information Processing Systems (NeurIPS), 2023
Nicholas Carlini
Milad Nasr
Christopher A. Choquette-Choo
Matthew Jagielski
Irena Gao
...
Pang Wei Koh
Daphne Ippolito
Katherine Lee
Florian Tramèr
Ludwig Schmidt
AAML
286
313
0
26 Jun 2023
Which Spurious Correlations Impact Reasoning in NLI Models? A Visual
  Interactive Diagnosis through Data-Constrained Counterfactuals
Which Spurious Correlations Impact Reasoning in NLI Models? A Visual Interactive Diagnosis through Data-Constrained CounterfactualsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Robin Shing Moon Chan
Afra Amini
Mennatallah El-Assady
LRMAAML
232
4
0
21 Jun 2023
Did the Models Understand Documents? Benchmarking Models for Language
  Understanding in Document-Level Relation Extraction
Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation ExtractionAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Haotian Chen
Bingsheng Chen
Xiangdong Zhou
254
8
0
20 Jun 2023
Jamp: Controlled Japanese Temporal Inference Dataset for Evaluating
  Generalization Capacity of Language Models
Jamp: Controlled Japanese Temporal Inference Dataset for Evaluating Generalization Capacity of Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Tomoki Sugimoto
Yasumasa Onoe
Hitomi Yanaka
208
7
0
19 Jun 2023
Evaluating Superhuman Models with Consistency Checks
Evaluating Superhuman Models with Consistency Checks
Lukas Fluri
Daniel Paleka
Florian Tramèr
ELM
322
49
0
16 Jun 2023
PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts
PromptAttack: Probing Dialogue State Trackers with Adversarial PromptsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Xiangjue Dong
Yun He
Ziwei Zhu
James Caverlee
AAML
181
7
0
07 Jun 2023
Previous
12345...171819
Next