Adversarial Examples for Evaluating Reading Comprehension Systems

23 July 2017

Robin Jia

Papers citing "Adversarial Examples for Evaluating Reading Comprehension Systems"

Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Hongyi Zheng

Abulhair Saparov

AAML LRM

264

01 Nov 2023

A Lightweight Method to Generate Unanswerable Questions in English

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Vagrant Gautam

Miaoran Zhang

Dietrich Klakow

206

30 Oct 2023

Poisoning Retrieval Corpora by Injecting Adversarial Passages

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Alexander Wettig

301

112

29 Oct 2023

Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

296

26 Oct 2023

Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks

235

25 Oct 2023

Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Mosh Levy

Haiqin Yang

Yoav Goldberg

277

24 Oct 2023

DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xiao-Yu Guo

Yuan-Fang Li

Gholamreza Haffari

236

24 Oct 2023

Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input

International Joint Conference on Natural Language Processing (IJCNLP), 2023

Minh Nguyen

Nancy F. Chen

242

21 Oct 2023

Implications of Annotation Artifacts in Edge Probing Test Datasets

Conference on Computational Natural Language Learning (CoNLL), 2023

Sagnik Ray Choudhury

Jushaan Kalra

146

20 Oct 2023

Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning

Lucas Weber

Elia Bruni

Dieuwke Hupkes

278

20 Oct 2023

Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting

Zecheng Tang

Kaiqi Feng

Juntao Li

Min Zhang

245

20 Oct 2023

No offence, Bert -- I insult only humans! Multiple addressees sentence-level attack on toxicity detection neural network

Sergey Berezin

R. Farahbakhsh

Noel Crespi

19 Oct 2023

ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks

Xiaodong Liu

219

19 Oct 2023

Pseudointelligence: A Unifying Framework for Language Model Evaluation

Shikhar Murty

Orr Paradise

Pratyusha Sharma

145

18 Oct 2023

Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks

461

228

16 Oct 2023

PerturbScore: Connecting Discrete and Continuous Perturbations in NLP

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xipeng Qiu

176

13 Oct 2023

RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yue Zhang

Leyang Cui

Enbo Zhao

Wei Bi

Shuming Shi

264

11 Oct 2023

Low-Resource Languages Jailbreak GPT-4

443

273

03 Oct 2023

Making Retrieval-Augmented Language Models Robust to Irrelevant Context

International Conference on Learning Representations (ICLR), 2023

520

305

02 Oct 2023

The Trickle-down Impact of Reward (In-)consistency on RLHF

Lingfeng Shen

Linfeng Song

Daniel Khashabi

Dong Yu

251

28 Sep 2023

On the Relationship between Skill Neurons and Robustness in Prompt Tuning

International Conference on Language Resources and Evaluation (LREC), 2023

Leon Ackermann

Xenia Ohmer

AAML

167

21 Sep 2023

Inferring Capabilities from Task Performance with Bayesian Triangulation

John Burden

Konstantinos Voudouris

Ryan Burnell

Danaja Rutar

Lucy G. Cheke

José Hernández-Orallo

167

21 Sep 2023

Model Leeching: An Extraction Attack Targeting LLMs

Lewis Birch

William Hackett

Stefan Trawicki

N. Suri

Peter Garraghan

196

19 Sep 2023

Context-aware Adversarial Attack on Named Entity Recognition

239

16 Sep 2023

CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Rachneet Sachdeva

Martin Tutek

Iryna Gurevych

OODD

312

14 Sep 2023

AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions

Son Quoc Tran

Gia-Huy Do

Phong Nguyen-Thuan Do

Matt Kretchmar

Xinya Du

292

10 Sep 2023

GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue

Jing Liu

218

08 Sep 2023

Open Sesame! Universal Black Box Jailbreaking of Large Language Models

Applied Sciences (Appl. Sci.), 2023

344

151

04 Sep 2023

Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content

Jack Miller

183

26 Aug 2023

Adversarial Illusions in Multi-Modal Embeddings

USENIX Security Symposium (USENIX Security), 2023

805

22 Aug 2023

On the Adversarial Robustness of Multi-Modal Foundation Models

Christian Schlarmann

Matthias Hein

AAML

376

139

21 Aug 2023

Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

275

17 Aug 2023

Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models

Michael Backes

250

15 Aug 2023

Automated Testing and Improvement of Named Entity Recognition Systems

231

14 Aug 2023

Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias

Son Quoc Tran

Matt Kretchmar

277

08 Aug 2023

Universal and Transferable Adversarial Attacks on Aligned Language Models

J. Zico Kolter

628

2,331

27 Jul 2023

Explaining Math Word Problem Solvers

International Conference on Natural Language Processing and Information Retrieval (ICNLPIR), 2022

Abby Newcomb

Jugal Kalita

125

24 Jul 2023

Gradient-Based Word Substitution for Obstinate Adversarial Examples Generation in Language Models

174

24 Jul 2023

NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Zióu Zheng

Xiao-Dan Zhu

AAML LRM

282

06 Jul 2023

Evade ChatGPT Detectors via A Single Space

Shuyang Cai

Wanyun Cui

DeLMO

222

05 Jul 2023

SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification

J. Wu

Dit-Yan Yeung

SILM

275

04 Jul 2023

Analyzing Multiple-Choice Reading and Listening Comprehension Tests

217

03 Jul 2023

Evaluating Paraphrastic Robustness in Textual Entailment Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

288

29 Jun 2023

A Survey on Out-of-Distribution Evaluation of Neural NLP Models

International Joint Conference on Artificial Intelligence (IJCAI), 2023

225

27 Jun 2023

Are aligned neural networks adversarially aligned?

Neural Information Processing Systems (NeurIPS), 2023

Nicholas Carlini

Milad Nasr

Christopher A. Choquette-Choo

Matthew Jagielski

Irena Gao

...

Pang Wei Koh

287

312

26 Jun 2023

Which Spurious Correlations Impact Reasoning in NLI Models? A Visual Interactive Diagnosis through Data-Constrained Counterfactuals

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Robin Shing Moon Chan

Afra Amini

Mennatallah El-Assady

LRM AAML

232

21 Jun 2023

Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation Extraction

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Haotian Chen

Bingsheng Chen

Xiangdong Zhou

254

20 Jun 2023

Jamp: Controlled Japanese Temporal Inference Dataset for Evaluating Generalization Capacity of Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Tomoki Sugimoto

Yasumasa Onoe

Hitomi Yanaka

208

19 Jun 2023

Evaluating Superhuman Models with Consistency Checks

322

16 Jun 2023

PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

187

07 Jun 2023