Adversarial Examples for Evaluating Reading Comprehension Systems

23 July 2017

Robin Jia

Papers citing "Adversarial Examples for Evaluating Reading Comprehension Systems"

50 / 926 papers shown

FlippedRAG: Black-Box Opinion Manipulation Adversarial Attacks to Retrieval-Augmented Generation Models

444

06 Jan 2025

Adversarial Robustness through Dynamic Ensemble Learning

254

20 Dec 2024

What makes a good metric? Evaluating automatic metrics for text-to-image consistency

Candace Ross

Melissa Hall

Adriana Romero Soriano

Adina Williams

405

18 Dec 2024

Adversarial Hubness in Multi-Modal Retrieval

593

18 Dec 2024

Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language ModelThe Web Conference (WWW), 2024

215

03 Dec 2024

Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script

382

03 Dec 2024

Aligning Generalisation Between Humans and Machines

...

Gabriella Skitalinskaya

Clemens Stachl

Gido M. van de Ven

T. Villmann

710

23 Nov 2024

The Master-Slave Encoder Model for Improving Patent Text Summarization: A New Approach to Combining Specifications and Claims

279

21 Nov 2024

IAE: Irony-based Adversarial Examples for Sentiment Analysis SystemsIEEE Access (IEEE Access), 2024

Xiaoyin Yi

Jiacheng Huang

AAML

315

12 Nov 2024

Hiding-in-Plain-Sight (HiPS) Attack on CLIP for Targetted Object Removal from Images

Arka Daw

Megan Hong-Thanh Chung

Maria Mahbub

Amir Sadovnik

AAML

261

16 Oct 2024

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win RatesInternational Conference on Learning Representations (ICLR), 2024

Qian Liu

303

09 Oct 2024

TaeBench: Improving Quality of Toxic Adversarial ExamplesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

336

08 Oct 2024

ECon: On the Detection and Resolution of Evidence ConflictsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Tengxiao Liu

Yangqiu Song

Yue Zhang

Pengfei Liu

Zheng Zhang

266

05 Oct 2024

Gamified crowd-sourcing of high-quality data for visual fine-tuning

306

05 Oct 2024

Towards Robust Extractive Question Answering Models: Rethinking the Training MethodologyConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Son Quoc Tran

Matt Kretchmar

OOD

234

29 Sep 2024

Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure

Mahasweta Chakraborti

Bert Joseph Prestoza

Nicholas Vincent

Seth Frey

274

27 Sep 2024

Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

349

26 Sep 2024

DARE: Diverse Visual Question Answering with Robustness EvaluationTransactions of the Association for Computational Linguistics (TACL), 2024

348

26 Sep 2024

Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie SynopsesConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Hung-Ting Su

Hung-yi Lee

Winston H. Hsu

LRM

131

22 Sep 2024

Contextual Breach: Assessing the Robustness of Transformer-based QA Models

Asir Saadat

Nahian Ibn Asad

AAML

353

17 Sep 2024

LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet

291

106

27 Aug 2024

Adversarial Attack for Explanation Robustness of Rationalization ModelsEuropean Conference on Artificial Intelligence (ECAI), 2024

Yuankai Zhang

Lingxiao Kong

Haozhao Wang

Ruixuan Li

Jun Wang

Yuhua Li

Wei Liu

AAML

393

20 Aug 2024

Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading Comprehension

197

09 Aug 2024

Optimal and efficient text counterfactuals using Graph Neural NetworksBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024

Dimitris Lymperopoulos

Maria Lymperaiou

Giorgos Filandrianos

Giorgos Stamou

184

04 Aug 2024

Enhancing Adversarial Text Attacks on BERT Models with Projected Gradient Descent

253

29 Jul 2024

Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models

Wei Lu

Xiaozhong Liu

AAML

230

18 Jul 2024

AutoBencher: Towards Declarative Benchmark Construction

Percy Liang

Tatsunori Hashimoto

204

11 Jul 2024

Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective

Yu-An Liu

Jiafeng Guo

403

09 Jul 2024

Defense Against Syntactic Textual Backdoor Attacks with Token Substitution

Xinglin Li

Xianwen He

Yao Li

Minhao Cheng

200

04 Jul 2024

The Art of Saying No: Contextual Noncompliance in Language Models

Faeze Brahman

Sachin Kumar

Vidhisha Balachandran

...

Yejin Choi

Hannaneh Hajishirzi

288

02 Jul 2024

A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese

Tin Van Huynh

Kiet Van Nguyen

Ngan Luu-Thuy Nguyen

350

25 Jun 2024

It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension

Sagi Shaier

Lawrence E Hunter

Katharina von der Wense

273

24 Jun 2024

First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning

333

23 Jun 2024

Saliency Attention and Semantic Similarity-Driven Adversarial Perturbation

263

18 Jun 2024

People will agree what I think: Investigating LLM's False Consensus Effect

Junhyuk Choi

Yeseon Hong

Bugeun Kim

343

16 Jun 2024

RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Kiseung Kim

Jay-Yoon Lee

RALM

305

09 Jun 2024

The Price of Implicit Bias in Adversarially Robust GeneralizationNeural Information Processing Systems (NeurIPS), 2024

319

07 Jun 2024

What Makes Language Models Good-enough?

Daiki Asami

Saku Sugawara

234

06 Jun 2024

MultiMax: Sparse and Multi-Modal Attention Learning

Yuxuan Zhou

Mario Fritz

Margret Keuper

610

03 Jun 2024

Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

Ruifeng Xu

354

31 May 2024

KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization in EHR

304

22 May 2024

DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual ExplanationsInternational Conference on Medical Imaging with Deep Learning (MIDL), 2024

304

15 May 2024

BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

203

09 May 2024

On Adversarial Examples for Text Classification by Perturbing Latent Representations

203

06 May 2024

Assessing Adversarial Robustness of Large Language Models: An Empirical Study

Roger Wattenhofer

169

04 May 2024

Harmonic LLMs are Trustworthy

235

30 Apr 2024

Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL

Edward Choi

210

29 Apr 2024

Characterizing LLM Abstention Behavior in Science QA with Context Perturbations

Bingbing Wen

Bill Howe

Lucy Lu Wang

183

18 Apr 2024

Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales

Liang Pang

161

17 Apr 2024

Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on Simplified Corpora?

Miriam Anschütz

Edoardo Mosca

Georg Groh

212

10 Apr 2024