ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.07328
  4. Cited By
Adversarial Examples for Evaluating Reading Comprehension Systems

Adversarial Examples for Evaluating Reading Comprehension Systems

23 July 2017
Robin Jia
Abigail Z. Jacobs
    AAMLELM
ArXiv (abs)PDFHTML

Papers citing "Adversarial Examples for Evaluating Reading Comprehension Systems"

50 / 926 papers shown
FlippedRAG: Black-Box Opinion Manipulation Adversarial Attacks to Retrieval-Augmented Generation Models
FlippedRAG: Black-Box Opinion Manipulation Adversarial Attacks to Retrieval-Augmented Generation Models
Zhuo Chen
Jiawei Liu
Miaokun Chen
Haotan Liu
Qikai Cheng
Qikai Cheng
Fan Zhang
Wei Lu
Jing Liu
AAML
433
1
0
06 Jan 2025
Adversarial Robustness through Dynamic Ensemble Learning
Adversarial Robustness through Dynamic Ensemble Learning
Hetvi Waghela
Jaydip Sen
Sneha Rakshit
AAML
251
1
0
20 Dec 2024
What makes a good metric? Evaluating automatic metrics for text-to-image
  consistency
What makes a good metric? Evaluating automatic metrics for text-to-image consistency
Candace Ross
Melissa Hall
Adriana Romero Soriano
Adina Williams
389
8
0
18 Dec 2024
Adversarial Hubness in Multi-Modal Retrieval
Adversarial Hubness in Multi-Modal Retrieval
Tingwei Zhang
Fnu Suya
Rishi Jha
Collin Zhang
Vitaly Shmatikov
AAML
590
5
0
18 Dec 2024
Multi-Granularity Tibetan Textual Adversarial Attack Method Based on
  Masked Language Model
Multi-Granularity Tibetan Textual Adversarial Attack Method Based on Masked Language ModelThe Web Conference (WWW), 2024
Xi Cao
Nuo Qun
Quzong Gesang
Yulei Zhu
Trashi Nyima
AAML
213
5
0
03 Dec 2024
Pay Attention to the Robustness of Chinese Minority Language Models!
  Syllable-level Textual Adversarial Attack on Tibetan Script
Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script
Xi Cao
Dolma Dawa
Nuo Qun
Trashi Nyima
AAML
381
5
0
03 Dec 2024
Aligning Generalisation Between Humans and Machines
Aligning Generalisation Between Humans and Machines
Filip Ilievski
Barbara Hammer
F. V. Harmelen
Benjamin Paassen
S. Saralajew
...
Vered Shwartz
Gabriella Skitalinskaya
Clemens Stachl
Gido M. van de Ven
T. Villmann
707
5
0
23 Nov 2024
The Master-Slave Encoder Model for Improving Patent Text Summarization:
  A New Approach to Combining Specifications and Claims
The Master-Slave Encoder Model for Improving Patent Text Summarization: A New Approach to Combining Specifications and Claims
Shu Zhou
Xin Wang
Zhengda Zhou
Haohan Yi
Xuhui Zheng
Hao Wan
272
2
0
21 Nov 2024
IAE: Irony-based Adversarial Examples for Sentiment Analysis Systems
IAE: Irony-based Adversarial Examples for Sentiment Analysis SystemsIEEE Access (IEEE Access), 2024
Xiaoyin Yi
Jiacheng Huang
AAML
312
1
0
12 Nov 2024
Hiding-in-Plain-Sight (HiPS) Attack on CLIP for Targetted Object Removal
  from Images
Hiding-in-Plain-Sight (HiPS) Attack on CLIP for Targetted Object Removal from Images
Arka Daw
Megan Hong-Thanh Chung
Maria Mahbub
Amir Sadovnik
AAML
258
0
0
16 Oct 2024
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win RatesInternational Conference on Learning Representations (ICLR), 2024
Xiaosen Zheng
Tianyu Pang
Chao Du
Qian Liu
Jing Jiang
Min Lin
277
22
0
09 Oct 2024
TaeBench: Improving Quality of Toxic Adversarial Examples
TaeBench: Improving Quality of Toxic Adversarial ExamplesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Xuan Zhu
Dmitriy Bespalov
Liwen You
Ninad Kulkarni
Yanjun Qi
AAML
334
0
0
08 Oct 2024
ECon: On the Detection and Resolution of Evidence Conflicts
ECon: On the Detection and Resolution of Evidence ConflictsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Cheng Jiayang
Chunkit Chan
Qianqian Zhuang
Lin Qiu
Tianhang Zhang
Tengxiao Liu
Yangqiu Song
Yue Zhang
Pengfei Liu
Zheng Zhang
260
13
0
05 Oct 2024
Gamified crowd-sourcing of high-quality data for visual fine-tuning
Gamified crowd-sourcing of high-quality data for visual fine-tuning
Shashank Yadav
Rohan Tomar
Garvit Jain
Chirag Ahooja
Shubham Chaudhary
Charles Elkan
299
1
0
05 Oct 2024
Towards Robust Extractive Question Answering Models: Rethinking the
  Training Methodology
Towards Robust Extractive Question Answering Models: Rethinking the Training MethodologyConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Son Quoc Tran
Matt Kretchmar
OOD
221
1
0
29 Sep 2024
Responsible AI in Open Ecosystems: Reconciling Innovation with Risk
  Assessment and Disclosure
Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure
Mahasweta Chakraborti
Bert Joseph Prestoza
Nicholas Vincent
Seth Frey
268
1
0
27 Sep 2024
Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut
  Learning in Text Classification by Language Models
Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yuqing Zhou
Ruixiang Tang
Ziyu Yao
Ziwei Zhu
346
6
0
26 Sep 2024
DARE: Diverse Visual Question Answering with Robustness Evaluation
DARE: Diverse Visual Question Answering with Robustness EvaluationTransactions of the Association for Computational Linguistics (TACL), 2024
Hannah Sterz
Jonas Pfeiffer
Ivan Vulić
OODVLM
342
4
0
26 Sep 2024
Unveiling Narrative Reasoning Limits of Large Language Models with Trope
  in Movie Synopses
Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie SynopsesConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hung-Ting Su
Ya-Ching Hsu
Xudong Lin
Xiang Qian Shi
Yulei Niu
Han-Yuan Hsu
Hung-yi Lee
Winston H. Hsu
LRM
128
4
0
22 Sep 2024
Contextual Breach: Assessing the Robustness of Transformer-based QA Models
Contextual Breach: Assessing the Robustness of Transformer-based QA Models
Asir Saadat
Nahian Ibn Asad
AAML
342
0
0
17 Sep 2024
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet
Nathaniel Li
Ziwen Han
Ian Steneker
Willow Primack
Riley Goodside
Hugh Zhang
Zifan Wang
Cristina Menghini
Summer Yue
AAMLMU
285
103
0
27 Aug 2024
Adversarial Attack for Explanation Robustness of Rationalization Models
Adversarial Attack for Explanation Robustness of Rationalization ModelsEuropean Conference on Artificial Intelligence (ECAI), 2024
Yuankai Zhang
Lingxiao Kong
Haozhao Wang
Ruixuan Li
Jun Wang
Yuhua Li
Wei Liu
AAML
387
1
0
20 Aug 2024
Investigating a Benchmark for Training-set free Evaluation of Linguistic
  Capabilities in Machine Reading Comprehension
Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading Comprehension
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
194
0
0
09 Aug 2024
Optimal and efficient text counterfactuals using Graph Neural Networks
Optimal and efficient text counterfactuals using Graph Neural NetworksBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2024
Dimitris Lymperopoulos
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
181
1
0
04 Aug 2024
Enhancing Adversarial Text Attacks on BERT Models with Projected
  Gradient Descent
Enhancing Adversarial Text Attacks on BERT Models with Projected Gradient Descent
Hetvi Waghela
Jaydip Sen
Sneha Rakshit
AAMLSILM
241
6
0
29 Jul 2024
Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation
  of Large Language Models
Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models
Zhuo Chen
Jiawei Liu
Haotan Liu
Qikai Cheng
Qikai Cheng
Wei Lu
Xiaozhong Liu
AAML
226
16
0
18 Jul 2024
AutoBencher: Towards Declarative Benchmark Construction
AutoBencher: Towards Declarative Benchmark Construction
Xiang Lisa Li
Emmy Liu
Abigail Z. Jacobs
Tatsunori Hashimoto
Percy Liang
Tatsunori Hashimoto
188
1
0
11 Jul 2024
Robust Neural Information Retrieval: An Adversarial and
  Out-of-distribution Perspective
Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective
Yu-An Liu
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Yixing Fan
Xueqi Cheng
395
20
0
09 Jul 2024
Defense Against Syntactic Textual Backdoor Attacks with Token
  Substitution
Defense Against Syntactic Textual Backdoor Attacks with Token Substitution
Xinglin Li
Xianwen He
Yao Li
Minhao Cheng
196
1
0
04 Jul 2024
The Art of Saying No: Contextual Noncompliance in Language Models
The Art of Saying No: Contextual Noncompliance in Language Models
Faeze Brahman
Sachin Kumar
Vidhisha Balachandran
Pradeep Dasigi
Valentina Pyatkin
...
Jack Hessel
Yulia Tsvetkov
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
288
57
0
02 Jul 2024
A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese
A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese
Tin Van Huynh
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
341
2
0
25 Jun 2024
It Is Not About What You Say, It Is About How You Say It: A Surprisingly
  Simple Approach for Improving Reading Comprehension
It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension
Sagi Shaier
Lawrence E Hunter
Katharina von der Wense
269
4
0
24 Jun 2024
First Heuristic Then Rational: Dynamic Use of Heuristics in Language
  Model Reasoning
First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning
Yoichi Aoki
Keito Kudo
Tatsuki Kuribayashi
Shusaku Sone
Masaya Taniguchi
Keisuke Sakaguchi
Kentaro Inui
LRM
333
1
0
23 Jun 2024
Saliency Attention and Semantic Similarity-Driven Adversarial
  Perturbation
Saliency Attention and Semantic Similarity-Driven Adversarial Perturbation
Hetvi Waghela
Jaydip Sen
Sneha Rakshit
AAML
259
8
0
18 Jun 2024
People will agree what I think: Investigating LLM's False Consensus Effect
People will agree what I think: Investigating LLM's False Consensus Effect
Junhyuk Choi
Yeseon Hong
Bugeun Kim
333
2
0
16 Jun 2024
RE-RAG: Improving Open-Domain QA Performance and Interpretability with
  Relevance Estimator in Retrieval-Augmented Generation
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Kiseung Kim
Jay-Yoon Lee
RALM
301
12
0
09 Jun 2024
The Price of Implicit Bias in Adversarially Robust Generalization
The Price of Implicit Bias in Adversarially Robust GeneralizationNeural Information Processing Systems (NeurIPS), 2024
Nikolaos Tsilivis
Natalie Frank
Nathan Srebro
Julia Kempe
319
4
0
07 Jun 2024
What Makes Language Models Good-enough?
What Makes Language Models Good-enough?
Daiki Asami
Saku Sugawara
231
2
0
06 Jun 2024
MultiMax: Sparse and Multi-Modal Attention Learning
MultiMax: Sparse and Multi-Modal Attention Learning
Yuxuan Zhou
Mario Fritz
Margret Keuper
595
1
0
03 Jun 2024
Enhancing Noise Robustness of Retrieval-Augmented Language Models with
  Adaptive Adversarial Training
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
Feiteng Fang
Yuelin Bai
Shiwen Ni
Min Yang
Xiaojun Chen
Ruifeng Xu
AAMLRALM
354
71
0
31 May 2024
KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization
  in EHR
KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization in EHR
Hajung Kim
Chanhwi Kim
Hoonick Lee
Kyochul Jang
Jiwoo Lee
Kyungjae Lee
Gangwoo Kim
Jaewoo Kang
297
2
0
22 May 2024
DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based
  Counterfactual Explanations
DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual ExplanationsInternational Conference on Medical Imaging with Deep Learning (MIDL), 2024
Nima Fathi
Amar Kumar
Brennan Nichyporuk
Mohammad Havaei
Tal Arbel
DiffMCML
298
5
0
15 May 2024
BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order
  Optimization
BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization
Satyadwyoom Kumar
Saurabh Gupta
Arun Balaji Buduru
AAML
199
0
0
09 May 2024
On Adversarial Examples for Text Classification by Perturbing Latent
  Representations
On Adversarial Examples for Text Classification by Perturbing Latent Representations
Korn Sooksatra
Bikram Khanal
Pablo Rivas
SILMAAML
187
3
0
06 May 2024
Assessing Adversarial Robustness of Large Language Models: An Empirical
  Study
Assessing Adversarial Robustness of Large Language Models: An Empirical Study
Zeyu Yang
Zhao Meng
Xiaochen Zheng
Roger Wattenhofer
ELMAAML
163
21
0
04 May 2024
Harmonic LLMs are Trustworthy
Harmonic LLMs are Trustworthy
Nicholas S. Kersting
Mohammad Rahman
Suchismitha Vedala
Yang Wang
235
1
0
30 Apr 2024
Towards Unbiased Evaluation of Detecting Unanswerable Questions in
  EHRSQL
Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Yongjin Yang
Sihyeon Kim
Sangmook Kim
Gyubok Lee
Se-Young Yun
Edward Choi
206
3
0
29 Apr 2024
Characterizing LLM Abstention Behavior in Science QA with Context
  Perturbations
Characterizing LLM Abstention Behavior in Science QA with Context Perturbations
Bingbing Wen
Bill Howe
Lucy Lu Wang
176
19
0
18 Apr 2024
Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
Minghe Gao
Shuang Chen
Liang Pang
Xingtai Lv
Jisheng Dang
Wenqiao Zhang
Juncheng Li
Siliang Tang
Yueting Zhuang
Tat-Seng Chua
LRM
159
10
0
17 Apr 2024
Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on
  Simplified Corpora?
Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on Simplified Corpora?
Miriam Anschütz
Edoardo Mosca
Georg Groh
197
2
0
10 Apr 2024
Previous
12345...171819
Next