ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.07328
  4. Cited By
Adversarial Examples for Evaluating Reading Comprehension Systems

Adversarial Examples for Evaluating Reading Comprehension Systems

23 July 2017
Robin Jia
Percy Liang
    AAML
    ELM
ArXivPDFHTML

Papers citing "Adversarial Examples for Evaluating Reading Comprehension Systems"

50 / 890 papers shown
Title
Machine Translation Testing via Syntactic Tree Pruning
Machine Translation Testing via Syntactic Tree Pruning
Quanjun Zhang
Juan Zhai
Chunrong Fang
Jiawei Liu
Dongrui Liu
Haichuan Hu
Qingyu Wang
23
3
0
01 Jan 2024
From Text to Multimodal: A Comprehensive Survey of Adversarial Example
  Generation in Question Answering Systems
From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems
Gulsum Yigit
M. Amasyalı
AAML
23
0
0
26 Dec 2023
Navigating the Structured What-If Spaces: Counterfactual Generation via
  Structured Diffusion
Navigating the Structured What-If Spaces: Counterfactual Generation via Structured Diffusion
Nishtha Madaan
Srikanta J. Bedathur
DiffM
38
0
0
21 Dec 2023
Perturbation-Invariant Adversarial Training for Neural Ranking Models:
  Improving the Effectiveness-Robustness Trade-Off
Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off
Yuansan Liu
Ruqing Zhang
Mingkun Zhang
Wei Chen
Maarten de Rijke
J. Guo
Xueqi Cheng
AAML
22
6
0
16 Dec 2023
The Earth is Flat because...: Investigating LLMs' Belief towards
  Misinformation via Persuasive Conversation
The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation
Rongwu Xu
Brian S. Lin
Shujian Yang
Tianqi Zhang
Weiyan Shi
Lei Bai
Zhixuan Fang
Wei Xu
Han Qiu
44
51
0
14 Dec 2023
Towards Equipping Transformer with the Ability of Systematic
  Compositionality
Towards Equipping Transformer with the Ability of Systematic Compositionality
Chen Huang
Peixin Qin
Wenqiang Lei
Jiancheng Lv
27
1
0
12 Dec 2023
METAL: Metamorphic Testing Framework for Analyzing Large-Language Model
  Qualities
METAL: Metamorphic Testing Framework for Analyzing Large-Language Model Qualities
Sangwon Hyun
Mingyu Guo
Muhammad Ali Babar
31
8
0
11 Dec 2023
On the Robustness of Large Multimodal Models Against Image Adversarial
  Attacks
On the Robustness of Large Multimodal Models Against Image Adversarial Attacks
Xuanimng Cui
Alejandro Aparcedo
Young Kyun Jang
Ser-Nam Lim
AAML
VLM
19
38
0
06 Dec 2023
System 2 Attention (is something you might need too)
System 2 Attention (is something you might need too)
Jason Weston
Sainbayar Sukhbaatar
RALM
OffRL
LRM
22
57
0
20 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
31
3
0
16 Nov 2023
Explore Spurious Correlations at the Concept Level in Language Models
  for Text Classification
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification
Yuhang Zhou
Paiheng Xu
Xiaoyu Liu
Bang An
Wei Ai
Furong Huang
LRM
71
20
0
15 Nov 2023
DALA: A Distribution-Aware LoRA-Based Adversarial Attack against
  Language Models
DALA: A Distribution-Aware LoRA-Based Adversarial Attack against Language Models
Yibo Wang
Xiangjue Dong
James Caverlee
Philip S. Yu
23
2
0
14 Nov 2023
Measuring Adversarial Datasets
Measuring Adversarial Datasets
Yuanchen Bai
Raoyi Huang
Vijay Viswanathan
Tzu-Sheng Kuo
Tongshuang Wu
39
1
0
06 Nov 2023
Perturbation-based Active Learning for Question Answering
Perturbation-based Active Learning for Question Answering
Fan Luo
Mihai Surdeanu
16
0
0
04 Nov 2023
Noisy Exemplars Make Large Language Models More Robust: A
  Domain-Agnostic Behavioral Analysis
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis
Hongyi Zheng
Abulhair Saparov
AAML
LRM
11
7
0
01 Nov 2023
A Lightweight Method to Generate Unanswerable Questions in English
A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
9
1
0
30 Oct 2023
Poisoning Retrieval Corpora by Injecting Adversarial Passages
Poisoning Retrieval Corpora by Injecting Adversarial Passages
Zexuan Zhong
Ziqing Huang
Alexander Wettig
Danqi Chen
AAML
13
59
0
29 Oct 2023
Improving Zero-shot Reader by Reducing Distractions from Irrelevant
  Documents in Open-Domain Question Answering
Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering
Sukmin Cho
Jeongyeon Seo
Soyeong Jeong
Jong C. Park
RALM
21
2
0
26 Oct 2023
Break it, Imitate it, Fix it: Robustness by Generating Human-Like
  Attacks
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks
Aradhana Sinha
Ananth Balashankar
Ahmad Beirami
Thi Avrahami
Jilin Chen
Alex Beutel
AAML
27
4
0
25 Oct 2023
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading
  Comprehension Shortcut Triggers
Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers
Mosh Levy
Shauli Ravfogel
Yoav Goldberg
38
5
0
24 Oct 2023
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social
  Intelligence Understanding
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding
Xiao-Yu Guo
Yuan-Fang Li
Gholamreza Haffari
25
5
0
24 Oct 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing
  Noisy Input
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
Minh Nguyen
Nancy F. Chen
22
0
0
21 Oct 2023
Implications of Annotation Artifacts in Edge Probing Test Datasets
Implications of Annotation Artifacts in Edge Probing Test Datasets
Sagnik Ray Choudhury
Jushaan Kalra
16
0
0
20 Oct 2023
Mind the instructions: a holistic evaluation of consistency and
  interactions in prompt-based learning
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning
Lucas Weber
Elia Bruni
Dieuwke Hupkes
30
24
0
20 Oct 2023
Beyond Hard Samples: Robust and Effective Grammatical Error Correction
  with Cycle Self-Augmenting
Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting
Zecheng Tang
Kaiqi Feng
Juntao Li
Min Zhang
26
2
0
20 Oct 2023
No offence, Bert -- I insult only humans! Multiple addressees
  sentence-level attack on toxicity detection neural network
No offence, Bert -- I insult only humans! Multiple addressees sentence-level attack on toxicity detection neural network
Sergey Berezin
R. Farahbakhsh
Noel Crespi
11
0
0
19 Oct 2023
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large
  Language Models via Transferable Adversarial Attacks
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks
Xiaodong Yu
Hao Cheng
Xiaodong Liu
Dan Roth
Jianfeng Gao
HILM
AAML
17
15
0
19 Oct 2023
Pseudointelligence: A Unifying Framework for Language Model Evaluation
Pseudointelligence: A Unifying Framework for Language Model Evaluation
Shikhar Murty
Orr Paradise
Pratyusha Sharma
13
0
0
18 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by
  Adversarial Attacks
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
147
146
0
16 Oct 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
Linyang Li
Ke Ren
Yunfan Shao
Pengyu Wang
Xipeng Qiu
15
4
0
13 Oct 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context
  Perturbation
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
Yue Zhang
Leyang Cui
Enbo Zhao
Wei Bi
Shuming Shi
38
6
0
11 Oct 2023
Low-Resource Languages Jailbreak GPT-4
Low-Resource Languages Jailbreak GPT-4
Zheng-Xin Yong
Cristina Menghini
Stephen H. Bach
SILM
28
170
0
03 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALM
LRM
19
180
0
02 Oct 2023
The Trickle-down Impact of Reward (In-)consistency on RLHF
The Trickle-down Impact of Reward (In-)consistency on RLHF
Lingfeng Shen
Sihao Chen
Linfeng Song
Lifeng Jin
Baolin Peng
Haitao Mi
Daniel Khashabi
Dong Yu
32
21
0
28 Sep 2023
On the Relationship between Skill Neurons and Robustness in Prompt
  Tuning
On the Relationship between Skill Neurons and Robustness in Prompt Tuning
Leon Ackermann
Xenia Ohmer
AAML
21
0
0
21 Sep 2023
Inferring Capabilities from Task Performance with Bayesian Triangulation
Inferring Capabilities from Task Performance with Bayesian Triangulation
John Burden
Konstantinos Voudouris
Ryan Burnell
Danaja Rutar
Lucy G. Cheke
José Hernández-Orallo
25
7
0
21 Sep 2023
Model Leeching: An Extraction Attack Targeting LLMs
Model Leeching: An Extraction Attack Targeting LLMs
Lewis Birch
William Hackett
Stefan Trawicki
N. Suri
Peter Garraghan
24
13
0
19 Sep 2023
Context-aware Adversarial Attack on Named Entity Recognition
Context-aware Adversarial Attack on Named Entity Recognition
Shuguang Chen
Leonardo Neves
Thamar Solorio
AAML
48
0
0
16 Sep 2023
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain
  Performance and Calibration
CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration
Rachneet Sachdeva
Martin Tutek
Iryna Gurevych
OODD
27
10
0
14 Sep 2023
AGent: A Novel Pipeline for Automatically Creating Unanswerable
  Questions
AGent: A Novel Pipeline for Automatically Creating Unanswerable Questions
Son Quoc Tran
Gia-Huy Do
Phong Nguyen-Thuan Do
Matt Kretchmar
Xinya Du
21
0
0
10 Sep 2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models'
  Over-Reliance on Superficial Clue
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue
Yanrui Du
Sendong Zhao
Yuhan Chen
Rai Bai
Jing Liu
Huaqin Wu
Haifeng Wang
Bing Qin
42
2
0
08 Sep 2023
Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Raz Lapid
Ron Langberg
Moshe Sipper
AAML
16
103
0
04 Sep 2023
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation
  Approach for the Generation and Detection of Problematic Content
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content
Charles OÑeill
Jack Miller
I. Ciucă
Y. Ting 丁
Thang Bui
25
3
0
26 Aug 2023
Adversarial Illusions in Multi-Modal Embeddings
Adversarial Illusions in Multi-Modal Embeddings
Tingwei Zhang
Rishi Jha
Eugene Bagdasaryan
Vitaly Shmatikov
AAML
34
8
0
22 Aug 2023
On the Adversarial Robustness of Multi-Modal Foundation Models
On the Adversarial Robustness of Multi-Modal Foundation Models
Christian Schlarmann
Matthias Hein
AAML
116
85
0
21 Aug 2023
Evaluating the Instruction-Following Robustness of Large Language Models
  to Prompt Injection
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection
Zekun Li
Baolin Peng
Pengcheng He
Xifeng Yan
ELM
SILM
AAML
41
23
0
17 Aug 2023
Robustness Over Time: Understanding Adversarial Examples' Effectiveness
  on Longitudinal Versions of Large Language Models
Robustness Over Time: Understanding Adversarial Examples' Effectiveness on Longitudinal Versions of Large Language Models
Yugeng Liu
Tianshuo Cong
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
AAML
41
6
0
15 Aug 2023
Automated Testing and Improvement of Named Entity Recognition Systems
Automated Testing and Improvement of Named Entity Recognition Systems
Boxi Yu
Yi-Nuo Hu
Qiuyang Mang
Wen-Ying Hu
Pinjia He
23
6
0
14 Aug 2023
Single-Sentence Reader: A Novel Approach for Addressing Answer Position
  Bias
Single-Sentence Reader: A Novel Approach for Addressing Answer Position Bias
Son Quoc Tran
Matt Kretchmar
19
0
0
08 Aug 2023
Universal and Transferable Adversarial Attacks on Aligned Language
  Models
Universal and Transferable Adversarial Attacks on Aligned Language Models
Andy Zou
Zifan Wang
Nicholas Carlini
Milad Nasr
J. Zico Kolter
Matt Fredrikson
89
1,251
0
27 Jul 2023
Previous
123456...161718
Next