1707.07328
Adversarial Examples for Evaluating Reading Comprehension Systems
23 July 2017
Robin Jia
Percy Liang
AAML
ELM
Papers citing
"Adversarial Examples for Evaluating Reading Comprehension Systems"
50 / 890 papers shown
Title
Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study
Hai Ye
Yuyang Ding
Juntao Li
Hwee Tou Ng
OOD
TTA
21
9
0
09 Feb 2023
Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples
Chumeng Liang
Xiaoyu Wu
Yang Hua
Jiaru Zhang
Yiming Xue
Tao Song
Zhengui Xue
Ruhui Ma
Haibing Guan
DiffM
WIGM
16
116
0
09 Feb 2023
Less is More: Understanding Word-level Textual Adversarial Attack via n-gram Frequency Descend
Ning Lu
Shengcai Liu
Zhirui Zhang
Qi Wang
Haifeng Liu
Jiaheng Zhang
AAML
82
5
0
06 Feb 2023
TextShield: Beyond Successfully Detecting Adversarial Sentences in Text Classification
Lingfeng Shen
Ze Zhang
Haiyun Jiang
Ying Chen
AAML
41
5
0
03 Feb 2023
The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models
Son Quoc Tran
Phong Nguyen-Thuan Do
Uyen Le
Matt Kretchmar
ELM
AAML
30
7
0
31 Jan 2023
Large Language Models Can Be Easily Distracted by Irrelevant Context
Freda Shi
Xinyun Chen
Kanishka Misra
Nathan Scales
David Dohan
Ed H. Chi
Nathanael Scharli
Denny Zhou
ReLM
RALM
LRM
30
530
0
31 Jan 2023
Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation
Xiang Lin
Prathyusha Jwalapuram
Shafiq R. Joty
DiffM
31
0
0
31 Jan 2023
Node Injection for Class-specific Network Poisoning
Ansh Sharma
Rahul Kukreja
Mayank Kharbanda
Tanmoy Chakraborty
AAML
GNN
18
12
0
28 Jan 2023
Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Shuaichen Chang
Jun Wang
Mingwen Dong
Lin Pan
Henghui Zhu
...
William Yang Wang
Zhiguo Wang
Vittorio Castelli
Patrick K. L. Ng
Bing Xiang
OOD
41
34
0
21 Jan 2023
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records
Gyubok Lee
Hyeonji Hwang
Seongsu Bae
Yeonsu Kwon
W. Shin
Seongjun Yang
Minjoon Seo
Jong-Yeup Kim
E. Choi
21
18
0
16 Jan 2023
Can Large Language Models Change User Preference Adversarially?
Varshini Subhash
AAML
37
8
0
05 Jan 2023
EDoG: Adversarial Edge Detection For Graph Neural Networks
Xiaojun Xu
Yue Yu
Hanzhang Wang
Alok Lal
C.A. Gunter
Bo Li
AAML
32
10
0
27 Dec 2022
Analyzing Semantic Faithfulness of Language Models via Input Intervention on Question Answering
Akshay Chaturvedi
Swarnadeep Bhar
Soumadeep Saha
Utpal Garain
Nicholas Asher
33
4
0
21 Dec 2022
Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation
Xinyu Pi
Bin Wang
Yan Gao
Jiaqi Guo
Zhoujun Li
Jian-Guang Lou
LMTD
30
30
0
20 Dec 2022
Towards Efficient and Domain-Agnostic Evasion Attack with High-dimensional Categorical Inputs
Hongyan Bao
Yufei Han
Yujun Zhou
Xin Gao
Xiangliang Zhang
AAML
32
3
0
13 Dec 2022
Feature-Level Debiased Natural Language Understanding
Yougang Lyu
Piji Li
Yechang Yang
Maarten de Rijke
Pengjie Ren
Yukun Zhao
Dawei Yin
Z. Ren
32
10
0
11 Dec 2022
Mitigating Adversarial Gray-Box Attacks Against Phishing Detectors
Giovanni Apruzzese
V. S. Subrahmanian
AAML
31
20
0
11 Dec 2022
A Comprehensive Survey on Multi-hop Machine Reading Comprehension Approaches
A. Mohammadi
Reza Ramezani
Ahmad Baraani
27
3
0
08 Dec 2022
A Comprehensive Survey on Multi-hop Machine Reading Comprehension Datasets and Metrics
A. Mohammadi
Reza Ramezani
Ahmad Baraani
35
1
0
08 Dec 2022
Robust Speech Recognition via Large-Scale Weak Supervision
Alec Radford
Jong Wook Kim
Tao Xu
Greg Brockman
C. McLeavey
Ilya Sutskever
OffRL
49
3,290
0
06 Dec 2022
Which Shortcut Solution Do Question Answering Models Prefer to Learn?
Kazutoshi Shinoda
Saku Sugawara
Akiko Aizawa
19
6
0
29 Nov 2022
Penalizing Confident Predictions on Largely Perturbed Inputs Does Not Improve Out-of-Distribution Generalization in Question Answering
Kazutoshi Shinoda
Saku Sugawara
Akiko Aizawa
OOD
AAML
30
0
0
29 Nov 2022
Neural Network Verification as Piecewise Linear Optimization: Formulations for the Composition of Staircase Functions
Tu Anh-Nguyen
Joey Huchette
22
2
0
27 Nov 2022
World Knowledge in Multiple Choice Reading Comprehension
Adian Liusie
Vatsal Raina
Mark J. F. Gales
24
7
0
13 Nov 2022
NaturalAdversaries: Can Naturalistic Adversaries Be as Effective as Artificial Adversaries?
Saadia Gabriel
Hamid Palangi
Yejin Choi
AAML
37
1
0
08 Nov 2022
Are AlphaZero-like Agents Robust to Adversarial Perturbations?
Li-Cheng Lan
Huan Zhang
Ti-Rong Wu
Meng-Yu Tsai
I-Chen Wu
Cho-Jui Hsieh
AAML
19
10
0
07 Nov 2022
FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness
Wenhao Wu
Wei Li
Jiachen Liu
Xinyan Xiao
Ziqiang Cao
Sujian Li
Hua-Hong Wu
HILM
24
10
0
01 Nov 2022
XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models
Dong-Ho Lee
Akshen Kadakia
Brihi Joshi
Aaron Chan
Ziyi Liu
...
Takashi Shibuya
Ryosuke Mitani
Toshiyuki Sekiya
Jay Pujara
Xiang Ren
LRM
40
9
0
30 Oct 2022
Debiasing Masks: A New Framework for Shortcut Mitigation in NLU
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
AAML
44
16
0
28 Oct 2022
ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics
Chantal Amrhein
Nikita Moghe
Liane Guillou
ELM
31
22
0
27 Oct 2022
TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack
Yu Cao
Dianqi Li
Meng Fang
Dinesh Manocha
Jun Gao
Yibing Zhan
Dacheng Tao
AAML
26
15
0
27 Oct 2022
Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
Jiahao Zhao
Wenji Mao
DRL
OOD
17
3
0
26 Oct 2022
Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering
Q. Si
Yuanxin Liu
Zheng Lin
Peng Fu
Weiping Wang
VLM
39
1
0
26 Oct 2022
Look to the Right: Mitigating Relative Position Bias in Extractive Question Answering
Kazutoshi Shinoda
Saku Sugawara
Akiko Aizawa
OOD
20
7
0
26 Oct 2022
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
Victor Zhong
Weijia Shi
Wen-tau Yih
Luke Zettlemoyer
17
19
0
25 Oct 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence
Hung-Ting Chen
Michael J.Q. Zhang
Eunsol Choi
RALM
HILM
47
92
0
25 Oct 2022
TAPE: Assessing Few-shot Russian Language Understanding
Ekaterina Taktasheva
Tatiana Shavrina
Alena Fenogenova
Denis Shevelev
Nadezhda Katricheva
...
Svetlana Iordanskaia
Alena Spiridonova
Valentina Kurenshchikova
Ekaterina Artemova
Vladislav Mikhailov
AAML
45
10
0
23 Oct 2022
Lexical Generalization Improves with Larger Models and Longer Training
Elron Bandel
Yoav Goldberg
Yanai Elazar
52
6
0
23 Oct 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models
Anas Awadalla
Mitchell Wortsman
Gabriel Ilharco
Sewon Min
Ian H. Magnusson
Hannaneh Hajishirzi
Ludwig Schmidt
ELM
OOD
KELM
72
19
0
22 Oct 2022
Training Dynamics for Curriculum Learning: A Study on Monolingual and Cross-lingual NLU
Fenia Christopoulou
Gerasimos Lampouras
Ignacio Iacobacci
40
3
0
22 Oct 2022
ADDMU: Detection of Far-Boundary Adversarial Examples with Data and Model Uncertainty Estimation
Fan Yin
Yao Li
Cho-Jui Hsieh
Kai-Wei Chang
AAML
67
4
0
22 Oct 2022
Precisely the Point: Adversarial Augmentations for Faithful and Informative Text Generation
Wenhao Wu
Wei Li
Jiachen Liu
Xinyan Xiao
Sujian Li
Yajuan Lyu
33
3
0
22 Oct 2022
Identifying Human Strategies for Generating Word-Level Adversarial Examples
Maximilian Mozes
Bennett Kleinberg
Lewis D. Griffin
AAML
27
1
0
20 Oct 2022
Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Fanchao Qi
Longtao Huang
Zhiyuan Liu
Maosong Sun
SILM
14
45
0
19 Oct 2022
Prompting GPT-3 To Be Reliable
Chenglei Si
Zhe Gan
Zhengyuan Yang
Shuohang Wang
Jianfeng Wang
Jordan L. Boyd-Graber
Lijuan Wang
KELM
LRM
50
279
0
17 Oct 2022
Hardness of Samples Need to be Quantified for a Reliable Evaluation System: Exploring Potential Opportunities with a New Task
Swaroop Mishra
Anjana Arunkumar
Chris Bryan
Chitta Baral
22
1
0
14 Oct 2022
A Survey of Parameters Associated with the Quality of Benchmarks in NLP
Swaroop Mishra
Anjana Arunkumar
Chris Bryan
Chitta Baral
31
1
0
14 Oct 2022
Assessing Out-of-Domain Language Model Performance from Few Examples
Prasann Singhal
Jarad Forristal
Xi Ye
Greg Durrett
LRM
25
5
0
13 Oct 2022
Are Sample-Efficient NLP Models More Robust?
Nelson F. Liu
Ananya Kumar
Percy Liang
Robin Jia
VLM
OOD
14
6
0
12 Oct 2022
SEAL : Interactive Tool for Systematic Error Analysis and Labeling
Nazneen Rajani
Weixin Liang
Lingjiao Chen
Margaret Mitchell
James Zou
40
16
0
11 Oct 2022