arXiv:1707.07328
Adversarial Examples for Evaluating Reading Comprehension Systems

23 July 2017
Robin Jia
Percy Liang
AAML ELM

Papers citing "Adversarial Examples for Evaluating Reading Comprehension Systems"

50 / 926 papers shown
Towards Efficient and Domain-Agnostic Evasion Attack with High-dimensional Categorical Inputs
AAAI Conference on Artificial Intelligence (AAAI), 2022
Hongyan Bao
Yufei Han
Yujun Zhou
Xin Gao
Xiangliang Zhang
AAML
143
5
0
13 Dec 2022
Feature-Level Debiased Natural Language Understanding
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yougang Lyu
Piji Li
Yechang Yang
Maarten de Rijke
Sudipta Singha Roy
Yukun Zhao
D. Yin
Zhaochun Ren
231
12
0
11 Dec 2022
Mitigating Adversarial Gray-Box Attacks Against Phishing Detectors
IEEE Transactions on Dependable and Secure Computing (TDSC), 2022
Giovanni Apruzzese
V. S. Subrahmanian
AAML
161
28
0
11 Dec 2022
A Comprehensive Survey on Multi-hop Machine Reading Comprehension Approaches
A. Mohammadi
Reza Ramezani
Ahmad Baraani
227
4
0
08 Dec 2022
A Comprehensive Survey on Multi-hop Machine Reading Comprehension Datasets and Metrics
A. Mohammadi
Reza Ramezani
Ahmad Baraani
210
1
0
08 Dec 2022
Robust Speech Recognition via Large-Scale Weak Supervision
International Conference on Machine Learning (ICML), 2022
Alec Radford
Jong Wook Kim
Tao Xu
Greg Brockman
C. McLeavey
Ilya Sutskever
OffRL
1.0K
5,722
0
06 Dec 2022
Which Shortcut Solution Do Question Answering Models Prefer to Learn?
AAAI Conference on Artificial Intelligence (AAAI), 2022
Kazutoshi Shinoda
Saku Sugawara
Akiko Aizawa
230
7
0
29 Nov 2022
Penalizing Confident Predictions on Largely Perturbed Inputs Does Not Improve Out-of-Distribution Generalization in Question Answering
Kazutoshi Shinoda
Saku Sugawara
Akiko Aizawa
OOD AAML
132
1
0
29 Nov 2022
Neural Network Verification as Piecewise Linear Optimization: Formulations for the Composition of Staircase Functions
Tu Anh-Nguyen
Joey Huchette
158
2
0
27 Nov 2022
World Knowledge in Multiple Choice Reading Comprehension
Adian Liusie
Vatsal Raina
Mark Gales
154
7
0
13 Nov 2022
NaturalAdversaries: Can Naturalistic Adversaries Be as Effective as Artificial Adversaries?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Saadia Gabriel
Hamid Palangi
Yejin Choi
AAML
244
1
0
08 Nov 2022
Are AlphaZero-like Agents Robust to Adversarial Perturbations?
Neural Information Processing Systems (NeurIPS), 2022
Li-Cheng Lan
Huan Zhang
Tai-Lin Wu
Meng-Yu Tsai
I-Chen Wu
Cho-Jui Hsieh
AAML
184
15
0
07 Nov 2022
FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Wenhao Wu
Wei Li
Jiachen Liu
Xinyan Xiao
Ziqiang Cao
Sujian Li
Hua Wu
HILM
193
11
0
01 Nov 2022
XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Dong-Ho Lee
Akshen Kadakia
Brihi Joshi
Aaron Chan
Ziyi Liu
...
Takashi Shibuya
Ryosuke Mitani
Toshiyuki Sekiya
Jay Pujara
Xiang Ren
LRM
196
11
0
30 Oct 2022
Debiasing Masks: A New Framework for Shortcut Mitigation in NLU
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
AAML
166
18
0
28 Oct 2022
ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics
Conference on Machine Translation (WMT), 2022
Chantal Amrhein
Nikita Moghe
Liane Guillou
ELM
281
27
0
27 Oct 2022
TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yu Cao
Dianqi Li
Meng Fang
Wanrong Zhu
Jun Gao
Yibing Zhan
Dacheng Tao
AAML
191
22
0
27 Oct 2022
Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jiahao Zhao
Wenji Mao
DRL OOD
180
7
0
26 Oct 2022
Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Q. Si
Yuanxin Liu
Zheng Lin
Peng Fu
Weiping Wang
VLM
290
2
0
26 Oct 2022
Look to the Right: Mitigating Relative Position Bias in Extractive Question Answering
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Kazutoshi Shinoda
Saku Sugawara
Akiko Aizawa
OOD
181
7
0
26 Oct 2022
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Victor Zhong
Weijia Shi
Anuj Kumar
Luke Zettlemoyer
216
29
0
25 Oct 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hung-Ting Chen
Michael J.Q. Zhang
Eunsol Choi
RALM HILM
332
127
0
25 Oct 2022
TAPE: Assessing Few-shot Russian Language Understanding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ekaterina Taktasheva
Tatiana Shavrina
Alena Fenogenova
Denis Shevelev
Nadezhda Katricheva
...
Svetlana Iordanskaia
Alena Spiridonova
Valentina Kurenshchikova
Ekaterina Artemova
Vladislav Mikhailov
AAML
155
14
0
23 Oct 2022
Lexical Generalization Improves with Larger Models and Longer Training
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Elron Bandel
Yoav Goldberg
Yanai Elazar
220
7
0
23 Oct 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Anas Awadalla
Mitchell Wortsman
Gabriel Ilharco
Sewon Min
Ian H. Magnusson
Hannaneh Hajishirzi
Ludwig Schmidt
ELM OOD KELM
225
23
0
22 Oct 2022
Training Dynamics for Curriculum Learning: A Study on Monolingual and Cross-lingual NLU
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Fenia Christopoulou
Gerasimos Lampouras
Ignacio Iacobacci
296
5
0
22 Oct 2022
ADDMU: Detection of Far-Boundary Adversarial Examples with Data and Model Uncertainty Estimation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Fan Yin
Yao Li
Cho-Jui Hsieh
Kai-Wei Chang
AAML
220
4
0
22 Oct 2022
Precisely the Point: Adversarial Augmentations for Faithful and Informative Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Wenhao Wu
Wei Li
Jiachen Liu
Xinyan Xiao
Sujian Li
Yajuan Lyu
286
5
0
22 Oct 2022
Identifying Human Strategies for Generating Word-Level Adversarial Examples
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Maximilian Mozes
Bennett Kleinberg
Lewis D. Griffin
AAML
230
2
0
20 Oct 2022
Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Fanchao Qi
Longtao Huang
Zhiyuan Liu
Maosong Sun
SILM
220
94
0
19 Oct 2022
Prompting GPT-3 To Be Reliable
International Conference on Learning Representations (ICLR), 2022
Chenglei Si
Zhe Gan
Zhengyuan Yang
Shuohang Wang
Jianfeng Wang
Jordan L. Boyd-Graber
Lijuan Wang
KELM LRM
407
341
0
17 Oct 2022
Hardness of Samples Need to be Quantified for a Reliable Evaluation System: Exploring Potential Opportunities with a New Task
Swaroop Mishra
Anjana Arunkumar
Chris Bryan
Chitta Baral
218
2
0
14 Oct 2022
A Survey of Parameters Associated with the Quality of Benchmarks in NLP
Swaroop Mishra
Anjana Arunkumar
Chris Bryan
Chitta Baral
202
1
0
14 Oct 2022
Assessing Out-of-Domain Language Model Performance from Few Examples
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Prasann Singhal
Jarad Forristal
Xi Ye
Greg Durrett
LRM
206
6
0
13 Oct 2022
Are Sample-Efficient NLP Models More Robust?
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Nelson F. Liu
Ananya Kumar
Percy Liang
Robin Jia
VLM OOD
158
6
0
12 Oct 2022
SEAL : Interactive Tool for Systematic Error Analysis and Labeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Nazneen Rajani
Weixin Liang
Lingjiao Chen
Margaret Mitchell
James Zou
158
17
0
11 Oct 2022
DeepPerform: An Efficient Approach for Performance Testing of Resource-Constrained Neural Networks
International Conference on Automated Software Engineering (ASE), 2022
Simin Chen
Mirazul Haque
Cong Liu
Wei Yang
209
24
0
10 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Nature Machine Intelligence (Nat. Mach. Intell.), 2022
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Robert Bamler
Zhijing Jin
631
131
0
06 Oct 2022
U3E: Unsupervised and Erasure-based Evidence Extraction for Machine Reading Comprehension
International Conference on Cloud Computing and Intelligence Systems (ICCCIS), 2022
Suzhe He
Shumin Shi
Chenghao Wu
334
0
0
06 Oct 2022
ChemAlgebra: Algebraic Reasoning on Chemical Reactions
IEEE International Joint Conference on Neural Networks (IJCNN), 2022
Andrea Valenti
D. Bacciu
Antonio Vergari
OOD LRM
196
0
0
05 Oct 2022
Text Characterization Toolkit
Daniel Simig
Tianlu Wang
Verna Dankers
Peter Henderson
Khuyagbaatar Batsuren
Dieuwke Hupkes
Mona T. Diab
166
0
0
04 Oct 2022
Using contradictions improves question answering systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Étienne Fortier-Dubois
Domenic Rosati
249
0
0
28 Sep 2022
Semantic-based Pre-training for Dialogue Understanding
International Conference on Computational Linguistics (COLING), 2022
Xuefeng Bai
Linfeng Song
Yue Zhang
249
8
0
19 Sep 2022
Possible Stories: Evaluating Situated Commonsense Reasoning under Multiple Possible Scenarios
International Conference on Computational Linguistics (COLING), 2022
Mana Ashida
Saku Sugawara
195
6
0
16 Sep 2022
Machine Reading, Fast and Slow: When Do Models "Understand" Language?
International Conference on Computational Linguistics (COLING), 2022
Sagnik Ray Choudhury
Anna Rogers
Isabelle Augenstein
LRM
177
21
0
15 Sep 2022
Instance Attack: An Explanation-based Vulnerability Analysis Framework Against DNNs for Malware Detection
PeerJ Computer Science (PeerJ CS), 2022
Ruijin Sun
Shize Guo
Jinhong Guo
Changyou Xing
Luming Yang
Xi Guo
Zhisong Pan
AAML
286
2
0
06 Sep 2022
Rare but Severe Neural Machine Translation Errors Induced by Minimal Deletion: An Empirical Study on Chinese and English
International Conference on Computational Linguistics (COLING), 2022
Ruikang Shi
Alvin Grissom II
D. Trinh
173
3
0
05 Sep 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
236
6
0
05 Sep 2022
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned
Deep Ganguli
Liane Lovitt
John Kernion
Amanda Askell
Yuntao Bai
...
Nicholas Joseph
Sam McCandlish
C. Olah
Jared Kaplan
Jack Clark
603
633
0
23 Aug 2022
A Novel Plug-and-Play Approach for Adversarially Robust Generalization
Deepak Maurya
Adarsh Barik
Jean Honorio
OOD AAML
287
0
0
19 Aug 2022