Adversarial Examples for Evaluating Reading Comprehension Systems

23 July 2017

Robin Jia

Papers citing "Adversarial Examples for Evaluating Reading Comprehension Systems"

50 / 926 papers shown

Towards Efficient and Domain-Agnostic Evasion Attack with High-dimensional Categorical InputsAAAI Conference on Artificial Intelligence (AAAI), 2022

143

13 Dec 2022

Feature-Level Debiased Natural Language UnderstandingAAAI Conference on Artificial Intelligence (AAAI), 2022

Maarten de Rijke

231

11 Dec 2022

Mitigating Adversarial Gray-Box Attacks Against Phishing DetectorsIEEE Transactions on Dependable and Secure Computing (TDSC), 2022

Giovanni Apruzzese

V. S. Subrahmanian

AAML

161

11 Dec 2022

A Comprehensive Survey on Multi-hop Machine Reading Comprehension Approaches

A. Mohammadi

Reza Ramezani

Ahmad Baraani

227

08 Dec 2022

A Comprehensive Survey on Multi-hop Machine Reading Comprehension Datasets and Metrics

A. Mohammadi

Reza Ramezani

Ahmad Baraani

210

08 Dec 2022

Robust Speech Recognition via Large-Scale Weak SupervisionInternational Conference on Machine Learning (ICML), 2022

1.0K

5,722

06 Dec 2022

Which Shortcut Solution Do Question Answering Models Prefer to Learn?AAAI Conference on Artificial Intelligence (AAAI), 2022

Kazutoshi Shinoda

Saku Sugawara

Akiko Aizawa

230

29 Nov 2022

Penalizing Confident Predictions on Largely Perturbed Inputs Does Not Improve Out-of-Distribution Generalization in Question Answering

132

29 Nov 2022

Neural Network Verification as Piecewise Linear Optimization: Formulations for the Composition of Staircase Functions

Tu Anh-Nguyen

Joey Huchette

158

27 Nov 2022

World Knowledge in Multiple Choice Reading Comprehension

Adian Liusie

Vatsal Raina

Mark Gales

154

13 Nov 2022

NaturalAdversaries: Can Naturalistic Adversaries Be as Effective as Artificial Adversaries?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Saadia Gabriel

Hamid Palangi

Yejin Choi

AAML

244

08 Nov 2022

Are AlphaZero-like Agents Robust to Adversarial Perturbations?Neural Information Processing Systems (NeurIPS), 2022

184

07 Nov 2022

FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual RobustnessConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Sujian Li

193

01 Nov 2022

XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

...

Xiang Ren

196

30 Oct 2022

Debiasing Masks: A New Framework for Shortcut Mitigation in NLUConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Johannes Mario Meissner

Saku Sugawara

Akiko Aizawa

AAML

166

28 Oct 2022

ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation MetricsConference on Machine Translation (WMT), 2022

281

27 Oct 2022

TASA: Deceiving Question Answering Models by Twin Answer Sentences AttackConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Yibing Zhan

191

27 Oct 2022

Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial RobustnessIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Jiahao Zhao

Wenji Mao

DRL OOD

180

26 Oct 2022

Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Zheng Lin

290

26 Oct 2022

Look to the Right: Mitigating Relative Position Bias in Extractive Question AnsweringBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022

181

26 Oct 2022

RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Victor Zhong

Weijia Shi

Anuj Kumar

Luke Zettlemoyer

216

25 Oct 2022

Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting EvidenceConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

332

127

25 Oct 2022

TAPE: Assessing Few-shot Russian Language UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Alena Fenogenova

...

Valentina Kurenshchikova

Ekaterina Artemova

Vladislav Mikhailov

AAML

155

23 Oct 2022

Lexical Generalization Improves with Larger Models and Longer TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Elron Bandel

Yoav Goldberg

Yanai Elazar

220

23 Oct 2022

Exploring The Landscape of Distributional Robustness for Question Answering ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

225

22 Oct 2022

Training Dynamics for Curriculum Learning: A Study on Monolingual and Cross-lingual NLUConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Fenia Christopoulou

Gerasimos Lampouras

Ignacio Iacobacci

296

22 Oct 2022

ADDMU: Detection of Far-Boundary Adversarial Examples with Data and Model Uncertainty EstimationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

220

22 Oct 2022

Precisely the Point: Adversarial Augmentations for Faithful and Informative Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Sujian Li

286

22 Oct 2022

Identifying Human Strategies for Generating Word-Level Adversarial ExamplesConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

230

20 Oct 2022

Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Hongcheng Gao

Zhiyuan Liu

Maosong Sun

SILM

220

19 Oct 2022

Prompting GPT-3 To Be ReliableInternational Conference on Learning Representations (ICLR), 2022

Jordan L. Boyd-Graber

Lijuan Wang

KELM LRM

407

341

17 Oct 2022

Hardness of Samples Need to be Quantified for a Reliable Evaluation System: Exploring Potential Opportunities with a New Task

218

14 Oct 2022

A Survey of Parameters Associated with the Quality of Benchmarks in NLP

202

14 Oct 2022

Assessing Out-of-Domain Language Model Performance from Few ExamplesConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

206

13 Oct 2022

Are Sample-Efficient NLP Models More Robust?Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Robin Jia

158

12 Oct 2022

SEAL : Interactive Tool for Systematic Error Analysis and LabelingConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

158

11 Oct 2022

DeepPerform: An Efficient Approach for Performance Testing of Resource-Constrained Neural NetworksInternational Conference on Automated Software Engineering (ASE), 2022

209

10 Oct 2022

State-of-the-art generalisation research in NLP: A taxonomy and reviewNature Machine Intelligence (Nat. Mach. Intell.), 2022

Verna Dankers

...

631

131

06 Oct 2022

U3E: Unsupervised and Erasure-based Evidence Extraction for Machine Reading ComprehensionInternational Conference on Cloud Computing and Intelligence Systems (ICCCIS), 2022

Suzhe He

Shumin Shi

Chenghao Wu

334

06 Oct 2022

ChemAlgebra: Algebraic Reasoning on Chemical ReactionsIEEE International Joint Conference on Neural Network (IJCNN), 2022

196

05 Oct 2022

Text Characterization Toolkit

Daniel Simig

Tianlu Wang

Verna Dankers

Peter Henderson

Khuyagbaatar Batsuren

Dieuwke Hupkes

Mona T. Diab

166

04 Oct 2022

Using contradictions improves question answering systemsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Étienne Fortier-Dubois

Domenic Rosati

249

28 Sep 2022

Semantic-based Pre-training for Dialogue UnderstandingInternational Conference on Computational Linguistics (COLING), 2022

Xuefeng Bai

Linfeng Song

Yue Zhang

249

19 Sep 2022

Possible Stories: Evaluating Situated Commonsense Reasoning under Multiple Possible ScenariosInternational Conference on Computational Linguistics (COLING), 2022

Mana Ashida

Saku Sugawara

195

16 Sep 2022

Machine Reading, Fast and Slow: When Do Models "Understand" Language?International Conference on Computational Linguistics (COLING), 2022

177

15 Sep 2022

Instance Attack:An Explanation-based Vulnerability Analysis Framework Against DNNs for Malware DetectionPeerJ Computer Science (PeerJ CS), 2022

286

06 Sep 2022

Rare but Severe Neural Machine Translation Errors Induced by Minimal Deletion: An Empirical Study on Chinese and EnglishInternational Conference on Computational Linguistics (COLING), 2022

Ruikang Shi

Alvin Grissom II

D. Trinh

173

05 Sep 2022

A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension

Xanh Ho

Johannes Mario Meissner

Saku Sugawara

Akiko Aizawa

OffRL

236

05 Sep 2022

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

Deep Ganguli

...

603

633

23 Aug 2022

A Novel Plug-and-Play Approach for Adversarially Robust Generalization

287

19 Aug 2022