Universal Adversarial Triggers for Attacking and Analyzing NLP

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019

20 August 2019

Papers citing "Universal Adversarial Triggers for Attacking and Analyzing NLP"

50 / 662 papers shown

Are Multilingual BERT models robust? A Case Study on Adversarial Attacks for Multilingual Question Answering

149

15 Apr 2021

Gradient-based Adversarial Attacks against Text Transformers

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Chuan Guo

Alexandre Sablayrolles

Edouard Grave

Douwe Kiela

SILM

277

292

15 Apr 2021

Consistency Training with Virtual Adversarial Discrete Perturbation

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Jungsoo Park

Gyuwan Kim

Jaewoo Kang

165

15 Apr 2021

Detoxifying Language Models Risks Marginalizing Minority Voices

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

255

137

13 Apr 2021

Evaluating Pre-Trained Models for User Feedback Analysis in Software Engineering: A Study on Classification of App-Reviews

Empirical Software Engineering (EMSE), 2021

M. Hadi

Fatemeh H. Fard

304

12 Apr 2021

Factual Probing Is [MASK]: Learning vs. Learning to Recall

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Zexuan Zhong

Dan Friedman

Danqi Chen

334

441

12 Apr 2021

Double Perturbation: On the Robustness of Robustness and Counterfactual Bias Evaluation

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

168

12 Apr 2021

FUDGE: Controlled Text Generation With Future Discriminators

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Kevin Kaichuang Yang

Dan Klein

316

386

12 Apr 2021

Achieving Model Robustness through Discrete Adversarial Training

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Maor Ivgi

Jonathan Berant

AAML

253

11 Apr 2021

Connecting Attributions and QA Model Behavior on Realistic Counterfactuals

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Xi Ye

Rohan Nair

Greg Durrett

244

09 Apr 2021

Dynabench: Rethinking Benchmarking in NLP

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Douwe Kiela

...

Robin Jia

386

467

07 Apr 2021

Normal vs. Adversarial: Salience-based Analysis of Adversarial Samples for Relation Extraction

Luoqiu Li

Xiang Chen

Zhen Bi

Xin Xie

Shumin Deng

Ningyu Zhang

Chuanqi Tan

Mosha Chen

Huajun Chen

AAML

433

01 Apr 2021

Explaining the Road Not Taken

Hua Shen

Ting-Hao 'Kenneth' Huang

FAtt XAI

196

27 Mar 2021

Plug-and-Blend: A Framework for Controllable Story Generation with Blended Control Codes

Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2021

Zhiyu Lin

Mark O. Riedl

225

23 Mar 2021

Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Samson Tan

Shafiq Joty

AAML

269

17 Mar 2021

Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Tal Schuster

Adam Fisch

Regina Barzilay

248

262

15 Mar 2021

MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Tuhin Chakrabarty

Xurui Zhang

Smaranda Muresan

Nanyun Peng

147

11 Mar 2021

ENTRUST: Argument Reframing with Language Models and Entailment

North American Chapter of the Association for Computational Linguistics (NAACL), 2021

Tuhin Chakrabarty

Christopher Hidey

Smaranda Muresan

156

11 Mar 2021

T-Miner: A Generative Approach to Defend Against Trojan Attacks on DNN-based Text Classification

USENIX Security Symposium (USENIX Security), 2021

232

07 Mar 2021

A Survey On Universal Adversarial Attack

International Joint Conference on Artificial Intelligence (IJCAI), 2021

In So Kweon

322

105

02 Mar 2021

Certified Robustness to Programmable Transformations in LSTMs

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

170

15 Feb 2021

Realizable Universal Adversarial Perturbations for Malware

Raphael Labaca-Castro

188

12 Feb 2021

A Real-time Defense against Website Fingerprinting Attacks

169

08 Feb 2021

Model Agnostic Answer Reranking System for Adversarial Question Answering

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

209

05 Feb 2021

BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation

Conference on Fairness, Accountability and Transparency (FAccT), 2021

295

486

27 Jan 2021

Adversarial Stylometry in the Wild: Transferable Lexical Substitution Attacks on Author Profiling

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

191

27 Jan 2021

Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

Kuan-Hao Huang

Kai-Wei Chang

335

26 Jan 2021

Adv-OLM: Generating Textual Adversaries via OLM

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

Vijit Malik

A. Bhat

Ashutosh Modi

212

21 Jan 2021

Data-to-text Generation by Splicing Together Nearest Neighbors

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Sam Wiseman

A. Backurs

K. Stratos

283

20 Jan 2021

Towards Confident Machine Reading Comprehension

Rishav Chakravarti

Avirup Sil

151

20 Jan 2021

Persistent Anti-Muslim Bias in Large Language Models

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2021

407

650

14 Jan 2021

Adversarial Machine Learning in Text Analysis and Generation

I. Alsmadi

AAML

210

14 Jan 2021

BERT & Family Eat Word Salad: Experiments with Text Understanding

AAAI Conference on Artificial Intelligence (AAAI), 2021

Ashim Gupta

Giorgi Kvernadze

Vivek Srikumar

434

10 Jan 2021

DynaSent: A Dynamic Benchmark for Sentiment Analysis

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

Christopher Potts

Zhengxuan Wu

Atticus Geiger

Douwe Kiela

468

30 Dec 2020

Generating Natural Language Attacks in a Hard Label Black Box Setting

AAAI Conference on Artificial Intelligence (AAAI), 2020

227

118

29 Dec 2020

Analysis of Dominant Classes in Universal Adversarial Perturbations

Knowledge-Based Systems (KBS), 2020

220

28 Dec 2020

To what extent do human explanations of model behavior align with actual model behavior?

BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2020

Grusha Prasad

Yixin Nie

Joey Tianyi Zhou

Robin Jia

Douwe Kiela

Adina Williams

232

24 Dec 2020

A Distributional Approach to Controlled Text Generation

International Conference on Learning Representations (ICLR), 2020

Muhammad Khalifa

Hady ElSahar

Marc Dymetman

461

129

21 Dec 2020

AdvExpander: Generating Natural Language Adversarial Examples by Expanding Text

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Zhihong Shao

Zitao Liu

142

18 Dec 2020

Multilingual Transfer Learning for QA Using Translation as Data Augmentation

AAAI Conference on Artificial Intelligence (AAAI), 2020

225

10 Dec 2020

Transdisciplinary AI Observatory -- Retrospective Analyses and Future-Oriented Contradistinctions

Nadisha-Marie Aliman

L. Kester

Roman V. Yampolskiy

266

26 Nov 2020

On the Transferability of Adversarial Attacks against Neural Text Classifier

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

278

17 Nov 2020

Adversarial Semantic Collisions

253

09 Nov 2020

Automatic Detection of Machine Generated Text: A Critical Survey

International Conference on Computational Linguistics (COLING), 2020

Ganesh Jawahar

Muhammad Abdul-Mageed

L. Lakshmanan

DeLMO

293

281

02 Nov 2020

A Targeted Attack on Black-Box Neural Machine Translation with Parallel Data Poisoning

Chang Xu

Jun Wang

Yuqing Tang

Francisco Guzman

Benjamin I. P. Rubinstein

Trevor Cohn

AAML

228

02 Nov 2020

Leveraging Extracted Model Adversaries for Improved Black Box Attacks

BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2020

Naveen Jafer Nizar

Ari Kobren

MIACV

30 Oct 2020

AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

420

482

29 Oct 2020

Concealed Data Poisoning Attacks on NLP Models

190

23 Oct 2020

Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs

Yejin Choi

235

15 Oct 2020

Recipes for Safety in Open-domain Chatbots

Jason Weston

337

244

14 Oct 2020