Towards Robustness Against Natural Language Word Substitutions
International Conference on Learning Representations (ICLR), 2021 · arXiv: 2107.13541 · 28 July 2021
Xinshuai Dong, Anh Tuan Luu, Rongrong Ji, Hong Liu
SILM, AAML

Papers citing "Towards Robustness Against Natural Language Word Substitutions"

50 / 70 papers shown

Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal
Yang Wang, Chenghao Xiao, Yi Zhou, Stuart E. Middleton, Noura Al Moubayed, C. D. Lin
AAML · 29 Jul 2025

Bridging Robustness and Generalization Against Word Substitution Attacks in NLP via the Growth Bound Matrix Approach
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mohammed Bouri, Adnane Saoud
AAML, SILM · 14 Jul 2025

Rapid Urban Visibility Hotspots: Quantifying Building Vertex Visibility from Connected Vehicle Trajectories using Spatial Indexing
Artur Grigorev, Adriana-Simona Mihaita
03 Jun 2025

The Counting Power of Transformers
Marco Sälzer, Chris Köcher, Anthony Widjaja Lin, Georg Zetzsche
16 May 2025

Model Hemorrhage and the Robustness Limits of Large Language Models
Ziyang Ma, Hui Yuan, Guang Dai, Gui-Song Xia, Bo Du, Liangpei Zhang, Dacheng Tao
31 Mar 2025

Confidence Elicitation: A New Attack Vector for Large Language Models
International Conference on Learning Representations (ICLR), 2025
Brian Formento, Chuan-Sheng Foo, See-Kiong Ng
AAML · 07 Feb 2025

Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
International Conference on Computational Linguistics (COLING), 2025
Yang Wang, Chenghua Lin
ELM · 05 Jan 2025

ProTransformer: Robustify Transformers via Plug-and-Play Paradigm
Neural Information Processing Systems (NeurIPS), 2024
Zhichao Hou, Weizhi Gao, Yuchen Shen, Feiyi Wang, Xiaorui Liu
VLM · 30 Oct 2024
A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese
Tin Van Huynh, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
25 Jun 2024

Transformer Encoder Satisfiability: Complexity and Impact on Formal Reasoning
Marco Sälzer, Eric Alsmann, Martin Lange
LRM · 28 May 2024

GenFighter: A Generative and Evolutive Textual Attack Removal
Md Athikul Islam, Edoardo Serra, Sushil Jajodia
AAML · 17 Apr 2024

SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks
Brian Formento, Wenjie Feng, Chuan-Sheng Foo, Anh Tuan Luu, See-Kiong Ng
AAML · 27 Mar 2024

Extreme Miscalibration and the Illusion of Adversarial Robustness
Vyas Raina, Samson Tan, Volkan Cevher, Aditya Rawal, Sheng Zha, George Karypis
AAML · 27 Feb 2024

Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning
Shuai Zhao, Yaoyao Yu, Anh Tuan Luu, Jie Fu, Lingjuan Lyu, Meihuizi Jia, Jinming Wen
AAML · 19 Feb 2024

Fast Adversarial Training against Textual Adversarial Attacks
Yichen Yang, Xin Liu, Kun He
AAML · 23 Jan 2024
Toward Stronger Textual Attack Detectors
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Pierre Colombo, Marine Picot, Nathan Noiry, Guillaume Staerman, Pablo Piantanida
21 Oct 2023

Beyond Hard Samples: Robust and Effective Grammatical Error Correction with Cycle Self-Augmenting
Zecheng Tang, Kaiqi Feng, Juntao Li, Min Zhang
20 Oct 2023

Fooling the Textual Fooler via Randomizing Latent Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Duy C. Hoang, Quang H. Nguyen, Saurav Manchanda, MinLong Peng, Kok-Seng Wong, Khoa D. Doan
SILM, AAML · 02 Oct 2023

Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Bochuan Cao, Yu Cao, Lu Lin, Jinghui Chen
AAML · 18 Sep 2023

LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack
AAAI Conference on Artificial Intelligence (AAAI), 2023
HaiXiang Zhu, Zhaoqing Yang, Weiwei Shang, Yuren Wu
AAML, FAtt · 01 Aug 2023

Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks
IEEE Symposium on Security and Privacy (IEEE S&P), 2023
Xinyu Zhang, Hanbin Hong, Yuan Hong, Peng Huang, Binghui Wang, Zhongjie Ba, Kui Ren
SILM · 31 Jul 2023

Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings
Neural Information Processing Systems (NeurIPS), 2023
Klim Kireev, Maksym Andriushchenko, Carmela Troncoso, Nicolas Flammarion
OOD · 06 Jun 2023
A Causal View of Entity Bias in (Large) Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Fei Wang, Wen-An Mo, Yiwei Wang, Wenxuan Zhou, Muhao Chen
24 May 2023

Randomized Smoothing with Masked Inference for Adversarially Robust Text Classifications
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Han Cheol Moon, Shafiq Joty, Ruochen Zhao, Megh Thakkar, Xu Chi
AAML · 11 May 2023

Toward Adversarial Training on Contextualized Language Representation
International Conference on Learning Representations (ICLR), 2023
Hongqiu Wu, Wenshu Fan, Han Shi, Haizhen Zhao, Hao Fei
AAML · 08 May 2023

The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Heng Yang, Ke Li
AAML · 06 May 2023

ANTONIO: Towards a Systematic Method of Generating NLP Benchmarks for Verification
Marco Casadio, Luca Arnaboldi, M. Daggitt, Omri Isac, Tanvi Dinkar, Daniel Kienitz, Verena Rieser, Ekaterina Komendantskaya
06 May 2023

Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Shuai Zhao, Jinming Wen, Anh Tuan Luu, Jiaqi Zhao, Jie Fu
SILM · 02 May 2023
RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models
Seulki Park, Daeho Um, Hajung Yoon, Sanghyuk Chun, Sangdoo Yun, Hawook Jeong
21 Apr 2023

Masked Language Model Based Textual Adversarial Example Detection
ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2023
Xiaomei Zhang, Zhaoxi Zhang, Qi Zhong, Xufei Zheng, Yanjun Zhang, Shengshan Hu, L. Zhang
AAML · 18 Apr 2023

Backdoor Learning for NLP: Recent Advances, Challenges, and Future Research Directions
Marwan Omar
SILM, AAML · 14 Feb 2023

Less is More: Understanding Word-level Textual Adversarial Attack via n-gram Frequency Descend
Ning Lu, Shengcai Liu, Zhirui Zhang, Qi Wang, Haifeng Liu, Jiaheng Zhang
AAML · 06 Feb 2023

TextShield: Beyond Successfully Detecting Adversarial Sentences in Text Classification
International Conference on Learning Representations (ICLR), 2023
Lingfeng Shen, Ze Zhang, Haiyun Jiang, Ying-Cong Chen
AAML · 03 Feb 2023

On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Terry Yue Zhuo, Zhuang Li, Yujin Huang, Fatemeh Shiri, Weiqing Wang, Gholamreza Haffari, Yuan-Fang Li
AAML · 30 Jan 2023
TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization
International Conference on Learning Representations (ICLR), 2022
Bairu Hou, Jinghan Jia, Yihua Zhang, Guanhua Zhang, Yang Zhang, Sijia Liu, Shiyu Chang
SILM, AAML · 19 Dec 2022

Preserving Semantics in Textual Adversarial Attacks
European Conference on Artificial Intelligence (ECAI), 2022
David Herel, Hugo Cisneros, Tomas Mikolov
AAML · 08 Nov 2022

Textual Manifold-based Defense Against Natural Language Adversarial Examples
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
D. M. Nguyen, Anh Tuan Luu
AAML · 05 Nov 2022

Emergent Linguistic Structures in Neural Networks are Fragile
Emanuele La Malfa, Matthew Wicker, Marta Kwiatkowska
31 Oct 2022

Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jiahao Zhao, Wenji Mao
DRL, OOD · 26 Oct 2022

TCAB: A Large-Scale Text Classification Attack Benchmark
Kalyani Asthana, Zhouhang Xie, Wencong You, Adam Noack, Jonathan Brophy, Sameer Singh, Daniel Lowd
21 Oct 2022

Probabilistic Categorical Adversarial Attack & Adversarial Training
Han Xu, Pengfei He, Jie Ren, Yuxuan Wan, Zitao Liu, Hui Liu, Shucheng Zhou
AAML, SILM · 17 Oct 2022
Rethinking Textual Adversarial Defense for Pre-trained Language Models
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Jiayi Wang, Rongzhou Bao, Zhuosheng Zhang, Hai Zhao
AAML, SILM · 21 Jul 2022

Certified Robustness Against Natural Language Attacks by Causal Intervention
International Conference on Machine Learning (ICML), 2022
Haiteng Zhao, Chang Ma, Xinshuai Dong, Anh Tuan Luu, Zhi-Hong Deng, Hanwang Zhang
AAML · 24 May 2022

A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Predictions
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Yong Xie, Dakuo Wang, Pin-Yu Chen, Jinjun Xiong, Sijia Liu, Oluwasanmi Koyejo
AAML · 01 May 2022

Improving robustness of language models from a geometry-aware perspective
Findings, 2022
Bin Zhu, Zhaoquan Gu, Le Wang, Jinyin Chen, Qi Xuan
AAML · 28 Apr 2022

"That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial Attacks
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Edoardo Mosca, Shreyash Agarwal, Javier Rando, Georg Groh
AAML · 10 Apr 2022

Text Adversarial Purification as Defense against Adversarial Attacks
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Linyang Li, Demin Song, Xipeng Qiu
AAML · 27 Mar 2022

Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Hanjie Chen, Yangfeng Ji
OOD, AAML, VLM · 23 Mar 2022

On Robust Prefix-Tuning for Text Classification
International Conference on Learning Representations (ICLR), 2022
Zonghan Yang, Yang Liu
VLM · 19 Mar 2022

A Survey of Adversarial Defences and Robustness in NLP
Shreyansh Goyal, Sumanth Doddapaneni, Mitesh M. Khapra, B. Ravindran
AAML · 12 Mar 2022