Imitation Attacks and Defenses for Black-box Machine Translation Systems

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

30 April 2020

Papers citing "Imitation Attacks and Defenses for Black-box Machine Translation Systems"

SoK: Are Watermarks in LLMs Ready for Deployment?

270

24 Dec 2025

RegionMarker: A Region-Triggered Semantic Watermarking Framework for Embedding-as-a-Service Copyright Protection

358

17 Nov 2025

δ-STEAL: LLM Stealing Attack with Local Differential Privacy

171

24 Oct 2025

Selective Adversarial Attacks on LLM Benchmarks

177

15 Oct 2025

Basic Reading Distillation

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

238

26 Jul 2025

Towards Secure MLOps: Surveying Attacks, Mitigation Strategies, and Research Challenges

Raj Patel

Himanshu Tripathi

Jasper Stone

Noorbakhsh Amiri Golilarz

308

30 May 2025

Attack and defense techniques in large language models: A survey and new perspectives

357

02 May 2025

StyleRec: A Benchmark Dataset for Prompt Recovery in Writing Style Transformation

BigData Congress [Services Society] (BSS), 2024

370

06 Apr 2025

Towards Data Governance of Frontier AI Models

Jason Hausenloy

Duncan McClements

Madhavendra Thakur

561

05 Dec 2024

NMT-Obfuscator Attack: Ignore a sentence in translation with only one word

341

19 Nov 2024

WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

434

29 Aug 2024

Rethinking Targeted Adversarial Attacks For Neural Machine Translation

Junjie Wu

226

07 Jul 2024

DORY: Deliberative Prompt Recovery for LLM

378

31 May 2024

A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers

313

20 May 2024

An Empirical Study on the Robustness of Massively Multilingual Neural Machine Translation

International Conference on Language Resources and Evaluation (LREC), 2024

Supryadi Supryadi

Leiyu Pan

Deyi Xiong

196

13 May 2024

Revisiting character-level adversarial attacks

302

07 May 2024

ModelShield: Adaptive and Robust Watermark against Model Extraction Attack

IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024

617

03 May 2024

WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection

375

03 Mar 2024

Generative Models are Self-Watermarked: Declaring Model Authentication through Re-Generation

273

23 Feb 2024

Watermarking Makes Language Models Radioactive

Pierre Fernandez

225

22 Feb 2024

Stolen Subwords: Importance of Vocabularies for Machine Translation Model Stealing

Vilém Zouhar

AAML

227

29 Jan 2024

Language Model Inversion

International Conference on Learning Representations (ICLR), 2023

Wenting Zhao

457

22 Nov 2023

Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey

Victoria Smith

Ali Shahin Shamsabadi

Carolyn Ashurst

Adrian Weller

PILM

545

27 Sep 2023

Anchor Points: Benchmarking Models with Much Fewer Examples

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Douwe Kiela

366

14 Sep 2023

A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

300

29 Aug 2023

Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks

ACM Multimedia (ACM MM), 2023

Xianglong Liu

278

02 Aug 2023

Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data

Xinzhe Li

Ming Liu

Shang Gao

284

02 Jul 2023

A Survey on Out-of-Distribution Evaluation of Neural NLP Models

International Joint Conference on Artificial Intelligence (IJCAI), 2023

307

27 Jun 2023

You Don't Need Robust Machine Learning to Manage Adversarial Attack Risks

253

16 Jun 2023

Red Teaming Language Model Detectors with Language Models

Transactions of the Association for Computational Linguistics (TACL), 2023

355

31 May 2023

The False Promise of Imitating Proprietary LLMs

Pieter Abbeel

484

267

25 May 2023

Iterative Adversarial Attack on Image-guided Story Ending Generation

IEEE Transactions on Multimedia (IEEE TMM), 2023

Youze Wang

Wenbo Hu

Richang Hong

283

16 May 2023

White-Box Multi-Objective Adversarial Attack on Dialogue Generation

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

360

05 May 2023

GrOVe: Ownership Verification of Graph Neural Networks using Embeddings

IEEE Symposium on Security and Privacy (IEEE S&P), 2023

Asim Waheed

Vasisht Duddu

Nadarajah Asokan

370

17 Apr 2023

False Claims against Model Ownership Resolution

USENIX Security Symposium (USENIX Security), 2023

846

13 Apr 2023

Adversarial Attacks and Defenses in Machine Learning-Powered Networks: A Contemporary Survey

351

11 Mar 2023

Stealing the Decoding Algorithms of Language Models

Conference on Computer and Communications Security (CCS), 2023

452

08 Mar 2023

Targeted Adversarial Attacks against Neural Machine Translation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Sahar Sadrizadeh

AmirHossein Dabiri Aghdam

Ljiljana Dolamic

P. Frossard

AAML

293

02 Mar 2023

Protecting Language Generation Models via Invisible Watermarking

International Conference on Machine Learning (ICML), 2023

Xuandong Zhao

Yu-Xiang Wang

Lei Li

WaLM

468

114

06 Feb 2023

TransFool: An Adversarial Attack against Neural Machine Translation Models

374

02 Feb 2023

Defending Against Disinformation Attacks in Open-Domain Question Answering

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

Nathaniel Weir

467

20 Dec 2022

Learned-Database Systems Security

463

20 Dec 2022

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Xiaochuang Han

Sachin Kumar

Yulia Tsvetkov

424

172

31 Oct 2022

Extracted BERT Model Leaks More Information than You Think!

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

237

21 Oct 2022

Probabilistic Inverse Modeling: An Application in Hydrology

SDM (SDM), 2022

Somya Sharma

Rahul Ghosh

Arvind Renganathan

Xiang Li

Snigdhansu Chatterjee

234

12 Oct 2022

Distillation-Resistant Watermarking for Model Protection in NLP

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Xuandong Zhao

Lei Li

Yu-Xiang Wang

WaLM

311

07 Oct 2022

CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks

Neural Information Processing Systems (NeurIPS), 2022

Yi Zeng

Jiwei Li

420

19 Sep 2022

Order-Disorder: Imitation Adversarial Attacks for Black-box Neural Ranking Models

Conference on Computer and Communications Security (CCS), 2022

Changlong Sun

Wei Lu

Xiaozhong Liu

AAML

322

14 Sep 2022

The Ethical Need for Watermarks in Machine-Generated Language

A. Grinbaum

Laurynas Adomaitis

WaLM

204

07 Sep 2022

Threat Assessment in Machine Learning based Systems

L. Tidjon

Foutse Khomh

182

30 Jun 2022