BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing

25 January 2023

Ting Liu

Papers citing "BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing"

Title | Authors | Tags | Counts | Date
Exposing the Ghost in the Transformer: Abnormal Detection for Large Language Models via Hidden State Forensics | Shide Zhou, K. Wang, Ling Shi, H. Wang | | 44 0 0 | 01 Apr 2025
Neutralizing Backdoors through Information Conflicts for Large Language Models | Chen Chen, Yuchen Sun, Xueluan Gong, Jiaxin Gao, K. Lam | KELM, AAML | 67 0 0 | 27 Nov 2024
Backdoor Attacks and Countermeasures in Natural Language Processing Models: A Comprehensive Security Review | Pengzhou Cheng, Zongru Wu, Wei Du, Haodong Zhao, Wei Lu, Gongshen Liu | SILM, AAML | 18 15 0 | 12 Sep 2023
Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer | Fanchao Qi, Yangyi Chen, Xurui Zhang, Mukai Li, Zhiyuan Liu, Maosong Sun | AAML, SILM | 77 171 0 | 14 Oct 2021
Mitigating backdoor attacks in LSTM-based Text Classification Systems by Backdoor Keyword Identification | Chuanshuai Chen, Jiazhu Dai | SILM | 48 126 0 | 11 Jul 2020
Backdooring and Poisoning Neural Networks with Image-Scaling Attacks | Erwin Quiring, Konrad Rieck | AAML | 31 70 0 | 19 Mar 2020