Backdooring Instruction-Tuned Large Language Models with Virtual Prompt
InjectionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023 |
Robust Speech Recognition via Large-Scale Weak SupervisionInternational Conference on Machine Learning (ICML), 2022 |
Why Should Adversarial Perturbations be Imperceptible? Rethink the
Research Paradigm in Adversarial NLPConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |