ResearchTrend.AI
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making

27 May 2023
Xuanjie Fang
Sijie Cheng
Yang Liu
Wen Wang
    AAML

Papers citing "Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making"

7 / 7 papers shown
TF-Attack: Transferable and Fast Adversarial Attacks on Large Language Models
Zelin Li
Kehai Chen
Lemao Liu
Xuefeng Bai
Mingming Yang
Yang Xiang
Min Zhang
AAML
25
0
0
26 Aug 2024
Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification
Boyang Zhang
Yicong Tan
Yun Shen
Ahmed Salem
Michael Backes
Savvas Zannettou
Yang Zhang
LLMAG
AAML
40
14
0
30 Jul 2024
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models
Yuansan Liu
Ruqing Zhang
J. Guo
Maarten de Rijke
Yixing Fan
Xueqi Cheng
AAML
46
13
0
02 Apr 2024
Generating Valid and Natural Adversarial Examples with Large Language Models
Zimu Wang
Wei Wang
Qi Chen
Qiufeng Wang
Anh Nguyen
AAML
21
4
0
20 Nov 2023
Privacy in Large Language Models: Attacks, Defenses and Future Directions
Haoran Li
Yulin Chen
Jinglong Luo
Yan Kang
Xiaojin Zhang
Qi Hu
Chunkit Chan
Yangqiu Song
PILM
38
40
0
16 Oct 2023
Gradient-based Adversarial Attacks against Text Transformers
Chuan Guo
Alexandre Sablayrolles
Hervé Jégou
Douwe Kiela
SILM
98
227
0
15 Apr 2021
Generating Natural Language Adversarial Examples
M. Alzantot
Yash Sharma
Ahmed Elgohary
Bo-Jhang Ho
Mani B. Srivastava
Kai-Wei Chang
AAML
243
914
0
21 Apr 2018