Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.12171
Cited By
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
18 March 2024
Weikang Zhou
Xiao Wang
Limao Xiong
Han Xia
Yingshuang Gu
Mingxu Chai
Fukang Zhu
Caishuang Huang
Shihan Dou
Zhiheng Xi
Rui Zheng
Songyang Gao
Yicheng Zou
Hang Yan
Yifan Le
Ruohui Wang
Lijun Li
Jing Shao
Tao Gui
Qi Zhang
Xuanjing Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models"
3 / 3 papers shown
Title
Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs
Haoming Yang
Ke Ma
X. Jia
Yingfei Sun
Qianqian Xu
Q. Huang
AAML
45
0
0
03 May 2025
Learn Your Reference Model for Real Good Alignment
Alexey Gorbatovski
Boris Shaposhnikov
Alexey Malakhov
Nikita Surnachev
Yaroslav Aksenov
Ian Maksimov
Nikita Balagansky
Daniil Gavrilov
OffRL
45
25
0
15 Apr 2024
GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
Jiahao Yu
Xingwei Lin
Zheng Yu
Xinyu Xing
SILM
110
292
0
19 Sep 2023
1