2302.03668
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery
7 February 2023
Yuxin Wen
Neel Jain
John Kirchenbauer
Micah Goldblum
Jonas Geiping
Tom Goldstein
VLM
DiffM
Papers citing
"Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery"
50 / 195 papers shown
Finetuning CLIP to Reason about Pairwise Differences
Dylan Sam
Devin Willmott
João Dias Semedo
J. Zico Kolter
VLM
56
3
0
15 Sep 2024
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models
Bang An
Sicheng Zhu
Ruiyi Zhang
Michael-Andrei Panaitescu-Liess
Yuancheng Xu
Furong Huang
AAML
31
11
0
01 Sep 2024
Concept Distillation from Strong to Weak Models via Hypotheses-to-Theories Prompting
Emmanuel Aboah Boateng
Cassiano O. Becker
Nabiha Asghar
Kabir Walia
Ashwin Srinivasan
Ehi Nosakhare
Victor Dibia
Soundar Srinivasan
LRM
31
0
0
18 Aug 2024
Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral 7B Model and Data Augmentation
Artur Guimarães
Bruno Martins
João Magalhães
13
0
0
06 Aug 2024
Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models?
Mohammad Bahrami Karkevandi
Nishant Vishwamitra
Peyman Najafirad
AAML
43
1
0
05 Aug 2024
Mission Impossible: A Statistical Perspective on Jailbreaking LLMs
Jingtong Su
Mingyu Lee
SangKeun Lee
30
7
0
02 Aug 2024
Detecting, Explaining, and Mitigating Memorization in Diffusion Models
Yuxin Wen
Yuchen Liu
Chen Chen
Lingjuan Lyu
29
51
0
31 Jul 2024
Detecting and Understanding Vulnerabilities in Language Models via Mechanistic Interpretability
Jorge García-Carrasco
A. Maté
Juan Trujillo
AAML
23
3
0
29 Jul 2024
MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models
Chunsan Hong
Tae-Hyun Oh
Minhyuk Sung
VLM
EGVM
24
0
0
24 Jul 2024
Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models
Xiao Liu
Xiaoliu Guan
Yu Wu
Jiaxu Miao
26
7
0
22 Jul 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
Yunseon Choi
Sangmin Bae
Seonghyun Ban
Minchan Jeong
Chuheng Zhang
Lei Song
Li Zhao
Jiang Bian
Kee-Eung Kim
VLM
AAML
29
3
0
20 Jul 2024
Disrupting Diffusion-based Inpainters with Semantic Digression
Geonho Son
Juhun Lee
Simon S. Woo
DiffM
23
2
0
14 Jul 2024
Was it Slander? Towards Exact Inversion of Generative Language Models
Adrians Skapars
Edoardo Manino
Youcheng Sun
Lucas C. Cordeiro
18
3
0
10 Jul 2024
Optimising Hard Prompts with Few-Shot Meta-Prompting
Sayash Raaj Hiraou
19
2
0
09 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
66
6
0
07 Jul 2024
LoPT: Low-Rank Prompt Tuning for Parameter Efficient Language Models
Shouchang Guo
Sonam Damani
Keng-hao Chang
VLM
28
1
0
27 Jun 2024
On Discrete Prompt Optimization for Diffusion Models
Ruochen Wang
Ting Liu
Cho-Jui Hsieh
Boqing Gong
DiffM
26
6
0
27 Jun 2024
LLM2FEA: Discover Novel Designs with Generative Evolutionary Multitasking
Melvin Wong
Jiao Liu
Thiago Rios
Stefan Menzel
Yew-Soon Ong
40
2
0
21 Jun 2024
Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems
Đorđe Klisura
Anthony Rios
AAML
19
1
0
20 Jun 2024
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs
Krista Opsahl-Ong
Michael J Ryan
Josh Purtell
David Broman
Christopher Potts
Matei A. Zaharia
Omar Khattab
28
26
0
17 Jun 2024
"Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jailbreak
Lingrui Mei
Shenghua Liu
Yiwei Wang
Baolong Bi
Jiayi Mao
Xueqi Cheng
AAML
42
9
0
17 Jun 2024
Yo'LLaVA: Your Personalized Language and Vision Assistant
Thao Nguyen
Haotian Liu
Yuheng Li
Mu Cai
Utkarsh Ojha
Yong Jae Lee
VLM
MLLM
44
15
0
13 Jun 2024
Understanding Visual Concepts Across Models
Brandon Trabucco
Max Gurinas
Kyle Doherty
Ruslan Salakhutdinov
VLM
32
0
0
11 Jun 2024
DiffusionPID: Interpreting Diffusion via Partial Information Decomposition
Shaurya Dewan
Rushikesh Zawar
Prakanshul Saxena
Yingshan Chang
Andrew F. Luo
Yonatan Bisk
DiffM
38
3
0
07 Jun 2024
Ranking Manipulation for Conversational Search Engines
Samuel Pfrommer
Yatong Bai
Tanmay Gautam
Somayeh Sojoudi
SILM
39
4
0
05 Jun 2024
Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Xiaojun Jia
Tianyu Pang
Chao Du
Yihao Huang
Jindong Gu
Yang Liu
Xiaochun Cao
Min-Bin Lin
AAML
44
22
0
31 May 2024
Efficient LLM-Jailbreaking by Introducing Visual Modality
Zhenxing Niu
Yuyao Sun
Haodong Ren
Haoxuan Ji
Quan Wang
Xiaoke Ma
Gang Hua
Rong Jin
23
0
0
30 May 2024
Improved Generation of Adversarial Examples Against Safety-aligned LLMs
Qizhang Li
Yiwen Guo
Wangmeng Zuo
Hao Chen
AAML
SILM
21
5
0
28 May 2024
R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model
Changhoon Kim
Kyle Min
Yezhou Yang
40
19
0
25 May 2024
Learning to Reason via Program Generation, Emulation, and Search
Nathaniel Weir
Muhammad Khalifa
Linlu Qiu
Orion Weller
Peter Clark
SyDa
ReLM
LRM
49
5
0
25 May 2024
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models
Yimeng Zhang
Xin Chen
Jinghan Jia
Yihua Zhang
Chongyu Fan
Jiancheng Liu
Mingyi Hong
Ke Ding
Sijia Liu
DiffM
29
51
0
24 May 2024
DEGAP: Dual Event-Guided Adaptive Prefixes for Templated-Based Event Argument Extraction with Slot Querying
Guanghui Wang
Dexi Liu
Jian-Yun Nie
Qizhi Wan
Rong Hu
Xiping Liu
Wanlong Liu
Jiaming Liu
63
0
0
22 May 2024
Towards Modular LLMs by Building and Reusing a Library of LoRAs
O. Ostapenko
Zhan Su
E. Ponti
Laurent Charlin
Nicolas Le Roux
Matheus Pereira
Lucas Page-Caccia
Alessandro Sordoni
MoMe
32
30
0
18 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
29
12
0
16 May 2024
Espresso: Robust Concept Filtering in Text-to-Image Models
Anudeep Das
Vasisht Duddu
Rui Zhang
Nadarajah Asokan
EGVM
26
5
0
30 Apr 2024
LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing
Zeyang Ma
A. Chen
Dong Jae Kim
Tse-Hsun Chen
Shaowei Wang
27
41
0
27 Apr 2024
Don't Say No: Jailbreaking LLM by Suppressing Refusal
Yukai Zhou
Wenjie Wang
AAML
29
15
0
25 Apr 2024
Trojan Detection in Large Language Models: Insights from The Trojan Detection Challenge
Narek Maloyan
Ekansh Verma
Bulat Nutfullin
Bislan Ashinov
41
7
0
21 Apr 2024
Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
Jiabao Ji
Bairu Hou
Zhen Zhang
Guanhua Zhang
Wenqi Fan
Qing Li
Yang Zhang
Gaowen Liu
Sijia Liu
Shiyu Chang
AAML
27
5
0
18 Apr 2024
Experimental Design for Active Transductive Inference in Large Language Models
Subhojyoti Mukherjee
Anusha Lalitha
Aniket Deshmukh
Ge Liu
Yifei Ma
B. Kveton
LRM
35
1
0
12 Apr 2024
Latent Guard: a Safety Framework for Text-to-image Generation
Runtao Liu
Ashkan Khakzar
Jindong Gu
Qifeng Chen
Philip H. S. Torr
Fabio Pizzati
21
23
0
11 Apr 2024
Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models
Jiachen Ma
Anda Cao
Zhiqing Xiao
Jie Zhang
Chaonan Ye
Junbo Zhao
14
29
0
02 Apr 2024
Efficient Prompting Methods for Large Language Models: A Survey
Kaiyan Chang
Songcheng Xu
Chenglong Wang
Yingfeng Luo
Tong Xiao
Jingbo Zhu
LRM
30
32
0
01 Apr 2024
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Patrick Chao
Edoardo Debenedetti
Alexander Robey
Maksym Andriushchenko
Francesco Croce
...
Nicolas Flammarion
George J. Pappas
F. Tramèr
Hamed Hassani
Eric Wong
ALM
ELM
AAML
52
92
0
28 Mar 2024
Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation
Yutong He
Alexander Robey
Naoki Murata
Yiding Jiang
J. Williams
George Pappas
Hamed Hassani
Yuki Mitsufuji
Ruslan Salakhutdinov
J. Zico Kolter
DiffM
91
4
0
28 Mar 2024
CPR: Retrieval Augmented Generation for Copyright Protection
Aditya Golatkar
Alessandro Achille
L. Zancato
Yu-Xiang Wang
Ashwin Swaminathan
Stefano Soatto
DiffM
25
16
0
27 Mar 2024
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
25
4
0
27 Mar 2024
Supervisory Prompt Training
Jean Ghislain Billa
Min Oh
Liang Du
VLM
LRM
RALM
13
2
0
26 Mar 2024
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Pablo Marcos-Manchón
Roberto Alcover-Couso
Juan C. Sanmiguel
Jose M. Martínez
VLM
37
18
0
21 Mar 2024
Helpful or Harmful? Exploring the Efficacy of Large Language Models for Online Grooming Prevention
Ellie Prosser
Matthew Edwards
19
4
0
14 Mar 2024