ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.12548
  4. Cited By
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
v1v2v3 (latest)

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
25 May 2022
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
ArXiv (abs)PDFHTML

Papers citing "RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning"

50 / 274 papers shown
Title
FIPO: Free-form Instruction-oriented Prompt Optimization with Preference
  Dataset and Modular Fine-tuning Schema
FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema
Junru Lu
Siyu An
Min Zhang
Yulan He
Di Yin
Xing Sun
249
5
0
19 Feb 2024
MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement
  Learning for Discrete Prompt Optimization
MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement Learning for Discrete Prompt Optimization
Yasaman Jafari
Dheeraj Mekala
Rose Yu
Taylor Berg-Kirkpatrick
219
13
0
18 Feb 2024
SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization
SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization
Wendi Cui
Jiaxin Zhang
Zhuohang Li
Damien Lopez
Damien Lopez
Kamalika Das
Sricharan Kumar
Kumar Sricharan
346
7
0
17 Feb 2024
Efficient Prompt Optimization Through the Lens of Best Arm
  Identification
Efficient Prompt Optimization Through the Lens of Best Arm Identification
Chengshuai Shi
Kun Yang
Zihan Chen
Jundong Li
Jing Yang
Cong Shen
253
20
0
15 Feb 2024
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
Xing-ming Guo
Fangxu Yu
Huan Zhang
Lianhui Qin
Bin Hu
AAML
377
145
0
13 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
314
42
0
13 Feb 2024
Personalized Language Modeling from Personalized Human Feedback
Personalized Language Modeling from Personalized Human Feedback
Xinyu Li
Zachary C. Lipton
Liu Leqi
ALM
350
103
0
06 Feb 2024
Skill Set Optimization: Reinforcing Language Model Behavior via
  Transferable Skills
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable SkillsInternational Conference on Machine Learning (ICML), 2024
Kolby Nottingham
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Sameer Singh
Peter Clark
Roy Fox
223
11
0
05 Feb 2024
Intent-based Prompt Calibration: Enhancing prompt optimization with
  synthetic boundary cases
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases
Elad Levi
Eli Brosh
Matan Friedmann
150
18
0
05 Feb 2024
Are Large Language Models Good Prompt Optimizers?
Are Large Language Models Good Prompt Optimizers?
Ruotian Ma
Xiaolei Wang
Xin Zhou
Jian Li
Nan Du
Tao Gui
Tao Gui
Xuanjing Huang
LLMAGLRM
242
39
0
03 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement
  Learning and Large Language Models
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
206
23
0
02 Feb 2024
Stochastic Two Points Method for Deep Model Zeroth-order Optimization
Stochastic Two Points Method for Deep Model Zeroth-order Optimization
Yijiang Pang
Jiayu Zhou
393
1
0
02 Feb 2024
Enhancing Ethical Explanations of Large Language Models through
  Iterative Symbolic Refinement
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement
Xin Quan
Marco Valentino
Louise A. Dennis
André Freitas
LRM
152
18
0
01 Feb 2024
PAP-REC: Personalized Automatic Prompt for Recommendation Language Model
PAP-REC: Personalized Automatic Prompt for Recommendation Language Model
Zelong Li
Jianchao Ji
Yingqiang Ge
Qingfeng Lan
Zelong Li
181
6
0
01 Feb 2024
On Prompt-Driven Safeguarding for Large Language Models
On Prompt-Driven Safeguarding for Large Language Models
Chujie Zheng
Fan Yin
Hao Zhou
Fandong Meng
Jie Zhou
Kai-Wei Chang
Shiyu Huang
Nanyun Peng
AAML
458
98
0
31 Jan 2024
Robust Prompt Optimization for Defending Language Models Against
  Jailbreaking Attacks
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks
Andy Zhou
Bo Li
Haohan Wang
AAML
381
127
0
30 Jan 2024
Gradient-Based Language Model Red Teaming
Gradient-Based Language Model Red Teaming
Nevan Wichers
Carson E. Denison
Ahmad Beirami
219
40
0
30 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Black-Box Access is Insufficient for Rigorous AI AuditsConference on Fairness, Accountability and Transparency (FAccT), 2024
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
488
127
0
25 Jan 2024
PRewrite: Prompt Rewriting with Reinforcement Learning
PRewrite: Prompt Rewriting with Reinforcement LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Weize Kong
Spurthi Amba Hombaiah
Mingyang Zhang
Qiaozhu Mei
Michael Bendersky
LLMAG
180
38
0
16 Jan 2024
AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning
  from Mobile GUI
AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI
Lihang Pan
Bowen Wang
Chun Yu
Yuxuan Chen
Xiangyu Zhang
Yuanchun Shi
135
5
0
26 Dec 2023
Revisiting Few-Shot Object Detection with Vision-Language Models
Revisiting Few-Shot Object Detection with Vision-Language Models
Anish Madan
Neehar Peri
Shu Kong
Deva Ramanan
VLM
345
27
0
22 Dec 2023
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning
Xijie Huang
Li Lyna Zhang
Kwang-Ting Cheng
Fan Yang
Mao Yang
LRMReLM
245
16
0
14 Dec 2023
PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching
PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt MatchingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhenting Qi
Jue Chen
Shaojie Shi
Chao Qu
Yinghui Xu
Yuan Qi
ALM
210
12
0
09 Dec 2023
Prompt Optimization via Adversarial In-Context Learning
Prompt Optimization via Adversarial In-Context LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Do Xuan Long
Yiran Zhao
Hannah Brown
Yuxi Xie
James Xu Zhao
Nancy F. Chen
Kenji Kawaguchi
Michael Qizhe Xie
Junxian He
349
26
0
05 Dec 2023
A Survey on Prompting Techniques in LLMs
A Survey on Prompting Techniques in LLMs
Prabin Bhandari
153
10
0
28 Nov 2023
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt
  Engineer
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt EngineerInternational Conference on Learning Representations (ICLR), 2023
Junyuan Hong
Jiachen T. Wang
Chenhui Zhang
Zhangheng Li
Yue Liu
Zinan Lin
511
56
0
27 Nov 2023
Do Physicians Know How to Prompt? The Need for Automatic Prompt
  Optimization Help in Clinical Note Generation
Do Physicians Know How to Prompt? The Need for Automatic Prompt Optimization Help in Clinical Note Generation
Zonghai Yao
Ahmed Jaafar
Beining Wang
Zhichao Yang
Hong-ye Yu
LM&MA
235
3
0
16 Nov 2023
Automatic Engineering of Long Prompts
Automatic Engineering of Long Prompts
Cho-Jui Hsieh
Si Si
Felix X. Yu
Inderjit S. Dhillon
VLM
156
14
0
16 Nov 2023
Plum: Prompt Learning using Metaheuristic
Plum: Prompt Learning using Metaheuristic
Boyao Wang
Shuo Xing
Shizhe Diao
Wenhe Sun
Xiang Liu
Kashun Shum
Renjie Pi
Jipeng Zhang
Tong Zhang
VLMOffRLLRM
209
7
0
14 Nov 2023
Prompt Engineering a Prompt Engineer
Prompt Engineering a Prompt Engineer
Qinyuan Ye
Maxamed Axmed
Reid Pryzant
Fereshte Khani
VLMLLMAGLRM
303
82
0
09 Nov 2023
Scalable and Transferable Black-Box Jailbreaks for Language Models via
  Persona Modulation
Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation
Rusheb Shah
Quentin Feuillade--Montixi
Soroush Pour
Arush Tagade
Stephen Casper
Javier Rando
242
182
0
06 Nov 2023
Implicit Chain of Thought Reasoning via Knowledge Distillation
Implicit Chain of Thought Reasoning via Knowledge Distillation
Yuntian Deng
Kiran Prasad
Roland Fernandez
P. Smolensky
Vishrav Chaudhary
Stuart M. Shieber
ReLMLRM
202
70
0
02 Nov 2023
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image
  Detectors
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
You-Ming Chang
Chen Yeh
Wei-Chen Chiu
Ning Yu
VPVLMVLM
365
49
0
26 Oct 2023
Exploring Question Decomposition for Zero-Shot VQA
Exploring Question Decomposition for Zero-Shot VQANeural Information Processing Systems (NeurIPS), 2023
Zaid Khan
B. Vijaykumar
S. Schulter
Manmohan Chandraker
Yun Fu
ReLM
182
18
0
25 Oct 2023
MultiPrompter: Cooperative Prompt Optimization with Multi-Agent
  Reinforcement Learning
MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning
Dong-Ki Kim
Sungryull Sohn
Lajanugen Logeswaran
Dongsub Shim
Honglak Lee
LLMAG
181
4
0
25 Oct 2023
PromptAgent: Strategic Planning with Language Models Enables
  Expert-level Prompt Optimization
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt OptimizationInternational Conference on Learning Representations (ICLR), 2023
Xinyuan Wang
Chenxi Li
Zhen Wang
Fan Bai
Haotian Luo
Jiayou Zhang
Nebojsa Jojic
Eric P. Xing
Zhiting Hu
417
184
0
25 Oct 2023
Unnatural language processing: How do language models handle
  machine-generated prompts?
Unnatural language processing: How do language models handle machine-generated prompts?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Corentin Kervadec
Francesca Franzon
Marco Baroni
218
7
0
24 Oct 2023
A Communication Theory Perspective on Prompting Engineering Methods for
  Large Language Models
A Communication Theory Perspective on Prompting Engineering Methods for Large Language ModelsJournal of Computational Science and Technology (JCST), 2023
Wailing Ng
Yuanqin He
Xuefang Zhao
Hanlin Gu
Chen Zhang
Haijun Yang
Lixin Fan
Qiang Yang
192
6
0
24 Oct 2023
Unleashing the potential of prompt engineering in Large Language Models:
  a comprehensive review
Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review
Banghao Chen
Zhaofeng Zhang
Nicolas Langrené
Shengxin Zhu
LLMAG
389
89
0
23 Oct 2023
Monte Carlo Thought Search: Large Language Model Querying for Complex
  Scientific Reasoning in Catalyst Design
Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst DesignConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Henry W Sprueill
Carl Edwards
Mariefel V. Olarte
Udishnu Sanyal
Heng Ji
Sutanay Choudhury
LRM
268
14
0
22 Oct 2023
Prompt Engineering Through the Lens of Optimal Control
Prompt Engineering Through the Lens of Optimal ControlJournal of Machine Learning (JML), 2023
Yifan Luo
Yiming Tang
Chengfeng Shen
Zhennan Zhou
Bin Dong
OffRL
214
14
0
22 Oct 2023
Auto-Instruct: Automatic Instruction Generation and Ranking for
  Black-Box Language Models
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Zhihan Zhang
Shuohang Wang
Wenhao Yu
Yichong Xu
Dan Iter
Qingkai Zeng
Yang Liu
Chenguang Zhu
Meng Jiang
SyDaALM
143
28
0
19 Oct 2023
Survival of the Most Influential Prompts: Efficient Black-Box Prompt
  Search via Clustering and Pruning
Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning
Han Zhou
Xingchen Wan
Ivan Vulić
Anna Korhonen
LLMAG
146
25
0
19 Oct 2023
Quantifying Language Models' Sensitivity to Spurious Features in Prompt
  Design or: How I learned to start worrying about prompt formatting
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formattingInternational Conference on Learning Representations (ICLR), 2023
Melanie Sclar
Yejin Choi
Yulia Tsvetkov
Alane Suhr
289
530
0
17 Oct 2023
Denevil: Towards Deciphering and Navigating the Ethical Values of Large
  Language Models via Instruction Learning
Denevil: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction LearningInternational Conference on Learning Representations (ICLR), 2023
Shitong Duan
Xiaoyuan Yi
Peng Zhang
Tun Lu
Xing Xie
Ning Gu
210
23
0
17 Oct 2023
Rephrase, Augment, Reason: Visual Grounding of Questions for
  Vision-Language Models
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Archiki Prasad
Elias Stengel-Eskin
Mohit Bansal
ReLMLRM
220
13
0
09 Oct 2023
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Zihao Lin
Yan Sun
Yifan Shi
Xueqian Wang
Lifu Huang
Li Shen
Dacheng Tao
228
15
0
04 Oct 2023
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across
  Language Models
Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Zijun Wu
Yongkang Wu
Lili Mou
VLM
189
8
0
02 Oct 2023
What's the Magic Word? A Control Theory of LLM Prompting
What's the Magic Word? A Control Theory of LLM Prompting
Aman Bhargava
Cameron Witkowski
Manav Shah
Matt W. Thomson
LLMAG
392
46
0
02 Oct 2023
SPELL: Semantic Prompt Evolution based on a LLM
SPELL: Semantic Prompt Evolution based on a LLM
Yujian Betterest Li
Kai Wu
181
17
0
02 Oct 2023
Previous
123456
Next