ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.12548
  4. Cited By
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
v1v2v3 (latest)

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
25 May 2022
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
ArXiv (abs)PDFHTML

Papers citing "RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning"

50 / 274 papers shown
Title
ELPO: Ensemble Learning Based Prompt Optimization for Large Language Models
Qing Zhang
Bing Xu
X. R. Zhang
Yifan Shi
Yang Li
...
Ngai Wong
Yijie Chen
Hong Dai
X. Chen
M. Zhang
88
0
0
20 Nov 2025
GPS: General Per-Sample Prompter
GPS: General Per-Sample Prompter
Pawel Batorski
Paul Swoboda
12
1
0
18 Nov 2025
Transformer Injectivity & Geometric Robustness - Analytic Margins and Bi-Lipschitz Uniformity of Sequence-Level Hidden States
Transformer Injectivity & Geometric Robustness - Analytic Margins and Bi-Lipschitz Uniformity of Sequence-Level Hidden States
Mikael von Strauss
60
0
0
17 Nov 2025
A Toolbox for Improving Evolutionary Prompt Search
A Toolbox for Improving Evolutionary Prompt Search
Daniel Grießhaber
Maximilian Kimmich
J. Maucher
Ngoc Thang Vu
178
0
0
07 Nov 2025
Auto prompting without training labels: An LLM cascade for product quality assessment in e-commerce catalogs
Auto prompting without training labels: An LLM cascade for product quality assessment in e-commerce catalogs
Soham Satyadharma
Fatemeh Sheikholeslami
Swati Kaul
Aziz Umit Batur
Suleiman Ali Khan
RALMLRM
97
0
0
27 Oct 2025
How to Auto-optimize Prompts for Domain Tasks? Adaptive Prompting and Reasoning through Evolutionary Domain Knowledge Adaptation
How to Auto-optimize Prompts for Domain Tasks? Adaptive Prompting and Reasoning through Evolutionary Domain Knowledge Adaptation
Yang Zhao
Pu Wang
Hao Frank Yang
LRM
64
0
0
24 Oct 2025
Query Decomposition for RAG: Balancing Exploration-Exploitation
Query Decomposition for RAG: Balancing Exploration-Exploitation
Roxana Petcu
Kenton W. Murray
Daniel Khashabi
Evangelos Kanoulas
Maarten de Rijke
Dawn J Lawrie
Kevin Duh
80
0
0
21 Oct 2025
Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI
Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI
Zheng Huang
Enpei Zhang
Yinghao Cai
Weikang Qiu
Carl Yang
Elynn Chen
Xiang Zhang
Rex Ying
Dawei Zhou
Yujun Yan
DiffM
92
0
0
17 Oct 2025
PromptFlow: Training Prompts Like Neural Networks
PromptFlow: Training Prompts Like Neural Networks
Jingyi Wang
Hongyuan Zhu
Ye Niu
Yunhui Deng
VLM
102
0
0
14 Oct 2025
ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
ArenaBencher: Automatic Benchmark Evolution via Multi-Model Competitive Evaluation
Qin Liu
Jacob Dineen
Y. Huang
Sheng Zhang
Hoifung Poon
Ben Zhou
Muhao Chen
ELM
128
0
0
09 Oct 2025
Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
Yuwen Tan
Xiang Xiang
Kun He
John E. Hopcroft
86
0
0
08 Oct 2025
LLM Based Bayesian Optimization for Prompt Search
LLM Based Bayesian Optimization for Prompt Search
Adam Ballew
Jingbo Wang
Shaogang Ren
134
0
0
05 Oct 2025
FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
Pingchen Lu
Zhi Hong
Zhiwei Shang
Zhiyong Wang
Yikun Ban
Yao Shu
Min Zhang
Shuang Qiu
Zhongxiang Dai
FedML
88
0
0
29 Sep 2025
Prompt and Parameter Co-Optimization for Large Language Models
Prompt and Parameter Co-Optimization for Large Language Models
Xiaohe Bo
Rui Li
Guoqing Liu
Quanyu Dai
Zeyu Zhang
Zihang Tian
Xu Chen
Zhenhua Dong
68
0
0
29 Sep 2025
No Loss, No Gain: Gated Refinement and Adaptive Compression for Prompt Optimization
No Loss, No Gain: Gated Refinement and Adaptive Compression for Prompt Optimization
Wenhang Shi
Yiren Chen
Shuqing Bian
Xinyi Zhang
Kai Tang
Pengfei Hu
Zhe Zhao
Wei Lu
Xiaoyong Du
76
0
0
27 Sep 2025
Bounds of Chain-of-Thought Robustness: Reasoning Steps, Embed Norms, and Beyond
Bounds of Chain-of-Thought Robustness: Reasoning Steps, Embed Norms, and Beyond
Dingzirui Wang
Xuanliang Zhang
Keyan Xu
Qingfu Zhu
Wanxiang Che
Yang Deng
LRM
154
0
0
25 Sep 2025
Topic Coverage-based Demonstration Retrieval for In-Context Learning
Topic Coverage-based Demonstration Retrieval for In-Context Learning
Wonbin Kweon
SeongKu Kang
Runchu Tian
Pengcheng Jiang
Jiawei Han
Hwanjo Yu
88
0
0
15 Sep 2025
MAPGD: Multi-Agent Prompt Gradient Descent for Collaborative Prompt Optimization
MAPGD: Multi-Agent Prompt Gradient Descent for Collaborative Prompt Optimization
Yichen Han
Bojun Liu
Zhengpeng Zhou
Zhengpeng Zhou
Zeng Zhang
...
Wenli Wang
Isaac Shi
Lewei He
Lewei He
Tianyu Shi
LLMAGAI4CE
171
1
0
14 Sep 2025
Better by Comparison: Retrieval-Augmented Contrastive Reasoning for Automatic Prompt Optimization
Better by Comparison: Retrieval-Augmented Contrastive Reasoning for Automatic Prompt Optimization
Juhyeon Lee
Wonduk Seo
Hyunjin An
Seunghyun Lee
Yi Bu
LRM
128
1
0
02 Sep 2025
APIO: Automatic Prompt Induction and Optimization for Grammatical Error Correction and Text Simplification
APIO: Automatic Prompt Induction and Optimization for Grammatical Error Correction and Text Simplification
Artem Chernodub
Aman Saini
Yejin Huh
Vivek Kulkarni
Vipul Raheja
93
0
0
12 Aug 2025
Decoupling Understanding from Reasoning via Problem Space Mapping for Small-Scale Model Reasoning
Decoupling Understanding from Reasoning via Problem Space Mapping for Small-Scale Model Reasoning
Li Wang
Changhao Zhang
Zengqi Xiu
Kai Lu
Xin Yu
Kui Zhang
Wenjun Wu
LRM
104
0
0
07 Aug 2025
Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs
Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs
Noah Ziems
Dilara Soylu
Lakshya A Agrawal
Isaac Miller
Liheng Lai
...
Dan Klein
Matei A. Zaharia
Karel DÓosterlinck
Christopher Potts
Omar Khattab
150
0
0
06 Aug 2025
LatentPrompt: Optimizing Promts in Latent Space
LatentPrompt: Optimizing Promts in Latent Space
Mateusz Bystroński
Grzegorz Piotrowski
Nitesh Chawla
Tomasz Kajdanowicz
VLM
55
0
0
04 Aug 2025
A Survey on AgentOps: Categorization, Challenges, and Future Directions
A Survey on AgentOps: Categorization, Challenges, and Future Directions
Zexin Wang
Jingjing Li
Quan Zhou
Haotian Si
Yuanhao Liu
Jianhui Li
Gaogang Xie
Fei Sun
Dan Pei
Changhua Pei
LLMAGAI4TS
154
0
0
04 Aug 2025
Test-Time Model Adaptation for Quantized Neural Networks
Test-Time Model Adaptation for Quantized Neural Networks
Zeshuai Deng
Guohao Chen
Shuaicheng Niu
Hui Luo
Shuhai Zhang
Yifan Yang
Renjie Chen
Wei Luo
Mingkui Tan
MQ
139
1
0
04 Aug 2025
TRPrompt: Bootstrapping Query-Aware Prompt Optimization from Textual Rewards
TRPrompt: Bootstrapping Query-Aware Prompt Optimization from Textual Rewards
Andreea Nica
Ivan Zakazov
Nicolas Mario Baldwin
Saibo Geng
Robert West
ReLMLRM
135
1
0
24 Jul 2025
P3: Prompts Promote Prompting
P3: Prompts Promote PromptingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Xinyu Zhang
Yuanquan Hu
Fangchao Liu
Zhicheng Dou
LLMAG
70
1
0
21 Jul 2025
Prompt4Trust: A Reinforcement Learning Prompt Augmentation Framework for Clinically-Aligned Confidence Calibration in Multimodal Large Language Models
Prompt4Trust: A Reinforcement Learning Prompt Augmentation Framework for Clinically-Aligned Confidence Calibration in Multimodal Large Language Models
Anita Kriz
Elizabeth Laura Janes
Xing Shen
Tal Arbel
LRM
206
1
0
12 Jul 2025
RiOT: Efficient Prompt Refinement with Residual Optimization Tree
RiOT: Efficient Prompt Refinement with Residual Optimization TreeAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Chenyi Zhou
Zhengyan Shi
Xingtai Lv
Lei Liang
H. Chen
Qiang Zhang
138
1
0
19 Jun 2025
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
Junqi Jiang
Tom Bewley
Salim I. Amoukou
Francesco Leofante
Antonio Rago
Saumitra Mishra
Francesca Toni
163
1
0
18 Jun 2025
FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning
FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning
Ganyu Wang
Jinjie Fang
Maxwell J. Ying
Bin Gu
Xi Chen
Boyu Wang
Yi Chang
Charles Ling
FedML
240
1
0
17 Jun 2025
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search
Dongge Han
Menglin Xia
Daniel Madrigal Diaz
Samuel Kessler
Ankur Mallick
Xuchao Zhang
Mirian Hipolito Garcia
Jin Xu
Victor Rühle
Saravan Rajmohan
LRM
187
0
0
10 Jun 2025
What Makes a Good Natural Language Prompt?
What Makes a Good Natural Language Prompt?Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Do Xuan Long
Duy Dinh
Ngoc-Hai Nguyen
Kenji Kawaguchi
Nancy F. Chen
Shafiq Joty
Min-Yen Kan
194
6
0
07 Jun 2025
ProRefine: Inference-Time Prompt Refinement with Textual Feedback
ProRefine: Inference-Time Prompt Refinement with Textual Feedback
Deepak Pandita
Tharindu Cyril Weerasooriya
A. Shah
Christopher Homan
Christopher Homan
Wei Wei
LLMAGReLMLRM
458
2
0
05 Jun 2025
Fast or Slow? Integrating Fast Intuition and Deliberate Thinking for Enhancing Visual Question Answering
Fast or Slow? Integrating Fast Intuition and Deliberate Thinking for Enhancing Visual Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Songtao Jiang
Chenyi Zhou
Yan Zhang
Yeying Jin
Zuozhu Liu
LRM
205
2
0
01 Jun 2025
Tournament of Prompts: Evolving LLM Instructions Through Structured Debates and Elo Ratings
Tournament of Prompts: Evolving LLM Instructions Through Structured Debates and Elo Ratings
Anirudh Nair
Adi Banerjee
Laurent Mombaerts
Matthew Hagen
Tarik Borogovac
237
2
0
30 May 2025
Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development
Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development
Ming Shen
Raphael Shu
Anurag Pratik
James Gung
Yubin Ge
Monica Sunkara
Yi Zhang
LLMAG
234
3
0
22 May 2025
CIE: Controlling Language Model Text Generations Using Continuous Signals
CIE: Controlling Language Model Text Generations Using Continuous Signals
Vinay Samuel
Harshita Diddee
Yiming Zhang
Daphne Ippolito
311
0
0
19 May 2025
Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems
Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems
Ke Chen
Yufei Zhou
Xitong Zhang
Haohan Wang
221
4
0
19 May 2025
PromptPrism: A Linguistically-Inspired Taxonomy for Prompts
PromptPrism: A Linguistically-Inspired Taxonomy for Prompts
Sullam Jeoung
Yueyan Chen
Yi Zhang
Shuai Wang
Haibo Ding
Lin Lee Cheong
200
1
0
19 May 2025
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space
Hengli Li
Chenxi Li
Tong Wu
Xuekai Zhu
Yuxuan Wang
...
Eric Hanchen Jiang
Song-Chun Zhu
Zixia Jia
Ying Nian Wu
Zilong Zheng
LRM
369
17
0
19 May 2025
The Hitchhikers Guide to Production-ready Trustworthy Foundation Model powered Software (FMware)
The Hitchhikers Guide to Production-ready Trustworthy Foundation Model powered Software (FMware)
Kirill Vasilevski
Benjamin Rombaut
Gopi Krishnan Rajbahadur
G. Oliva
Keheliya Gallaba
...
Haoxiang Zhang
Bouyan Chen
Kishanthan Thangarajah
Ahmed E. Hassan
Zhen Ming
273
0
0
15 May 2025
Model Performance-Guided Evaluation Data Selection for Effective Prompt Optimization
Model Performance-Guided Evaluation Data Selection for Effective Prompt OptimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Ximing Dong
Shaowei Wang
Dayi Lin
Ahmed E. Hassan
364
2
0
15 May 2025
PLHF: Prompt Optimization with Few-Shot Human Feedback
PLHF: Prompt Optimization with Few-Shot Human Feedback
Chun-Pai Yang
Kan Zheng
Shou-De Lin
205
1
0
11 May 2025
Prompt Engineering and the Effectiveness of Large Language Models in Enhancing Human Productivity
Prompt Engineering and the Effectiveness of Large Language Models in Enhancing Human Productivity
Rizal Khoirul Anam
77
2
0
10 May 2025
CAPO: Cost-Aware Prompt Optimization
CAPO: Cost-Aware Prompt Optimization
Tom Zehle
Moritz Schlager
Timo Heiß
Matthias Feurer
VLM
522
0
0
22 Apr 2025
Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions
Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions
Luyang Fang
Xiaowei Yu
Jianfeng Cai
Yongkai Chen
Shushan Wu
...
Xiaoming Zhai
Dajiang Zhu
Wenxuan Zhong
Tianming Liu
Ping Ma
ALM
152
13
0
20 Apr 2025
DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation
DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation
Jinyang Li
Sangwon Hyun
Muhammad Ali Babar
146
1
0
06 Apr 2025
Rethinking Reflection in Pre-Training
Rethinking Reflection in Pre-Training
Essential AI
Darsh J Shah
Peter Rushton
Somanshu Singla
Mohit Parmar
...
Philip Monk
Platon Mazarakis
Ritvik Kapila
Saurabh Srivastava
Tim Romanski
ReLMLRM
401
37
0
05 Apr 2025
GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization
GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization
Wenliang Zheng
Sarkar Snigdha Sarathi Das
Yusen Zhang
Rui Zhang
271
2
0
04 Apr 2025
123456
Next