ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.12548
  4. Cited By
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
v1v2v3 (latest)

RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
25 May 2022
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
ArXiv (abs)PDFHTML

Papers citing "RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning"

24 / 274 papers shown
Title
Guiding Large Language Models via Directional Stimulus Prompting
Guiding Large Language Models via Directional Stimulus PromptingNeural Information Processing Systems (NeurIPS), 2023
Zekun Li
Baolin Peng
Pengcheng He
Michel Galley
Jianfeng Gao
Xi Yan
LLMAGLRMLM&Ro
399
129
0
22 Feb 2023
Complex QA and language models hybrid architectures, Survey
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
623
17
0
17 Feb 2023
Evaluating the Robustness of Discrete Prompts
Evaluating the Robustness of Discrete PromptsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Yoichi Ishibashi
Danushka Bollegala
Katsuhito Sudoh
Satoshi Nakamura
139
20
0
11 Feb 2023
The Wisdom of Hindsight Makes Language Models Better Instruction
  Followers
The Wisdom of Hindsight Makes Language Models Better Instruction FollowersInternational Conference on Machine Learning (ICML), 2023
Tianjun Zhang
Fangchen Liu
Justin Wong
Pieter Abbeel
Joseph E. Gonzalez
192
58
0
10 Feb 2023
Explanation Selection Using Unlabeled Data for Chain-of-Thought
  Prompting
Explanation Selection Using Unlabeled Data for Chain-of-Thought PromptingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xi Ye
Greg Durrett
LRMReLM
198
14
0
09 Feb 2023
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt
  Tuning and Discovery
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and DiscoveryNeural Information Processing Systems (NeurIPS), 2023
Yuxin Wen
Neel Jain
John Kirchenbauer
Micah Goldblum
Jonas Geiping
Tom Goldstein
VLMDiffM
325
356
1
07 Feb 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with
  Multimodal Models
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramana
VLM
431
150
0
16 Jan 2023
Parameter-Efficient Fine-Tuning Design Spaces
Parameter-Efficient Fine-Tuning Design SpacesInternational Conference on Learning Representations (ICLR), 2023
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
254
76
0
04 Jan 2023
Toward Human Readable Prompt Tuning: Kubrick's The Shining is a good
  movie, and a good prompt too?
Toward Human Readable Prompt Tuning: Kubrick's The Shining is a good movie, and a good prompt too?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Weijia Shi
Xiaochuang Han
Hila Gonen
Ari Holtzman
Yulia Tsvetkov
Luke Zettlemoyer
189
56
0
20 Dec 2022
Self-Adaptive In-Context Learning: An Information Compression
  Perspective for In-Context Example Selection and Ordering
Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and OrderingAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Zhiyong Wu
Yaoxiang Wang
Jiacheng Ye
Lingpeng Kong
264
183
0
20 Dec 2022
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
PromptBoosting: Black-Box Text Classification with Ten Forward PassesInternational Conference on Machine Learning (ICML), 2022
Bairu Hou
J. O'Connor
Jacob Andreas
Shiyu Chang
Yang Zhang
VLM
196
51
0
19 Dec 2022
Decoder Tuning: Efficient Language Understanding as Decoding
Decoder Tuning: Efficient Language Understanding as DecodingAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Ganqu Cui
Wentao Li
Ning Ding
Longtao Huang
Zhiyuan Liu
Maosong Sun
197
7
0
16 Dec 2022
Demystifying Prompts in Language Models via Perplexity Estimation
Demystifying Prompts in Language Models via Perplexity EstimationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hila Gonen
Srini Iyer
Terra Blevins
Noah A. Smith
Luke Zettlemoyer
LRM
365
273
0
08 Dec 2022
T-STAR: Truthful Style Transfer using AMR Graph as Intermediate
  Representation
T-STAR: Truthful Style Transfer using AMR Graph as Intermediate RepresentationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Anubhav Jangra
Preksha Nema
A. Raghuveer
138
8
0
03 Dec 2022
Designing Ecosystems of Intelligence from First Principles
Designing Ecosystems of Intelligence from First PrinciplesCollective Intelligence (CI), 2022
Karl J. Friston
M. Ramstead
Alex B. Kiefer
Alexander Tschantz
Christopher L. Buckley
...
K. Fung
Jason G. Fox
Steven Swanson
D. Mapes
Gabriel René
274
47
0
02 Dec 2022
TEMPERA: Test-Time Prompting via Reinforcement Learning
TEMPERA: Test-Time Prompting via Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Tianjun Zhang
Xuezhi Wang
Denny Zhou
Dale Schuurmans
Joseph E. Gonzalez
VLM
110
45
0
21 Nov 2022
Zero-Label Prompt Selection
Zero-Label Prompt Selection
Chonghua Liao
Yanan Zheng
Zhilin Yang
VLM
136
8
0
09 Nov 2022
Rainier: Reinforced Knowledge Introspector for Commonsense Question
  Answering
Rainier: Reinforced Knowledge Introspector for Commonsense Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hamish Ivison
Skyler Hallinan
Ximing Lu
Pengfei He
Sean Welleck
Hannaneh Hajishirzi
Yejin Choi
RALM
242
62
0
06 Oct 2022
Explaining Patterns in Data with Language Models via Interpretable
  Autoprompting
Explaining Patterns in Data with Language Models via Interpretable Autoprompting
Chandan Singh
John X. Morris
J. Aneja
Alexander M. Rush
Jianfeng Gao
LRM
167
0
0
04 Oct 2022
PromptFL: Let Federated Participants Cooperatively Learn Prompts Instead
  of Models -- Federated Learning in Age of Foundation Model
PromptFL: Let Federated Participants Cooperatively Learn Prompts Instead of Models -- Federated Learning in Age of Foundation ModelIEEE Transactions on Mobile Computing (IEEE TMC), 2022
Tao Guo
Song Guo
Junxiao Wang
Wenchao Xu
FedMLVLMLRM
176
182
0
24 Aug 2022
BBTv2: Towards a Gradient-Free Future with Large Language Models
BBTv2: Towards a Gradient-Free Future with Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tianxiang Sun
Zhengfu He
Hong Qian
Yunhua Zhou
Xuanjing Huang
Xipeng Qiu
254
70
0
23 May 2022
Black-box Prompt Learning for Pre-trained Language Models
Black-box Prompt Learning for Pre-trained Language Models
Shizhe Diao
Zhichao Huang
Ruijia Xu
Xuechun Li
Yong Lin
Xiao Zhou
Tong Zhang
VLMAAML
270
83
0
21 Jan 2022
Toward a `Standard Model' of Machine Learning
Toward a `Standard Model' of Machine Learning
Zhiting Hu
Eric Xing
283
15
0
17 Aug 2021
Efficient (Soft) Q-Learning for Text Generation with Limited Good Data
Efficient (Soft) Q-Learning for Text Generation with Limited Good DataConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Han Guo
Bowen Tan
Zhengzhong Liu
Eric P. Xing
Zhiting Hu
OffRL
204
40
0
14 Jun 2021
Previous
123456