2403.01185
Cited By
Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding
2 March 2024
Ha-Thanh Nguyen
Ken Satoh
Papers citing "Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding"
2 / 2 papers shown
Title
An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems
Hashmath Shaik
Alex Doboli
OffRL
ELM
68
0
0
31 Dec 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022