ResearchTrend.AI


Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding

2 March 2024
Ha-Thanh Nguyen
Ken Satoh

Papers citing "Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding"

2 / 2 papers shown
Title
An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems
Hashmath Shaik
Alex Doboli
OffRL
ELM
68
0
0
31 Dec 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022