SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

16 December 2024

Papers citing "SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models"

2 / 2 papers shown

Title
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning Yafu Li Zhilin Wang Tingchen Fu Ganqu Cui Sen Yang Yu Cheng 36 1 0 21 Jan 2025
The Superalignment of Superhuman Intelligence with Large Language Models Minlie Huang Yingkang Wang Shiyao Cui Pei Ke J. Tang 101 1 0 15 Dec 2024