
Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization

8 February 2025
Yongcheng Zeng
Xinyu Cui
Xuanfa Jin
Guoqing Liu
Quan He
Dong Li
Ning Yang
Haifeng Zhang
Jun Wang
Jianye Hao
    LLMAG, LRM
arXiv: 2502.05605

Papers citing "Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization"

1 / 1 papers shown
Sherlock: Self-Correcting Reasoning in Vision-Language Models
Yi Ding
Ruqi Zhang
ReLM, LRM, VLM
28 May 2025