ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.11504
  4. Cited By

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

18 November 2024
Xinyan Guan
Yanjiang Liu
Xinyu Lu
Boxi Cao
Xianpei Han
Jia Zheng
Le Sun
Jie Lou
Bowen Yu
Yaojie Lu
Hongyu Lin
    ALM
ArXiv (abs)PDFHTML

Papers citing "Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering"

4 / 4 papers shown
Title
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
Xueru Wen
Jie Lou
Yaojie Lu
Hongyu Lin
Xing Yu
Xinyu Lu
Xianpei Han
Jia Zheng
Debing Zhang
Le Sun
ALM
125
7
0
17 Feb 2025
Self-Rewarding Language Models
Self-Rewarding Language Models
Weizhe Yuan
Richard Yuanzhe Pang
Kyunghyun Cho
Xian Li
Sainbayar Sukhbaatar
Jing Xu
Jason Weston
ReLMSyDaALMLRM
394
338
0
18 Jan 2024
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Ziyang Luo
Can Xu
Pu Zhao
Qingfeng Sun
Xiubo Geng
Wenxiang Hu
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
ELMSyDaALM
167
694
0
14 Jun 2023
Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions
Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions
Chen Feng Tsai
Xiaochen Zhou
Sierra S. Liu
Jing Li
Mo Yu
Hongyuan Mei
LLMAGELMAI4MHLM&MA
102
32
0
06 Apr 2023
1