arXiv: 2506.09942
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

11 June 2025
Hao Peng
Yunjia Qi
Xiaozhi Wang
Bin Xu
Lei Hou
Juanzi Li
    OffRL
ArXiv (abs) | PDF | HTML | HuggingFace (6 upvotes)

Papers citing "VerIF: Verification Engineering for Reinforcement Learning in Instruction Following"

5 / 5 papers shown
DecepChain: Inducing Deceptive Reasoning in Large Language Models
Wei Shen
Han Wang
Xue Yang
Huan Zhang
LRM
171
1
0
30 Sep 2025
GSPR: Aligning LLM Safeguards as Generalizable Safety Policy Reasoners
Xue Yang
Yulin Chen
Jingru Zeng
Hao Peng
Huihao Jing
Wenbin Hu
Xi Yang
Ziqian Zeng
Sirui Han
Yangqiu Song
LRM
112
1
0
29 Sep 2025
PSRT: Accelerating LRM-based Guard Models via Prefilled Safe Reasoning Traces
Jiawei Zhao
Yuang Qi
Weiming Zhang
Nenghai Yu
Kejiang Chen
LRM
138
0
0
26 Sep 2025
A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models
Yanbo Wang
Yongcan Yu
Jian Liang
Ran He
HILM, LRM
205
6
0
04 Sep 2025
Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning
Rui Pu
Chaozhuo Li
Rui Ha
Litian Zhang
Lirong Qiu
Xi Zhang
AAML, LRM
151
1
0
05 Aug 2025