
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models


23 February 2025
Alexander Zhang
Marcus Dong
J. H. Liu
W. Zhang
Yejie Wang
Jian Yang
Ge Zhang
T. Liu
Zhongyuan Peng
Yingshui Tan
Y. Zhang
Z. Wang
Weixun Wang
Yancheng He
K. Deng
Wangchunshu Zhou
Wenhao Huang
Z. Zhang
    LRM

Papers citing "CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models"

1 / 1 papers shown
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
Yancheng He
Shilong Li
J. Liu
Weixun Wang
Xingyuan Bu
...
Zhongyuan Peng
Z. Zhang
Zhicheng Zheng
Wenbo Su
Bo Zheng
ELM
LRM
26 Feb 2025