ResearchTrend.AI
A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions

30 September 2024
Laurène Vaugrante
Mathias Niepert
Thilo Hagendorff
    LRM
arXiv: 2409.20303

Papers citing "A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions"

1 / 1 papers shown
An Analyst-Inspector Framework for Evaluating Reproducibility of LLMs in Data Science
Qiuhai Zeng
Claire Jin
Xinyue Wang
Yuhan Zheng
Qunhua Li
23 Feb 2025