ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.10515
  4. Cited By
Reactor Mk.1 performances: MMLU, HumanEval and BBH test results

Reactor Mk.1 performances: MMLU, HumanEval and BBH test results

15 June 2024
TJ Dunham
Henry Syahputra
ArXivPDFHTML

Papers citing "Reactor Mk.1 performances: MMLU, HumanEval and BBH test results"

1 / 1 papers shown
Title
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia
Daniel Bramblett
D. Dobhal
Siddharth Srivastava
ELM
LRM
25
0
0
11 Oct 2024
1