Reactor Mk.1 performances: MMLU, HumanEval and BBH test results

15 June 2024

Papers citing "Reactor Mk.1 performances: MMLU, HumanEval and BBH test results"

1 / 1 papers shown

Title
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks Rushang Karia Daniel Bramblett D. Dobhal Siddharth Srivastava ELM LRM 25 0 0 11 Oct 2024