arXiv: 2410.22368

Project MPG: towards a generalized performance benchmark for LLM capabilities

28 October 2024

Nick Masiewicki

Xerxes Dotiwalla

Rama Parusmathi

Peter Grabowski

Papers citing "Project MPG: towards a generalized performance benchmark for LLM capabilities"

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

Balázs Galambosi

Abigail Z. Jacobs

Tatsunori Hashimoto

06 Apr 2024