Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.21072
Cited By
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
29 July 2024
Marco AF Pimentel
Clément Christophe
Tathagata Raha
Prateek Munjal
Praveen K Kanithi
Shadab Khan
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks"
1 / 1 papers shown
Title
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,927
0
20 Apr 2018
1