Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.12857
Cited By
Enterprise Benchmarks for Large Language Model Evaluation
11 October 2024
Bing Zhang
Mikio Takeuchi
Ryo Kawahara
Shubhi Asthana
Md. Maruf Hossain
Guang-Jie Ren
Kate Soule
Yada Zhu
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enterprise Benchmarks for Large Language Model Evaluation"
2 / 2 papers shown
Title
LLMs Outperform Experts on Challenging Biology Benchmarks
Lennart Justen
ELM
11
0
0
09 May 2025
Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon
Nurit Cohen-Inger
Yehonatan Elisha
Bracha Shapira
L. Rokach
Seffi Cohen
ELM
83
0
0
11 Feb 2025
1