ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.12857
  4. Cited By
Enterprise Benchmarks for Large Language Model Evaluation

Enterprise Benchmarks for Large Language Model Evaluation

11 October 2024
Bing Zhang
Mikio Takeuchi
Ryo Kawahara
Shubhi Asthana
Md. Maruf Hossain
Guang-Jie Ren
Kate Soule
Yada Zhu
    ELM
ArXivPDFHTML

Papers citing "Enterprise Benchmarks for Large Language Model Evaluation"

2 / 2 papers shown
Title
LLMs Outperform Experts on Challenging Biology Benchmarks
LLMs Outperform Experts on Challenging Biology Benchmarks
Lennart Justen
ELM
11
0
0
09 May 2025
Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon
Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon
Nurit Cohen-Inger
Yehonatan Elisha
Bracha Shapira
L. Rokach
Seffi Cohen
ELM
86
0
0
11 Feb 2025
1