Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.22169
Cited By
v1
v2 (latest)
ReliableEval: A Recipe for Stochastic LLM Evaluation via Method of Moments
28 May 2025
Gili Lior
Eliya Habba
Shahar Levy
Avi Caciularu
Gabriel Stanovsky
Re-assign community
ArXiv (abs)
PDF
HTML
Github (4046★)
Papers citing
"ReliableEval: A Recipe for Stochastic LLM Evaluation via Method of Moments"
2 / 2 papers shown
PromptSuite: A Task-Agnostic Framework for Multi-Prompt Generation
Eliya Habba
Noam Dahan
Gili Lior
Gabriel Stanovsky
LRM
352
1
0
20 Jul 2025
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Eliya Habba
Ofir Arviv
Itay Itzhak
Yotam Perlitz
Elron Bandel
Leshem Choshen
Michal Shmueli-Scheuer
Gabriel Stanovsky
360
12
0
03 Mar 2025
1
Page 1 of 1