2407.02464
Cited By
Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I.
2 July 2024
Harrie Oosterhuis
R. Jagerman
Zhen Qin
Xuanhui Wang
Michael Bendersky
Papers citing
"Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I."
3 / 3 papers shown
Title
LLM-Evaluation Tropes: Perspectives on the Validity of LLM-Evaluations
Laura Dietz
Oleg Zendel
P. Bailey
Charles L. A. Clarke
Ellese Cotterill
Jeff Dalton
Faegheh Hasibi
Mark Sanderson
Nick Craswell
ELM
43
0
0
27 Apr 2025
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
229
961
0
17 Apr 2021