Improving the Validity and Practical Usefulness of AI/ML Evaluations Using an Estimands Framework

14 June 2024

Papers citing "Improving the Validity and Practical Usefulness of AI/ML Evaluations Using an Estimands Framework"

2 / 2 papers shown

Title
How to Evaluate Entity Resolution Systems: An Entity-Centric Framework with Application to Inventor Name Disambiguation Olivier Binette Youngsoo Baek Siddharth Engineer Christina Jones Abel Dasylva Jerome P. Reiter 24 2 0 08 Apr 2024
Don't Make Your LLM an Evaluation Benchmark Cheater Kun Zhou Yutao Zhu Zhipeng Chen Wentong Chen Wayne Xin Zhao Xu Chen Yankai Lin Ji-Rong Wen Jiawei Han ELM 105 136 0 03 Nov 2023