ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.10366
  4. Cited By
Improving the Validity and Practical Usefulness of AI/ML Evaluations
  Using an Estimands Framework

Improving the Validity and Practical Usefulness of AI/ML Evaluations Using an Estimands Framework

14 June 2024
Olivier Binette
Jerome P. Reiter
ArXivPDFHTML

Papers citing "Improving the Validity and Practical Usefulness of AI/ML Evaluations Using an Estimands Framework"

2 / 2 papers shown
Title
How to Evaluate Entity Resolution Systems: An Entity-Centric Framework
  with Application to Inventor Name Disambiguation
How to Evaluate Entity Resolution Systems: An Entity-Centric Framework with Application to Inventor Name Disambiguation
Olivier Binette
Youngsoo Baek
Siddharth Engineer
Christina Jones
Abel Dasylva
Jerome P. Reiter
24
2
0
08 Apr 2024
Don't Make Your LLM an Evaluation Benchmark Cheater
Don't Make Your LLM an Evaluation Benchmark Cheater
Kun Zhou
Yutao Zhu
Zhipeng Chen
Wentong Chen
Wayne Xin Zhao
Xu Chen
Yankai Lin
Ji-Rong Wen
Jiawei Han
ELM
105
136
0
03 Nov 2023
1