ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.06532
  4. Cited By
Active Bayesian Assessment for Black-Box Classifiers
v1v2v3 (latest)

Active Bayesian Assessment for Black-Box Classifiers

16 February 2020
Disi Ji
Robert L Logan IV
Padhraic Smyth
M. Steyvers
    UQCV
ArXiv (abs)PDFHTML

Papers citing "Active Bayesian Assessment for Black-Box Classifiers"

10 / 10 papers shown
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation
Alexander Rubinstein
Benjamin Raible
Martin Gubri
Seong Joon Oh
ELM
483
0
1
09 Oct 2025
Scaling Up Active Testing to Large Language Models
Scaling Up Active Testing to Large Language Models
Gabrielle Berrada
Jannik Kossen
Muhammed Razzak
Freddie Bickford-Smith
Y. Gal
Tom Rainforth
ALM
212
3
0
12 Aug 2025
Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid
  Progress
Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress
Christian Schroeder de Witt
Vishaal Udandarao
Juil Sock
Matthias Bethge
Adel Bibi
Samuel Albanie
271
3
0
29 Feb 2024
tinyBenchmarks: evaluating LLMs with fewer examples
tinyBenchmarks: evaluating LLMs with fewer examples
Felipe Maia Polo
Lucas Weber
Leshem Choshen
Yuekai Sun
Gongjun Xu
Mikhail Yurochkin
ELM
482
201
0
22 Feb 2024
Label-Efficient Model Selection for Text Generation
Label-Efficient Model Selection for Text Generation
Shir Ashury-Tahan
Ariel Gera
Benjamin Sznajder
Leshem Choshen
L. Ein-Dor
Eyal Shnarch
439
12
0
12 Feb 2024
A structured regression approach for evaluating model performance across
  intersectional subgroups
A structured regression approach for evaluating model performance across intersectional subgroupsConference on Fairness, Accountability and Transparency (FAccT), 2024
Christine Herlihy
Kimberly Truong
Alexandra Chouldechova
Miroslav Dudik
345
8
0
26 Jan 2024
Active Assessment of Prediction Services as Accuracy Surface Over
  Attribute Combinations
Active Assessment of Prediction Services as Accuracy Surface Over Attribute CombinationsNeural Information Processing Systems (NeurIPS), 2021
Vihari Piratla
Soumen Chakrabarty
Sunita Sarawagi
231
4
0
14 Aug 2021
Counterfactual Explanations Can Be Manipulated
Counterfactual Explanations Can Be ManipulatedNeural Information Processing Systems (NeurIPS), 2021
Dylan Slack
Sophie Hilgard
Himabindu Lakkaraju
Sameer Singh
279
166
0
04 Jun 2021
Active Testing: Sample-Efficient Model Evaluation
Active Testing: Sample-Efficient Model EvaluationInternational Conference on Machine Learning (ICML), 2021
Jannik Kossen
Sebastian Farquhar
Y. Gal
Tom Rainforth
VLM
352
79
0
09 Mar 2021
Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data
  and Bayesian Inference
Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian InferenceNeural Information Processing Systems (NeurIPS), 2020
Disi Ji
Padhraic Smyth
M. Steyvers
234
54
0
19 Oct 2020
1
Page 1 of 1