Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2201.07040
Cited By

Benchmark datasets driving artificial intelligence development fail to
capture the needs of medical professionals

v1v2 (latest)

Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals

Journal of Biomedical Informatics (JBI), 2022

18 January 2022

Wolfgang Frühwirt

Matthias Samwald

ArXiv (abs)PDF HTML Github (76★)

Papers citing "Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals"

12 / 12 papers shown

Cognitive bias in LLM reasoning compromises interpretation of clinical oncology notes

Cognitive bias in LLM reasoning compromises interpretation of clinical oncology notes

Matthew W. Kenaston

Muhammad Umair Anjum

Syed Arsalan Ahmed Naqvi

...

Eliezer M. Van Allen

101

0

0

16 Nov 2025

Towards Real-World Validity in Generative AI Benchmarks: Understanding and Designing Domain-Centered Evaluations for Journalism Practitioners

Towards Real-World Validity in Generative AI Benchmarks: Understanding and Designing Domain-Centered Evaluations for Journalism Practitioners

Nick Diakopoulos

140

1

0

30 Sep 2025

Towards Experience-Centered AI: A Framework for Integrating Lived Experience in Design and Development

Towards Experience-Centered AI: A Framework for Integrating Lived Experience in Design and Development

Tatiana Chakravorti

M. D. Choudhury

101

0

0

09 Aug 2025

It's Not the Target, It's the Background: Rethinking Infrared Small Target Detection via Deep Patch-Free Low-Rank Representations

It's Not the Target, It's the Background: Rethinking Infrared Small Target Detection via Deep Patch-Free Low-Rank RepresentationsIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025

661

0

0

12 Jun 2025

Enhanced prediction of spine surgery outcomes using advanced machine learning techniques and oversampling methods

Enhanced prediction of spine surgery outcomes using advanced machine learning techniques and oversampling methodsHealth Information Science and Systems (HISS), 2025

J. Benítez-Andrades

C. Prada-García

Nicolás Ordás-Reyes

Marta Esteban Blanco

Antonio Serrano-García

278

7

0

23 Mar 2025

Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation

Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation

Erasmo Purificato

Arman Noroozian

Guillaume Chaslot

David Fernandez-Llorca

837

47

0

10 Feb 2025

PhilHumans: Benchmarking Machine Learning for Personal Health

PhilHumans: Benchmarking Machine Learning for Personal Health

Vadim Liventsev

Allmin Pradhap Singh Susaiyah

Zixiu "Alex" Wu

...

Diego Reforgiato Recupero

Raymond Sterling

291

0

0

04 May 2024

CSMeD: Bridging the Dataset Gap in Automated Citation Screening for
Systematic Literature Reviews

CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature ReviewsNeural Information Processing Systems (NeurIPS), 2023

Óscar E. Mendoza

Matthias Samwald

339

8

0

21 Nov 2023

BenchMD: A Benchmark for Unified Learning on Medical Images and Sensors

BenchMD: A Benchmark for Unified Learning on Medical Images and Sensors

Kathryn Wantlin

Shih-Cheng Huang

Farah Z. Dadabhoy

...

Pranav Rajpurkar

204

5

0

17 Apr 2023

The Shaky Foundations of Clinical Foundation Models: A Survey of Large
Language Models and Foundation Models for EMRs

The Shaky Foundations of Clinical Foundation Models: A Survey of Large Language Models and Foundation Models for EMRs

Scott L. Fleming

Jason Alan Fries

386

39

0

22 Mar 2023

Mapping global dynamics of benchmark creation and saturation in
artificial intelligence

Mapping global dynamics of benchmark creation and saturation in artificial intelligenceNature Communications (Nat Commun), 2022

A. Barbosa-Silva

Matthias Samwald

372

77

0

09 Mar 2022

A curated, ontology-based, large-scale knowledge graph of artificial
intelligence tasks and benchmarks

A curated, ontology-based, large-scale knowledge graph of artificial intelligence tasks and benchmarks

A. Barbosa-Silva

Matthias Samwald

346

35

0

04 Oct 2021

Page 1 of 1