Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals

Journal of Biomedical Informatics (JBI), 2022
18 January 2022
Kathrin Blagec
J. Kraiger
Wolfgang Frühwirt
Matthias Samwald
    AI4MH

Papers citing "Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals"

12 / 12 papers shown
Cognitive bias in LLM reasoning compromises interpretation of clinical oncology notes
Matthew W. Kenaston
Umair Ayub
Mihir Parmar
Muhammad Umair Anjum
Syed Arsalan Ahmed Naqvi
...
Eliezer M. Van Allen
Ben Zhou
YooJung Choi
Chitta Baral
Irbaz B. Riaz
LRM
101
0
0
16 Nov 2025
Towards Real-World Validity in Generative AI Benchmarks: Understanding and Designing Domain-Centered Evaluations for Journalism Practitioners
Charlotte Li
Nick Hagar
Sachita Nishal
Jeremy Gilbert
Nick Diakopoulos
ELM
140
1
0
30 Sep 2025
Towards Experience-Centered AI: A Framework for Integrating Lived Experience in Design and Development
Sanjana Gautam
Mohit Chandra
Ankolika De
Tatiana Chakravorti
Girik Malik
M. D. Choudhury
101
0
0
09 Aug 2025
It's Not the Target, It's the Background: Rethinking Infrared Small Target Detection via Deep Patch-Free Low-Rank Representations
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025
Guoyi Zhang
Guangsheng Xu
Siyang Chen
Han Wang
Xiaohu Zhang
661
0
0
12 Jun 2025
Enhanced prediction of spine surgery outcomes using advanced machine learning techniques and oversampling methods
Health Information Science and Systems (HISS), 2025
J. Benítez-Andrades
C. Prada-García
Nicolás Ordás-Reyes
Marta Esteban Blanco
Alicia Merayo
Antonio Serrano-García
278
7
0
23 Mar 2025
Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation
Maria Eriksson
Erasmo Purificato
Arman Noroozian
Joao Vinagre
Guillaume Chaslot
Emilia Gomez
David Fernandez-Llorca
ELM
837
47
0
10 Feb 2025
PhilHumans: Benchmarking Machine Learning for Personal Health
Vadim Liventsev
Vivek Kumar
Allmin Pradhap Singh Susaiyah
Zixiu "Alex" Wu
Ivan Rodin
...
Milan Petkovic
Diego Reforgiato Recupero
Ehud Reiter
Daniele Riboni
Raymond Sterling
AI4MH LM&MA
291
0
0
04 May 2024
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews
Neural Information Processing Systems (NeurIPS), 2023
Wojciech Kusa
Óscar E. Mendoza
Matthias Samwald
Petr Knoth
Allan Hanbury
339
8
0
21 Nov 2023
BenchMD: A Benchmark for Unified Learning on Medical Images and Sensors
Kathryn Wantlin
Chenwei Wu
Shih-Cheng Huang
Oishi Banerjee
Farah Z. Dadabhoy
...
A. Adamson
Laura Heacock
G. Tison
Alex Tamkin
Pranav Rajpurkar
SSL OOD
204
5
0
17 Apr 2023
The Shaky Foundations of Clinical Foundation Models: A Survey of Large Language Models and Foundation Models for EMRs
Michael Wornow
Yizhe Xu
Rahul Thapa
Birju S. Patel
E. Steinberg
Scott L. Fleming
M. Pfeffer
Jason Alan Fries
N. Shah
LM&MA
386
39
0
22 Mar 2023
Mapping global dynamics of benchmark creation and saturation in artificial intelligence
Nature Communications (Nat Commun), 2022
Simon Ott
A. Barbosa-Silva
Kathrin Blagec
J. Brauner
Matthias Samwald
372
77
0
09 Mar 2022
A curated, ontology-based, large-scale knowledge graph of artificial intelligence tasks and benchmarks
Kathrin Blagec
A. Barbosa-Silva
Simon Ott
Matthias Samwald
346
35
0
04 Oct 2021