Towards Automatic Evaluation for LLMs' Clinical Capabilities: Metric, Data, and Algorithm

25 March 2024
Lei Liu
Xiaoyan Yang
Fangzhou Li
Chenfei Chi
Yue Shen
Shiwei Lyu
Xiaowei Ma
Xianguo Lyu
Liya Ma
Zhiqiang Zhang
Wei Xue
Yiran Huang
Jinjie Gu
    LM&MA
    ELM

Papers citing "Towards Automatic Evaluation for LLMs' Clinical Capabilities: Metric, Data, and Algorithm"

7 / 7 papers shown
Med-CoDE: Medical Critique based Disagreement Evaluation Framework
Mohit Gupta
Akiko Aizawa
R. Shah
LM&MA
ELM
25
0
0
21 Apr 2025
ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments
Sourjyadip Ray
Kushal Gupta
Soumi Kundu
Payal Arvind Kasat
Somak Aditya
Pawan Goyal
16
1
0
08 Oct 2024
Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams
Yuexing Hao
J. Holmes
Jared Hobson
Alexandra Bennett
Daniel K. Ebner
...
N. Yu
Chris L. Hallemeier
Brooke E. Ball
Mark R. Waddle
Wei Liu
LM&MA
27
0
0
26 Sep 2024
A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
Lei Liu
Xiaoyan Yang
Junchi Lei
Xiaoyang Liu
Yue Shen
...
Peng Wei
Jinjie Gu
Zhixuan Chu
Zhan Qin
Kui Ren
LM&MA
AILaw
34
14
0
06 Jun 2024
How Language Model Hallucinations Can Snowball
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILM
LRM
78
246
0
22 May 2023
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
240
1,070
0
05 Oct 2022
PubMedQA: A Dataset for Biomedical Research Question Answering
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
196
791
0
13 Sep 2019