Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.16446
Cited By
Towards Automatic Evaluation for LLMs' Clinical Capabilities: Metric, Data, and Algorithm
25 March 2024
Lei Liu
Xiaoyan Yang
Fangzhou Li
Chenfei Chi
Yue Shen
Shiwei Lyu
Xiaowei Ma
Xianguo Lyu
Liya Ma
Zhiqiang Zhang
Wei Xue
Yiran Huang
Jinjie Gu
LM&MA
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards Automatic Evaluation for LLMs' Clinical Capabilities: Metric, Data, and Algorithm"
7 / 7 papers shown
Title
Med-CoDE: Medical Critique based Disagreement Evaluation Framework
Mohit Gupta
Akiko Aizawa
R. Shah
LM&MA
ELM
27
0
0
21 Apr 2025
ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments
Sourjyadip Ray
Kushal Gupta
Soumi Kundu
Payal Arvind Kasat
Somak Aditya
Pawan Goyal
18
1
0
08 Oct 2024
Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams
Yuexing Hao
J. Holmes
Jared Hobson
Alexandra Bennett
Daniel K. Ebner
...
N. Yu
Chris L. Hallemeier
Brooke E. Ball
Mark R. Waddle
Wei Liu
LM&MA
27
0
0
26 Sep 2024
A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
Lei Liu
Xiaoyan Yang
Junchi Lei
Xiaoyang Liu
Yue Shen
...
Peng Wei
Jinjie Gu
Zhixuan Chu
Zhan Qin
Kui Ren
LM&MA
AILaw
34
14
0
06 Jun 2024
How Language Model Hallucinations Can Snowball
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILM
LRM
78
246
0
22 May 2023
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
240
1,070
0
05 Oct 2022
PubMedQA: A Dataset for Biomedical Research Question Answering
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
202
791
0
13 Sep 2019
1