MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models

24 June 2024
Mianxin Liu
Jinru Ding
Jie Xu
Weiguo Hu
Xiaoyang Li
Lifeng Zhu
Zhian Bai
Xiaoming Shi
Benyou Wang
Haitao Song
Pengfei Liu
Xiaofan Zhang
Shanshan Wang
Kang Li
Haofen Wang
Tong Ruan
Xuanjing Huang
Xin Sun
Shaoting Zhang
    ELM
    AI4MH
    LM&MA

Papers citing "MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models"

Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges
Xiaoxiao Liu
Qingying Xiao
Junying Chen
Xiangyi Feng
Xiangbo Wu
...
Xiang Wan
Jian Chang
Guangjun Yu
Yan Hu
Benyou Wang
LM&MA
LRM
59
0
0
11 Mar 2025
Don't Make Your LLM an Evaluation Benchmark Cheater
Kun Zhou
Yutao Zhu
Zhipeng Chen
Wentong Chen
Wayne Xin Zhao
Xu Chen
Yankai Lin
Ji-Rong Wen
Jiawei Han
ELM
105
136
0
03 Nov 2023
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
240
1,070
0
05 Oct 2022