ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.08087
  4. Cited By
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
v1v2v3v4v5v6 (latest)

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

15 June 2021
Ningyu Zhang
Mosha Chen
Zhen Bi
Xiaozhuan Liang
Lei Li
Xin Shang
Kangping Yin
Chuanqi Tan
Jian Xu
Fei Huang
Luo Si
Yuan Ni
Guotong Xie
Zhifang Sui
Baobao Chang
Hui Zong
Zheng Yuan
Linfeng Li
Jun Yan
Hongying Zan
Kunli Zhang
Buzhou Tang
Qingcai Chen
    LM&MAELM
ArXiv (abs)PDFHTML

Papers citing "CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark"

50 / 83 papers shown
MedBench v4: A Robust and Scalable Benchmark for Evaluating Chinese Medical Language Models, Multimodal Models, and Intelligent Agents
MedBench v4: A Robust and Scalable Benchmark for Evaluating Chinese Medical Language Models, Multimodal Models, and Intelligent Agents
Jinru Ding
Lu Lu
Chao Ding
Mouxiao Bian
Jiayuan Chen
...
Rongzhao Zhang
Luyi Jiang
Bing Han
Y Samuel Wang
Jie Xu
LM&MAELM
453
2
0
18 Nov 2025
47B Mixture-of-Experts Beats 671B Dense Models on Chinese Medical Examinations
47B Mixture-of-Experts Beats 671B Dense Models on Chinese Medical Examinations
Chiung-Yi Tseng
Danyang Zhang
Pohsun Feng
Hongying Luo
Lu Chen
...
Jibin Guan
Junfeng Hao
Junhao Song
Ziqian Bi
Ziqian Bi
MoEELM
389
0
0
16 Nov 2025
Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
Xiao Ye
Jacob Dineen
Zhaonan Li
Zhikun Xu
Weiyu Chen
...
Ji-Eun Irene Yum
Muhammad Ali Khan
Muhammad Umar Afzal
Irbaz B. Riaz
Ben Zhou
LM&MAELM
257
1
0
20 Oct 2025
Enabling Doctor-Centric Medical AI with LLMs through Workflow-Aligned Tasks and Benchmarks
Enabling Doctor-Centric Medical AI with LLMs through Workflow-Aligned Tasks and Benchmarks
Wenya Xie
Qingying Xiao
Yu Zheng
Xidong Wang
Junying Chen
...
Anningzhe Gao
Prayag Tiwari
Xiang Wan
Feng Jiang
Benyou Wang
LM&MA
243
0
0
13 Oct 2025
A Unified Biomedical Named Entity Recognition Framework with Large Language Models
A Unified Biomedical Named Entity Recognition Framework with Large Language Models
Tengxiao Lv
Ling Luo
Juntao Li
Yanhua Wang
Yuchen Pan
...
Yan Jiang
Huiyi Lv
Yuanyuan Sun
Jian Wang
Hongfei Lin
130
1
0
10 Oct 2025
AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field
AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field
Chen Liang
Zhaoqi Huang
Haofen Wang
Fu Chai
Chunying Yu
...
Zhengjie Liu
Yanpeng Li
Hongjun Wang
Ruifeng Luo
Xianzhong Zhao
ELM
190
2
0
23 Sep 2025
Exploring Stability-Plasticity Trade-offs for Continual Named Entity Recognition
Exploring Stability-Plasticity Trade-offs for Continual Named Entity RecognitionIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Duzhen Zhang
Chenxing Li
Jiahua Dong
Qi Liu
Dong Yu
169
0
0
05 Aug 2025
ContextASR-Bench: A Massive Contextual Speech Recognition Benchmark
ContextASR-Bench: A Massive Contextual Speech Recognition Benchmark
He Wang
Linhan Ma
Dake Guo
Xiong Wang
Lei Xie
Jin Xu
Junyang Lin
AuLLM
392
6
0
08 Jul 2025
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
Ming-bo Wen
Yujiong Shen
Zelin Li
Huayu Sha
Binze Hu
...
Zhiheng Xi
Jiajun Sun
Tao Gui
Tao Gui
Qi Zhang
LM&MAELM
365
12
0
04 Jun 2025
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
Jiacheng Xie
Yang Yu
Ziyang Zhang
Shuai Zeng
Jiaxuan He
...
Congyu Guo
Lening Zhao
Congcong Jing
Guanghui An
Dong Xu
LM&MAELM
357
6
0
29 May 2025
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue
Yichun Feng
Jiawei Wang
Lu Zhou
Zhen Lei
Yixue Li
OffRLLM&MA
561
23
0
26 May 2025
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
Jieming Bian
Yuanzhe Peng
Lei Wang
Yin Huang
Jie Xu
FedML
421
14
0
29 Apr 2025
EIoU-EMC: A Novel Loss for Domain-specific Nested Entity Recognition
EIoU-EMC: A Novel Loss for Domain-specific Nested Entity RecognitionAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Jian Zhang
Tianqing Zhang
Qi Li
Hongwei Wang
257
0
0
19 Apr 2025
Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning
Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning
Peiyi Lin
Fukai Zhang
Kai Niu
Hao Fu
CLL
372
0
0
20 Mar 2025
OphthBench: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Ophthalmology
OphthBench: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Ophthalmology
Chengfeng Zhou
Ji Wang
Juanjuan Qin
Yining Wang
Ling Sun
Weiwei Dai
LM&MAELM
459
1
0
03 Feb 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsInformation Fusion (Inf. Fusion), 2023
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Xiaoshi Zhong
LM&MAAILaw
893
298
0
28 Jan 2025
Data Augmentation Techniques for Chinese Disease Name Normalization
Data Augmentation Techniques for Chinese Disease Name NormalizationIEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024
Wenqian Cui
Xiangling Fu
Shaohui Liu
Mingjun Gu
Xien Liu
Ji Wu
Irwin King
251
1
0
03 Jan 2025
Large Language Model Benchmarks in Medical Tasks
Large Language Model Benchmarks in Medical Tasks
Lawrence K. Q. Yan
Ming Li
Yujiao Shi
Cheng Fei
Cheng Fei
...
Junyu Liu
Xinyuan Song
Riyang Bao
Zekun Jiang
Ziyuan Qin
LM&MAAI4MH
830
26
0
28 Oct 2024
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic
  Reasoning Tasks
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLMCoGeReLMVLMLRM
250
1
0
17 Oct 2024
3DS: Medical Domain Adaptation of LLMs via Decomposed Difficulty-based Data Selection
3DS: Medical Domain Adaptation of LLMs via Decomposed Difficulty-based Data Selection
Hongxin Ding
Yue Fang
Runchuan Zhu
Xinke Jiang
Jinyang Zhang
Yongxin Xu
Xu Chu
Junfeng Zhao
Yasha Wang
398
4
0
13 Oct 2024
Privacy Evaluation Benchmarks for NLP ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Wei Huang
Yinggui Wang
Cen Chen
ELMSILM
451
5
0
24 Sep 2024
RexUniNLU: Recursive Method with Explicit Schema Instructor for
  Universal NLU
RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU
Chengyuan Liu
Shihang Wang
Fubang Zhao
Kun Kuang
Yangyang Kang
Weiming Lu
Changlong Sun
Fei Wu
261
1
0
09 Sep 2024
Probing Causality Manipulation of Large Language Models
Probing Causality Manipulation of Large Language Models
Chenyang Zhang
Haibo Tong
Bin Zhang
Dongyu Zhang
LRM
314
1
0
26 Aug 2024
LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace
  Them
LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them
Wenya Xie
Qingying Xiao
Yu Zheng
Xidong Wang
Junying Chen
Ke Ji
Anningzhe Gao
Xiang Wan
Feng Jiang
Benyou Wang
LM&MA
164
5
0
26 Jun 2024
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment
  and Knowledge Aggregation
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation
Yusheng Liao
Shuyang Jiang
Yanfeng Wang
Yu Wang
442
6
0
25 Jun 2024
Retrieval Augmented Instruction Tuning for Open NER with Large Language
  Models
Retrieval Augmented Instruction Tuning for Open NER with Large Language Models
Tingyu Xie
Jian Zhang
Yan Zhang
Yuanyuan Liang
Qi Li
Hongwei Wang
RALM
378
3
0
25 Jun 2024
MedBench: A Comprehensive, Standardized, and Reliable Benchmarking
  System for Evaluating Chinese Medical Large Language Models
MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models
Mianxin Liu
Jinru Ding
Jie Xu
Weiguo Hu
Xiaoyang Li
...
Haofen Wang
Tong Ruan
Xuanjing Huang
Xin Sun
Shaoting Zhang
ELMAI4MHLM&MA
265
29
0
24 Jun 2024
medIKAL: Integrating Knowledge Graphs as Assistants of LLMs for Enhanced Clinical Diagnosis on EMRs
medIKAL: Integrating Knowledge Graphs as Assistants of LLMs for Enhanced Clinical Diagnosis on EMRs
Mingyi Jia
Junwen Duan
Yan Song
Jianxin Wang
500
20
0
20 Jun 2024
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition
Yuming Yang
Wantong Zhao
Jessica Fan
Junjie Ye
Xiao Wang
...
Kaixin Huang
Yunke Zhang
Tao Gui
Qi Zhang
Xuanjing Huang
509
8
0
17 Jun 2024
TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large
  Language Models
TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models
Ping Yu
Kaitao Song
Fengchen He
Ming Chen
Jianfeng Lu
LM&MA
228
13
0
07 Jun 2024
MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering
  Medical Knowledge
MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge
Yuxuan Zhou
Xien Liu
Chen Ning
Ji Wu
ELM
243
8
0
05 Jun 2024
Medical Dialogue: A Survey of Categories, Methods, Evaluation and
  Challenges
Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Xiaoming Shi
Zeming Liu
Li Du
Yuxuan Wang
Hongru Wang
Yuhang Guo
Tong Ruan
Jie Xu
Shaoting Zhang
LM&MAELM
412
8
0
17 May 2024
A Comprehensive Survey on Evaluating Large Language Model Applications
  in the Medical Industry
A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry
Yining Huang
Keke Tang
Meilian Chen
Boyuan Wang
ELMLM&MA
519
31
0
24 Apr 2024
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models
  with Sparse Mixture of Low-Rank Adapter Experts
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts
Yusheng Liao
Shuyang Jiang
Yu Wang
Yanfeng Wang
MoE
262
14
0
13 Apr 2024
Intent Detection and Entity Extraction from BioMedical Literature
Intent Detection and Entity Extraction from BioMedical Literature
Ankan Mullick
Mukur Gupta
Pawan Goyal
MedIm
286
7
0
04 Apr 2024
MRC-based Nested Medical NER with Co-prediction and Adaptive
  Pre-training
MRC-based Nested Medical NER with Co-prediction and Adaptive Pre-trainingInternational Conference on Language Resources and Evaluation (LREC), 2024
Xiaojing Du
Hanjie Zhao
Danyan Xing
Yuxiang Jia
Hongying Zan
227
11
0
23 Mar 2024
Role Prompting Guided Domain Adaptation with General Capability Preserve
  for Large Language Models
Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models
Rui Wang
Fei Mi
Yi Chen
Boyang Xue
Hongru Wang
Qi Zhu
Kam-Fai Wong
Rui-Lan Xu
CLL
251
18
0
05 Mar 2024
DrBenchmark: A Large Language Understanding Evaluation Benchmark for
  French Biomedical Domain
DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain
Yanis Labrak
Adrien Bazoge
Oumaima El Khettari
Mickael Rouvier
Pacome Constant dit Beaufils
...
B. Daille
Solen Quiniou
Emmanuel Morin
P. Gourraud
Richard Dufour
LM&MA
280
9
0
20 Feb 2024
Benchmarking Large Language Models on Communicative Medical Coaching: a
  Novel System and Dataset
Benchmarking Large Language Models on Communicative Medical Coaching: a Novel System and Dataset
Hengguan Huang
Songtao Wang
Hongfu Liu
Hao Wang
Ye Wang
LM&MA
299
4
0
08 Feb 2024
An Empirical Investigation of Domain Adaptation Ability for Chinese
  Spelling Check Models
An Empirical Investigation of Domain Adaptation Ability for Chinese Spelling Check ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Xi Wang
Ruoqing Zhao
Hongliang Dai
Piji Li
LRM
187
2
0
26 Jan 2024
A Fast, Performant, Secure Distributed Training Framework For Large
  Language Model
A Fast, Performant, Secure Distributed Training Framework For Large Language Model
Wei Huang
Yinggui Wang
Anda Cheng
Aihui Zhou
Chaofan Yu
Lei Wang
ALM
248
26
0
18 Jan 2024
Data-Centric Foundation Models in Computational Healthcare: A Survey
Data-Centric Foundation Models in Computational Healthcare: A Survey
Yunkun Zhang
Jin Gao
Zheling Tan
Lingfeng Zhou
Kexin Ding
Mu Zhou
Shaoting Zhang
Yi Xu
AI4CE
402
39
0
04 Jan 2024
Text2MDT: Extracting Medical Decision Trees from Medical Texts
Text2MDT: Extracting Medical Decision Trees from Medical Texts
Wei-wei Zhu
Wenfeng Li
Xing Tian
Pengfei Wang
Xiaoling Wang
Jin Chen
Man Lan
Yuan Ni
Guotong Xie
265
9
0
04 Jan 2024
Overview of the PromptCBLUE Shared Task in CHIP2023
Overview of the PromptCBLUE Shared Task in CHIP2023
Wei-wei Zhu
Xiaoling Wang
Mosha Chen
Buzhou Tang
LM&MAELM
292
15
0
29 Dec 2023
HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and
  Reliable Medical LLMs Responses
HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and Reliable Medical LLMs Responses
Xinke Jiang
Ruizhe Zhang
Yongxin Xu
Rihong Qiu
Yue Fang
...
Jinyi Tang
Hongxin Ding
Xu Chu
Junfeng Zhao
Yasha Wang
RALM
394
38
0
26 Dec 2023
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for
  Systematic Literature Reviews
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature ReviewsNeural Information Processing Systems (NeurIPS), 2023
Wojciech Kusa
Óscar E. Mendoza
Matthias Samwald
Petr Knoth
Allan Hanbury
339
8
0
21 Nov 2023
Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse
  Biomedical Tasks
Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks
Ling Luo
Jinzhong Ning
Yingwen Zhao
Zhijun Wang
Zeyuan Ding
...
Yuqi Liu
Zhihao Yang
Jian Wang
Shengdi Yin
Hongfei Lin
LM&MA
398
95
0
20 Nov 2023
KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained
  Language Model
KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model
Lei Geng
Xu Yan
Ziqiang Cao
Juntao Li
Wenjie Li
Sujian Li
Xinjie Zhou
Yang Yang
Jun Zhang
187
2
0
20 Nov 2023
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs
Junying Chen
Xidong Wang
Anningzhe Gao
Feng Jiang
Shunian Chen
...
Chuyi Kong
Jianquan Li
Xiang Wan
Haizhou Li
Benyou Wang
LM&MA
281
120
0
16 Nov 2023
A Survey of Large Language Models in Medicine: Progress, Application,
  and Challenge
A Survey of Large Language Models in Medicine: Progress, Application, and Challenge
Hongjian Zhou
Fenglin Liu
Boyang Gu
Xinyu Zou
Jinfa Huang
...
Yefeng Zheng
Lei A. Clifton
Zheng Li
Fenglin Liu
David Clifton
LM&MA
815
207
0
09 Nov 2023
12
Next
Page 1 of 2