Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies

10 March 2025
Luyi Jiang
Jiasi Chen
Lu Lu
Xinwei Peng
Lihao Liu
Junjun He
Jie Xu
ELM, LM&MA
arXiv: 2503.07306

Papers citing "Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies"

9 / 9 papers shown
LettuceDetect: A Hallucination Detection Framework for RAG Applications
Adam Kovacs
Gábor Recski
201
15
0
24 Feb 2025
How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Rui Li
Heming Xia
Xinfeng Yuan
Qingxiu Dong
Lei Sha
W. Li
AI4CE
199
2
0
20 Feb 2025
Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingqian Cui
Pengfei He
Jingying Zeng
Hui Liu
Xianfeng Tang
...
Zhen Li
Suhang Wang
Yue Xing
Shucheng Zhou
Qi He
LRM
574
29
0
18 Feb 2025
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Shiyu Ni
Keping Bi
Jiafeng Guo
Lulu Yu
Baolong Bi
Xueqi Cheng
283
17
0
17 Feb 2025
LLMs for Drug-Drug Interaction Prediction: A Comprehensive Comparison
Gabriele De Vito
Filomena Ferrucci
Athanasios Angelakis
LM&MA
340
7
0
09 Feb 2025
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization
Yuanye Liu
Jiahang Xu
Li Zhang
Qi Chen
Xuan Feng
Yang Chen
Zhongxin Guo
Yuqing Yang
Peng Cheng
778
13
0
06 Feb 2025
SOK: Exploring Hallucinations and Security Risks in AI-Assisted Software Development with Insights for LLM Deployment
Ariful Haque
Sunzida Siddique
M. Rahman
Ahmed Rafi Hasan
Laxmi Rani Das
Marufa Kamal
Tasnim Masura
Kishor Datta Gupta
304
6
0
31 Jan 2025
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario
Lucen Zhong
Zhengxiao Du
Xiaohan Zhang
Haiyi Hu
J. Tang
LLMAG
224
25
0
20 Jan 2025
A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy
Huandong Wang
Wenjie Fu
Yingzhou Tang
Zhilong Chen
Yanhua Huang
J. Piao
Chen Gao
Fengli Xu
Tao Jiang
Yongqian Li
PILM
222
21
0
17 Jan 2025