ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.07340
  4. Cited By
MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large
  Language Models in Medicine

MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine

12 May 2023
Jie Xu
Lu Lu
Sen Yang
Bilin Liang
Xinwei Peng
Jiali Pang
Jinru Ding
Xiaoming Shi
Lingrui Yang
Huan-Zhi Song
Kang Li
Xin Sun
Shaoting Zhang
    LM&MA
    AI4MH
ArXivPDFHTML

Papers citing "MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine"

3 / 3 papers shown
Title
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the
  Question Answering Performance of the GPT LLM Family
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family
Yiming Tan
Dehai Min
Y. Li
Wenbo Li
Nan Hu
Yongrui Chen
Guilin Qi
AI4MH
ELM
47
93
0
14 Mar 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Out of Order: How Important Is The Sequential Order of Words in a
  Sentence in Natural Language Understanding Tasks?
Out of Order: How Important Is The Sequential Order of Words in a Sentence in Natural Language Understanding Tasks?
Thang M. Pham
Trung Bui
Long Mai
Anh Totti Nguyen
195
122
0
30 Dec 2020
1