2305.07340
MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine
12 May 2023
Jie Xu
Lu Lu
Sen Yang
Bilin Liang
Xinwei Peng
Jiali Pang
Jinru Ding
Xiaoming Shi
Lingrui Yang
Huan-Zhi Song
Kang Li
Xin Sun
Shaoting Zhang
LM&MA
AI4MH
Papers citing "MedGPTEval: A Dataset and Benchmark to Evaluate Responses of Large Language Models in Medicine"
3 / 3 papers shown
Title
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family
Yiming Tan
Dehai Min
Y. Li
Wenbo Li
Nan Hu
Yongrui Chen
Guilin Qi
AI4MH
ELM
47
93
0
14 Mar 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Out of Order: How Important Is The Sequential Order of Words in a Sentence in Natural Language Understanding Tasks?
Thang M. Pham
Trung Bui
Long Mai
Anh Totti Nguyen
195
122
0
30 Dec 2020