Capabilities of GPT-4 on Medical Challenge Problems


20 March 2023
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
    LM&MA
    ELM
    AI4MH

Papers citing "Capabilities of GPT-4 on Medical Challenge Problems"

50 / 370 papers shown
Title
LLM-based Conversational AI Therapist for Daily Functioning Screening and Psychotherapeutic Intervention via Everyday Smart Devices
Jingping Nie
Hanya Shao
Yuang Fan
Qijia Shao
Haoxuan You
Matthias Preindl
Xiaofan Jiang
AI4MH
30
15
0
16 Mar 2024
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge
Yizhen Li
Shaohan Huang
Jiaxing Qi
Lei Quan
Dongran Han
Zhongzhi Luan
LM&MA
AI4MH
27
4
0
14 Mar 2024
A Continued Pretrained LLM Approach for Automatic Medical Note Generation
Dong Yuan
Eti Rastogi
Gautam Naik
Sree Prasanna Rajagopal
Sagar Goyal
Fen Zhao
Jai Chintagunta
Jeff Ward
LM&MA
AI4MH
32
19
0
14 Mar 2024
Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator
Yusheng Liao
Yutong Meng
Yuhao Wang
Hongcheng Liu
Yanfeng Wang
Yu Wang
LM&MA
ELM
30
8
0
13 Mar 2024
Elephants Never Forget: Testing Language Models for Memorization of Tabular Data
Sebastian Bordt
Harsha Nori
Rich Caruana
LMTD
30
13
0
11 Mar 2024
MedKP: Medical Dialogue with Knowledge Enhancement and Clinical Pathway Encoding
Jiageng Wu
Xian Wu
Yefeng Zheng
Jie Yang
MedIm
LM&MA
16
2
0
11 Mar 2024
Guiding Clinical Reasoning with Large Language Models via Knowledge Seeds
Jiageng Wu
Xian Wu
Jie Yang
LRM
ELM
28
6
0
11 Mar 2024
How Well Do Multi-modal LLMs Interpret CT Scans? An Auto-Evaluation Framework for Analyses
Qingqing Zhu
Benjamin Hou
T. Mathai
Pritam Mukherjee
Qiao Jin
Xiuying Chen
Zhizheng Wang
Ruida Cheng
Ronald M. Summers
Zhiyong Lu
30
0
0
08 Mar 2024
Benchmarking Large Language Models for Molecule Prediction Tasks
Zhiqiang Zhong
Kuangyu Zhou
Davide Mottin
27
7
0
08 Mar 2024
Quantum Many-Body Physics Calculations with Large Language Models
Haining Pan
N. Mudur
Will Taranto
Maria Tikhanovskaya
Subhashini Venugopalan
Yasaman Bahri
Michael P. Brenner
Eun-Ah Kim
25
4
0
05 Mar 2024
The Minimum Information about CLinical Artificial Intelligence Checklist for Generative Modeling Research (MI-CLAIM-GEN)
Brenda Y. Miao
Irene Y. Chen
C. Y. Williams
Jaysón M. Davidson
Augusto Garcia-Agundez
...
Bin Yu
Milena Gianfrancesco
A. Butte
Beau Norgeot
Madhumita Sushil
VLM
34
2
0
05 Mar 2024
Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Lingjiao Chen
Jared Quincy Davis
Boris Hanin
Peter Bailis
Ion Stoica
Matei A. Zaharia
James Y. Zou
LRM
29
0
0
04 Mar 2024
To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering
Giacomo Frisoni
Alessio Cocchieri
Alex Presepi
Gianluca Moro
Zaiqiao Meng
RALM
MedIm
44
15
0
04 Mar 2024
KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations
Sunjun Kweon
B. Choi
Minkyu Kim
Rae Woong Park
Edward Choi
ELM
15
6
0
03 Mar 2024
AutoRD: An Automatic and End-to-End System for Rare Disease Knowledge Graph Construction Based on Ontologies-enhanced Large Language Models
Lang Cao
Jimeng Sun
Adam Cross
30
3
0
01 Mar 2024
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Vithursan Thangarasa
Mahmoud Salem
Shreyas Saxena
Kevin Leong
Joel Hestness
Sean Lie
MedIm
16
1
0
01 Mar 2024
FAC$^2$E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition
Xiaoqiang Wang
Bang Liu
Lingfei Wu
22
0
0
29 Feb 2024
Wisdom of the Silicon Crowd: LLM Ensemble Prediction Capabilities Rival Human Crowd Accuracy
P. Schoenegger
Indre Tuminauskaite
Peter S. Park
Rafael Valdece Sousa Bastos
P. Tetlock
29
24
0
29 Feb 2024
Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
Hanjie Chen
Zhouxiang Fang
Yash Singla
Mark Dredze
ELM
AI4MH
34
31
0
28 Feb 2024
CodeS: Towards Building Open-source Language Models for Text-to-SQL
Haoyang Li
Jing Zhang
Hanbing Liu
Ju Fan
Xiaokang Zhang
Jun Zhu
Renjie Wei
Hongyan Pan
Cuiping Li
Hong Chen
ELM
AI4TS
38
91
0
26 Feb 2024
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries
Sunjun Kweon
Jiyoun Kim
Heeyoung Kwak
Dongchul Cha
Hangyul Yoon
Kwanghyun Kim
Jeewon Yang
Seunghyun Won
Edward Choi
LM&MA
24
4
0
25 Feb 2024
CloChat: Understanding How People Customize, Interact, and Experience Personas in Large Language Models
Juhye Ha
Hyeon Jeon
DaEun Han
Jinwook Seo
Changhoon Oh
26
24
0
23 Feb 2024
Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology
Nur Yildirim
Hannah Richardson
Maria T. A. Wetscherek
Junaid Bajwa
Joseph Jacob
...
Ozan Oktay
M. Lungren
Javier Alvarez-Valle
A. Nori
Anja Thieme
LM&MA
51
39
0
22 Feb 2024
Investigating Why Clinicians Deviate from Standards of Care: Liberating Patients from Mechanical Ventilation in the ICU
Nur Yildirim
Susanna Zlotnikov
Aradhana Venkat
Gursimran Chawla
Jennifer Kim
L. Bukowski
Jeremy M. Kahn
James McCann
John Zimmerman
22
5
0
21 Feb 2024
Benchmarking Retrieval-Augmented Generation for Medicine
Guangzhi Xiong
Qiao Jin
Zhiyong Lu
Aidong Zhang
RALM
75
143
0
20 Feb 2024
Me LLaMA: Foundation Large Language Models for Medical Applications
Qianqian Xie
Qingyu Chen
Aokun Chen
C.A.I. Peng
Yan Hu
...
Huan He
Lucila Ohno-Machado
Yonghui Wu
Hua Xu
Jiang Bian
LM&MA
AI4MH
70
3
0
20 Feb 2024
Machine-Generated Text Localization
Zhongping Zhang
Wenda Qin
Bryan A. Plummer
DeLMO
21
4
0
19 Feb 2024
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
Yanis Labrak
Adrien Bazoge
Emmanuel Morin
P. Gourraud
Mickael Rouvier
Richard Dufour
96
188
0
15 Feb 2024
GPT-4's assessment of its performance in a USMLE-based case study
Uttam Dhakal
Aniket Kumar Singh
Suman Devkota
Yogesh Sapkota
Bishal Lamichhane
Suprinsa Paudyal
Chandra Dhakal
ELM
AI4MH
LM&MA
18
3
0
15 Feb 2024
AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy
P. Schoenegger
Peter S. Park
Ezra Karger
P. Tetlock
29
14
0
12 Feb 2024
TransGPT: Multi-modal Generative Pre-trained Transformer for Transportation
Peng Wang
Xiang Wei
Fangxu Hu
Wenjuan Han
33
15
0
11 Feb 2024
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations
Ankit Pal
Malaikannan Sankarasubbu
LM&MA
43
34
0
10 Feb 2024
Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing
Hochul Hwang
Sunjae Kwon
Yekyung Kim
Donghyun Kim
27
11
0
09 Feb 2024
GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding
Stefan Dernbach
Khushbu Agarwal
Alejandro Zuniga
Michael Henry
Sutanay Choudhury
28
8
0
09 Feb 2024
Trust the Process: Zero-Knowledge Machine Learning to Enhance Trust in Generative AI Interactions
Bianca-Mihaela Ganescu
Jonathan Passerat-Palmbach
SyDa
20
8
0
09 Feb 2024
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs
Simone Balloccu
Patrícia Schmidtová
Mateusz Lango
Ondrej Dusek
SILM
ELM
PILM
16
152
0
06 Feb 2024
Integration of cognitive tasks into artificial general intelligence test for large models
Youzhi Qu
Chen Wei
Penghui Du
Wenxin Che
Chi Zhang
...
Bin Hu
Kai Du
Haiyan Wu
Jia Liu
Quanying Liu
ELM
34
6
0
04 Feb 2024
Efficient Prompt Caching via Embedding Similarity
Hanlin Zhu
Banghua Zhu
Jiantao Jiao
RALM
21
9
0
02 Feb 2024
Tradeoffs Between Alignment and Helpfulness in Language Models with Representation Engineering
Yotam Wolf
Noam Wies
Dorin Shteyman
Binyamin Rothberg
Yoav Levine
Amnon Shashua
LLMSV
21
13
0
29 Jan 2024
"You tell me": A Dataset of GPT-4-Based Behaviour Change Support Conversations
Selina Meyer
David Elsweiler
17
0
0
29 Jan 2024
L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks
Ping Guo
Fei Liu
Xi Lin
Qingchuan Zhao
Qingfu Zhang
20
0
0
27 Jan 2024
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models
Minbyul Jeong
Jiwoong Sohn
Mujeen Sung
Jaewoo Kang
11
27
0
27 Jan 2024
Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data
Yinghao Zhu
Zixiang Wang
Junyi Gao
Yuning Tong
Jingkun An
Weibin Liao
Ewen M. Harrison
Liantao Ma
Chengwei Pan
LM&MA
41
8
0
25 Jan 2024
LLM on FHIR -- Demystifying Health Records
Paul Schmiedmayer
Adrit Rao
Philipp Zagar
Vishnu Ravi
Aydin Zahedivash
Arash Fereydooni
Oliver Aalami
LM&MA
21
7
0
25 Jan 2024
When Geoscience Meets Generative AI and Large Language Models: Foundations, Trends, and Future Challenges
Abdenour Hadid
Tanujit Chakraborty
Daniel Busby
AI4CE
16
11
0
25 Jan 2024
How Good is ChatGPT at Face Biometrics? A First Look into Recognition, Soft Biometrics, and Explainability
Ivan Deandres-Tame
Ruben Tolosana
R. Vera-Rodríguez
Aythami Morales
Julian Fierrez
J. Ortega-Garcia
CVBM
34
21
0
24 Jan 2024
Evaluation of General Large Language Models in Contextually Assessing Semantic Concepts Extracted from Adult Critical Care Electronic Health Record Notes
Darren Liu
Cheng Ding
Delgersuren Bold
Monique Bouvier
Jiaying Lu
...
Laurie Dimisko
Ran Xiao
J. H. Yoon
Carl Yang
Xiaoping Hu
LM&MA
ELM
AI4MH
13
4
0
24 Jan 2024
Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation
Saiyang Na
Yuzhi Guo
Feng Jiang
Hehuan Ma
Junzhou Huang
VLM
MedIm
13
14
0
24 Jan 2024
CheX-GPT: Harnessing Large Language Models for Enhanced Chest X-ray Report Labeling
Jawook Gu
Hankyu Cho
Jiho Kim
Kihyun You
Eun K. Hong
Byungseok Roh
22
6
0
21 Jan 2024
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
M. A. D. L. Balaguer
Vinamra Benara
Renato Luiz de Freitas Cunha
Roberto de M. Estevao Filho
Todd Hendry
...
Morris Sharp
B. Silva
Swati Sharma
Vijay Aski
Ranveer Chandra
FaML
17
78
0
16 Jan 2024