ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.13375
  4. Cited By
Capabilities of GPT-4 on Medical Challenge Problems

Capabilities of GPT-4 on Medical Challenge Problems

20 March 2023
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
    LM&MA
    ELM
    AI4MH
ArXivPDFHTML

Papers citing "Capabilities of GPT-4 on Medical Challenge Problems"

50 / 370 papers shown
Title
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model
  System for Answering Medical Questions using Scientific Literature
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature
Alejandro Lozano
Scott L. Fleming
Chia-Chun Chiang
Nigam Shah
ELM
RALM
18
32
0
24 Oct 2023
KITAB: Evaluating LLMs on Constraint Satisfaction for Information
  Retrieval
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
Marah Abdin
Suriya Gunasekar
Varun Chandrasekaran
Jerry Li
Mert Yuksekgonul
Rahee Peshawaria
Ranjita Naik
Besmira Nushi
49
12
0
24 Oct 2023
Large Language Models can Share Images, Too!
Large Language Models can Share Images, Too!
Young-Jun Lee
Dokyong Lee
Joo Won Sung
Jonghwan Hyeon
Ho-Jin Choi
MLLM
24
2
0
23 Oct 2023
Exploring the Boundaries of GPT-4 in Radiology
Exploring the Boundaries of GPT-4 in Radiology
Qianchu Liu
Stephanie L. Hyland
Shruthi Bannur
Kenza Bouzid
Daniel Coelho De Castro
...
Anja Thieme
A. Nori
M. Lungren
Ozan Oktay
Javier Alvarez-Valle
LM&MA
AI4CE
27
36
0
23 Oct 2023
Better to Ask in English: Cross-Lingual Evaluation of Large Language
  Models for Healthcare Queries
Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries
Yiqiao Jin
Mohit Chandra
Gaurav Verma
Yibo Hu
Munmun De Choudhury
Srijan Kumar
LM&MA
ELM
87
65
0
19 Oct 2023
Large Language Model for Multi-objective Evolutionary Optimization
Large Language Model for Multi-objective Evolutionary Optimization
Fei Liu
Xi Lin
Zhenkun Wang
Shunyu Yao
Xialiang Tong
Mingxuan Yuan
Qingfu Zhang
19
37
0
19 Oct 2023
Large Language Model Prediction Capabilities: Evidence from a Real-World
  Forecasting Tournament
Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament
P. Schoenegger
Peter S. Park
ELM
AI4TS
11
14
0
17 Oct 2023
Emulating Human Cognitive Processes for Expert-Level Medical
  Question-Answering with Large Language Models
Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models
Khushboo Verma
Marina Moore
Stephanie Wottrich
Karla Robles López
Nishant Aggarwal
...
Maimoona Saeed
Tatiana López Velarde Pena
Bryan R. Barksdale
Sushovan Guha
Satwant Kumar
ELM
AI4MH
LM&MA
8
0
0
17 Oct 2023
Data Contamination Through the Lens of Time
Data Contamination Through the Lens of Time
Manley Roberts
Himanshu Thakur
Christine Herlihy
Colin White
Samuel Dooley
84
30
0
16 Oct 2023
Examining the Potential and Pitfalls of ChatGPT in Science and
  Engineering Problem-Solving
Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving
Karen D. Wang
E. Burkholder
Carl E. Wieman
S. Salehi
Nicholas Haber
AI4CE
ELM
30
33
0
12 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and
  Domain-Specificity
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Jindong Wang
Xing Xie
Zheng-Wei Zhang
Yue Zhang
HILM
KELM
51
182
0
11 Oct 2023
On the Impact of Cross-Domain Data on German Language Models
On the Impact of Cross-Domain Data on German Language Models
Amin Dada
Aokun Chen
C.A.I. Peng
Kaleb E. Smith
Ahmad Idrissi-Yaghir
...
Daniel Truhn
Jan Egger
Jiang Bian
Jens Kleesiek
Yonghui Wu
11
3
0
11 Oct 2023
GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using
  Large Language Models
GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using Large Language Models
B. Silva
Leonardo Nunes
Roberto Estevão
Vijay Aski
Ranveer Chandra
ELM
LM&MA
27
11
0
10 Oct 2023
LLM for SoC Security: A Paradigm Shift
LLM for SoC Security: A Paradigm Shift
Dipayan Saha
Shams Tarek
Katayoon Yahyaei
S. Saha
Jingbo Zhou
M. Tehranipoor
Farimah Farahmandi
54
46
0
09 Oct 2023
Robust and Interpretable Medical Image Classifiers via Concept
  Bottleneck Models
Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
An Yan
Yu-Xiang Wang
Yiwu Zhong
Zexue He
Petros Karypis
...
Chengyu Dong
Amilcare Gentili
Chun-Nan Hsu
Jingbo Shang
Julian McAuley
21
30
0
04 Oct 2023
MathVista: Evaluating Mathematical Reasoning of Foundation Models in
  Visual Contexts
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
Pan Lu
Hritik Bansal
Tony Xia
Jiacheng Liu
Chun-yue Li
Hannaneh Hajishirzi
Hao Cheng
Kai-Wei Chang
Michel Galley
Jianfeng Gao
LRM
MLLM
36
492
0
03 Oct 2023
DRG-LLaMA : Tuning LLaMA Model to Predict Diagnosis-related Group for
  Hospitalized Patients
DRG-LLaMA : Tuning LLaMA Model to Predict Diagnosis-related Group for Hospitalized Patients
Hanyin Wang
Chufan Gao
Christopher Dantona
Bryan Hull
Jimeng Sun
LM&MA
15
54
0
22 Sep 2023
Talk2Care: Facilitating Asynchronous Patient-Provider Communication with
  Large-Language-Model
Talk2Care: Facilitating Asynchronous Patient-Provider Communication with Large-Language-Model
Ziqi Yang
Xuhai Xu
Bingsheng Yao
Shao Zhang
Ethan Rogers
Stephen Intille
N. Shara
G. Gao
Dakuo Wang
LM&MA
AI4MH
19
4
0
17 Sep 2023
Performance of ChatGPT-3.5 and GPT-4 on the United States Medical
  Licensing Examination With and Without Distractions
Performance of ChatGPT-3.5 and GPT-4 on the United States Medical Licensing Examination With and Without Distractions
Myriam Safrai
A. Azaria
LM&MA
ELM
AI4MH
17
0
0
12 Sep 2023
Zero-shot Learning with Minimum Instruction to Extract Social
  Determinants and Family History from Clinical Notes using GPT Model
Zero-shot Learning with Minimum Instruction to Extract Social Determinants and Family History from Clinical Notes using GPT Model
Neel Jitesh Bhate
Ansh Mittal
Zhe He
Xiao Luo
12
12
0
11 Sep 2023
Aligning Large Language Models for Clinical Tasks
Aligning Large Language Models for Clinical Tasks
Supun Manathunga
Isuru Hettigoda
LM&MA
ELM
AI4MH
14
10
0
06 Sep 2023
An Automatic Evaluation Framework for Multi-turn Medical Consultations
  Capabilities of Large Language Models
An Automatic Evaluation Framework for Multi-turn Medical Consultations Capabilities of Large Language Models
Yusheng Liao
Yutong Meng
Hongcheng Liu
Yanfeng Wang
Yu Wang
LM&MA
ELM
11
6
0
05 Sep 2023
On the Planning, Search, and Memorization Capabilities of Large Language
  Models
On the Planning, Search, and Memorization Capabilities of Large Language Models
Yunhao Yang
Anshul Tomar
LM&Ro
ELM
41
1
0
05 Sep 2023
Publicly Shareable Clinical Large Language Model Built on Synthetic
  Clinical Notes
Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes
Sunjun Kweon
Junu Kim
Jiyoun Kim
Sujeong Im
Eunbyeol Cho
...
Seungjin Baek
Chang Hoon Han
Yoon Bin Jung
Yohan Jo
E. Choi
LM&MA
ELM
15
35
0
01 Sep 2023
The AI Revolution: Opportunities and Challenges for the Finance Sector
The AI Revolution: Opportunities and Challenges for the Finance Sector
Carsten Maple
Lukasz Szpruch
Gregory Epiphaniou
Kalina S. Staykova
Simran Singh
William Penwarden
Yisi Wen
Zijian Wang
Jagdish Hariharan
Pavle Avramović
AIFin
11
30
0
31 Aug 2023
Enhancing Subtask Performance of Multi-modal Large Language Model
Enhancing Subtask Performance of Multi-modal Large Language Model
Yongqiang Zhao
Zhenyu Li
Feng Zhang
Xinhai Xu
Donghong Liu
LRM
19
0
0
31 Aug 2023
Challenges of GPT-3-based Conversational Agents for Healthcare
Challenges of GPT-3-based Conversational Agents for Healthcare
Fabian Lechner
Allison Lahnala
Charles F Welch
Lucie Flek
LM&MA
13
2
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
J. Liu
62
31
0
27 Aug 2023
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on
  Language, Multimodal, and Scientific GPT Models
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models
Kaiyuan Gao
Su He
Zhenyu He
Jiacheng Lin
Qizhi Pei
Jie Shao
Wei Zhang
LM&MA
SyDa
30
4
0
27 Aug 2023
Large Language Models Streamline Automated Machine Learning for Clinical
  Studies
Large Language Models Streamline Automated Machine Learning for Clinical Studies
Soroosh Tayebi Arasteh
T. Han
Mahshad Lotfinia
Christiane Kuhl
Jakob Nikolas Kather
Daniel Truhn
S. Nebelung
ELM
LM&MA
AI4MH
25
48
0
27 Aug 2023
MedAlign: A Clinician-Generated Dataset for Instruction Following with
  Electronic Medical Records
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records
Scott L. Fleming
Alejandro Lozano
W. Haberkorn
Jenelle A. Jindal
E. Reis
...
Jonathan H. Chen
Keith Morse
Emma Brunskill
Jason Alan Fries
N. Shah
LM&MA
23
52
0
27 Aug 2023
Exploring the Effectiveness of GPT Models in Test-Taking: A Case Study
  of the Driver's License Knowledge Test
Exploring the Effectiveness of GPT Models in Test-Taking: A Case Study of the Driver's License Knowledge Test
Saba Rahimi
T. Balch
Manuela Veloso
ELM
15
1
0
22 Aug 2023
Halo: Estimation and Reduction of Hallucinations in Open-Source Weak
  Large Language Models
Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models
Mohamed S. Elaraby
Mengyin Lu
Jacob Dunn
Xueying Zhang
Yu Wang
Shizhu Liu
Pingchuan Tian
Yuping Wang
Yuxuan Wang
HILM
17
23
0
22 Aug 2023
Diagnostic Reasoning Prompts Reveal the Potential for Large Language
  Model Interpretability in Medicine
Diagnostic Reasoning Prompts Reveal the Potential for Large Language Model Interpretability in Medicine
Thomas Savage
Ashwin Nayak
Roberta Gallo
E. Rangan
Jonathan H. Chen
LM&MA
ELM
LRM
AI4CE
9
115
0
13 Aug 2023
A Comparative Study of Open-Source Large Language Models, GPT-4 and
  Claude 2: Multiple-Choice Test Taking in Nephrology
A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology
Sean Wu
Michael Koo
L. Blum
A. Black
Liyo Kao
Fabien Scalzo
Ira Kurtz
LM&MA
ELM
AI4MH
10
40
0
09 Aug 2023
Resource Management for GPT-based Model Deployed on Clouds: Challenges,
  Solutions, and Future Directions
Resource Management for GPT-based Model Deployed on Clouds: Challenges, Solutions, and Future Directions
Yongkang Dang
Minxian Xu
Kejiang Ye
9
0
0
05 Aug 2023
A Survey of Spanish Clinical Language Models
A Survey of Spanish Clinical Language Models
Guillem García Subies
Á. Jiménez
Paloma Martínez
LM&MA
ELM
LRM
14
0
0
04 Aug 2023
Scaling Clinical Trial Matching Using Large Language Models: A Case
  Study in Oncology
Scaling Clinical Trial Matching Using Large Language Models: A Case Study in Oncology
Cliff Wong
Sheng Zhang
Yu Gu
C. Moung
Jacob Abel
...
R. Weerasinghe
B. Piening
Tristan Naumann
Carlo Bifulco
Hoifung Poon
LM&MA
6
35
0
04 Aug 2023
LLMs Understand Glass-Box Models, Discover Surprises, and Suggest
  Repairs
LLMs Understand Glass-Box Models, Discover Surprises, and Suggest Repairs
Ben Lengerich
Sebastian Bordt
Harsha Nori
M. Nunnally
Y. Aphinyanaphongs
Manolis Kellis
Rich Caruana
11
7
0
02 Aug 2023
TrafficSafetyGPT: Tuning a Pre-trained Large Language Model to a
  Domain-Specific Expert in Transportation Safety
TrafficSafetyGPT: Tuning a Pre-trained Large Language Model to a Domain-Specific Expert in Transportation Safety
Ou Zheng
Mohamed Abdel-Aty
Dongdong Wang
Chenzhu Wang
Shengxuan Ding
14
12
0
28 Jul 2023
Matching Patients to Clinical Trials with Large Language Models
Matching Patients to Clinical Trials with Large Language Models
Qiao Jin
Zifeng Wang
C. Floudas
Fangyuan Chen
Changlin Gong
Dara Bracken-Clarke
Elisabetta Xue
Yifan Yang
Jimeng Sun
Zhiyong Lu
LM&MA
15
88
0
27 Jul 2023
Mental-LLM: Leveraging Large Language Models for Mental Health
  Prediction via Online Text Data
Mental-LLM: Leveraging Large Language Models for Mental Health Prediction via Online Text Data
Xuhai Xu
Bingsheng Yao
Yu Dong
Saadia Gabriel
Hongfeng Yu
James A. Hendler
Marzyeh Ghassemi
A. Dey
Dakuo Wang
LM&MA
CLL
AI4MH
30
64
0
26 Jul 2023
Evaluating Large Language Models for Radiology Natural Language
  Processing
Evaluating Large Language Models for Radiology Natural Language Processing
Zheng Liu
Tianyang Zhong
Yiwei Li
Yutong Zhang
Yirong Pan
...
Shijie Zhao
Quanzheng Li
Hongtu Zhu
Dinggang Shen
Tianming Liu
LM&MA
ELM
41
6
0
25 Jul 2023
Chit-Chat or Deep Talk: Prompt Engineering for Process Mining
Chit-Chat or Deep Talk: Prompt Engineering for Process Mining
U. Jessen
Michal Sroka
Dirk Fahland
8
18
0
19 Jul 2023
Information Retrieval Meets Large Language Models: A Strategic Report
  from Chinese IR Community
Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community
Qingyao Ai
Ting Bai
Zhao Cao
Yi-Ju Chang
Jiawei Chen
...
Peng-Zhen Zhang
Fan Zhang
Wei-na Zhang
M. Zhang
Xiaofei Zhu
47
58
0
19 Jul 2023
How is ChatGPT's behavior changing over time?
How is ChatGPT's behavior changing over time?
Lingjiao Chen
Matei A. Zaharia
James Y. Zou
ELM
KELM
AI4MH
14
389
0
18 Jul 2023
The Potential and Pitfalls of using a Large Language Model such as
  ChatGPT or GPT-4 as a Clinical Assistant
The Potential and Pitfalls of using a Large Language Model such as ChatGPT or GPT-4 as a Clinical Assistant
Jingqing Zhang
Kai Sun
A. Jagadeesh
Mahta Ghahfarokhi
Deepa Gupta
Ashok Gupta
Vibhor Gupta
Yike Guo
LM&MA
AI4MH
ELM
24
11
0
16 Jul 2023
Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models
Yuheng Huang
Jiayang Song
Zhijie Wang
Shengming Zhao
Huaming Chen
Felix Juefei-Xu
Lei Ma
28
3
0
16 Jul 2023
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey
S. Mohamadi
G. Mujtaba
Ngan Le
Gianfranco Doretto
Don Adjeroh
LM&MA
AI4MH
16
20
0
09 Jul 2023
Frontier AI Regulation: Managing Emerging Risks to Public Safety
Frontier AI Regulation: Managing Emerging Risks to Public Safety
Markus Anderljung
Joslyn Barnhart
Anton Korinek
Jade Leung
Cullen O'Keefe
...
Jonas Schuett
Yonadav Shavit
Divya Siddarth
Robert F. Trager
Kevin J. Wolf
SILM
20
115
0
06 Jul 2023
Previous
12345678
Next