ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.13375
  4. Cited By
Capabilities of GPT-4 on Medical Challenge Problems

Capabilities of GPT-4 on Medical Challenge Problems

20 March 2023
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
    LM&MA
    ELM
    AI4MH
ArXivPDFHTML

Papers citing "Capabilities of GPT-4 on Medical Challenge Problems"

50 / 370 papers shown
Title
In-Context Learning and Fine-Tuning GPT for Argument Mining
In-Context Learning and Fine-Tuning GPT for Argument Mining
Jérémie Cabessa
Hugo Hernault
Umer Mushtaq
16
0
0
10 Jun 2024
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in
  Low-Resource and Extinct Languages
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
Andrew M. Bean
Simi Hellsten
Harry Mayne
Jabez Magomere
Ethan A. Chi
Ryan A. Chi
Scott A. Hale
Hannah Rose Kirk
ELM
LRM
34
6
0
10 Jun 2024
Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of
  Health for Clinical Text
Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of Health for Clinical Text
Avijit Mitra
Emily Druhl
Raelene Goodwin
Hong Yu
24
2
0
10 Jun 2024
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
Yanis Labrak
Adel Moumen
Richard Dufour
Mickael Rouvier
ELM
LM&MA
MedIm
29
0
0
09 Jun 2024
Deconstructing The Ethics of Large Language Models from Long-standing
  Issues to New-emerging Dilemmas
Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas
Chengyuan Deng
Yiqun Duan
Xin Jin
Heng Chang
Yijun Tian
...
Kuofeng Gao
Sihong He
Jun Zhuang
Lu Cheng
Haohan Wang
AILaw
38
16
0
08 Jun 2024
Transforming Dental Diagnostics with Artificial Intelligence: Advanced
  Integration of ChatGPT and Large Language Models for Patient Care
Transforming Dental Diagnostics with Artificial Intelligence: Advanced Integration of ChatGPT and Large Language Models for Patient Care
Masoumeh Farhadi Nia
Mohsen Ahmadi
Elyas Irankhah
LM&MA
AI4CE
21
5
0
07 Jun 2024
A Survey on Medical Large Language Models: Technology, Application,
  Trustworthiness, and Future Directions
A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
Lei Liu
Xiaoyan Yang
Junchi Lei
Xiaoyang Liu
Yue Shen
...
Peng Wei
Jinjie Gu
Zhixuan Chu
Zhan Qin
Kui Ren
LM&MA
AILaw
34
14
0
06 Jun 2024
Exploring Multilingual Large Language Models for Enhanced TNM
  classification of Radiology Report in lung cancer staging
Exploring Multilingual Large Language Models for Enhanced TNM classification of Radiology Report in lung cancer staging
Hidetoshi Matsuo
Mizuho Nishio
Takaaki Matsunaga
Koji Fujimoto
Takamichi Murakami
LM&MA
31
5
0
05 Jun 2024
MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering
  Medical Knowledge
MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge
Yuxuan Zhou
Xien Liu
Chen Ning
Ji Wu
ELM
23
3
0
05 Jun 2024
Multiple Choice Questions and Large Languages Models: A Case Study with
  Fictional Medical Data
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
Maxime Griot
Jean Vanderdonckt
D. Yüksel
C. Hemptinne
AI4Ed
ELM
LM&MA
40
5
0
04 Jun 2024
Eliciting the Priors of Large Language Models using Iterated In-Context
  Learning
Eliciting the Priors of Large Language Models using Iterated In-Context Learning
Jian-Qiao Zhu
Thomas L. Griffiths
BDL
30
2
0
04 Jun 2024
MedFuzz: Exploring the Robustness of Large Language Models in Medical
  Question Answering
MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering
Robert Osazuwa Ness
Katie Matton
Hayden Helm
Sheng Zhang
Junaid Bajwa
Carey E. Priebe
Eric Horvitz
ELM
25
9
0
03 Jun 2024
Superhuman performance in urology board questions by an explainable
  large language model enabled for context integration of the European
  Association of Urology guidelines: the UroBot study
Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study
Martin J. Hetz
Nicolas Carl
Sarah Haggenmüller
Christoph Wies
Maurice Stephan Michel
Frederik Wessels
T. Brinker
ELM
26
0
0
03 Jun 2024
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical
  Machine Reading Comprehension
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension
Shubham Vatsal
Ayush Singh
LM&MA
RALM
19
0
0
29 May 2024
Augmented Risk Prediction for the Onset of Alzheimer's Disease from
  Electronic Health Records with Large Language Models
Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models
Jiankun Wang
Sumyeong Ahn
Taykhoom Dalal
Xiaodan Zhang
Weishen Pan
Qiannan Zhang
Bin Chen
H. H. Dodge
Fei-Yue Wang
Jiayu Zhou
LM&MA
22
2
0
26 May 2024
Efficient Medical Question Answering with Knowledge-Augmented Question
  Generation
Efficient Medical Question Answering with Knowledge-Augmented Question Generation
Julien Khlaut
Corentin Dancette
Elodie Ferreres
Alaedine Bennani
Paul Hérent
Pierre Manceron
19
1
0
23 May 2024
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
Minbyul Jeong
Hyeon Hwang
Chanwoong Yoon
Taewhoo Lee
Jaewoo Kang
MedIm
HILM
LM&MA
33
11
0
21 May 2024
Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large
  Language Models for Trauma Assessments
Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments
Sichang Tu
Abigail Powers
Natalie Merrill
Negar Fani
Sierra Carter
S. Doogan
Jinho D. Choi
LM&MA
22
1
0
18 May 2024
High Order Reasoning for Time Critical Recommendation in Evidence-based
  Medicine
High Order Reasoning for Time Critical Recommendation in Evidence-based Medicine
Manjiang Yu
Xue Li
35
0
0
05 May 2024
MedAdapter: Efficient Test-Time Adaptation of Large Language Models
  towards Medical Reasoning
MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning
Wenqi Shi
Ran Xu
Yuchen Zhuang
Yue Yu
Hang Wu
Carl Yang
M. D. Wang
MedIm
LM&MA
33
18
0
05 May 2024
Prompt engineering paradigms for medical applications: scoping review
  and recommendations for better practices
Prompt engineering paradigms for medical applications: scoping review and recommendations for better practices
Jamil Zaghir
Marco Naguib
Mina Bjelogrlic
Aurélie Névéol
Xavier Tannier
Christian Lovis
AI4CE
LM&MA
27
6
0
02 May 2024
ALCM: Autonomous LLM-Augmented Causal Discovery Framework
ALCM: Autonomous LLM-Augmented Causal Discovery Framework
Elahe Khatibi
Mahyar Abbasian
Zhongqi Yang
Iman Azimi
Amir M. Rahmani
56
12
0
02 May 2024
UMass-BioNLP at MEDIQA-M3G 2024: DermPrompt -- A Systematic Exploration
  of Prompt Engineering with GPT-4V for Dermatological Diagnosis
UMass-BioNLP at MEDIQA-M3G 2024: DermPrompt -- A Systematic Exploration of Prompt Engineering with GPT-4V for Dermatological Diagnosis
Parth Vashisht
Abhilasha Lodha
Mukta Maddipatla
Zonghai Yao
Avijit Mitra
Zhichao Yang
Junda Wang
Sunjae Kwon
Hong-ye Yu
LM&MA
26
1
0
27 Apr 2024
RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT
  Analysis
RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis
Xiaoman Zhang
Chaoyi Wu
Ziheng Zhao
Jiayu Lei
Ya-Qin Zhang
Yanfeng Wang
Weidi Xie
23
16
0
25 Apr 2024
Influence of Solution Efficiency and Valence of Instruction on Additive
  and Subtractive Solution Strategies in Humans and GPT-4
Influence of Solution Efficiency and Valence of Instruction on Additive and Subtractive Solution Strategies in Humans and GPT-4
Lydia Uhler
Verena Jordan
Jürgen Buder
Markus Huff
F. Papenmeier
20
0
0
25 Apr 2024
Adapting Open-Source Large Language Models for Cost-Effective,
  Expert-Level Clinical Note Generation with On-Policy Reinforcement Learning
Adapting Open-Source Large Language Models for Cost-Effective, Expert-Level Clinical Note Generation with On-Policy Reinforcement Learning
Hanyin Wang
Chufan Gao
Bolun Liu
Qiping Xu
Guleid Hussein
Mohamad El Labban
Kingsley Iheasirim
H. Korsapati
Chuck Outcalt
Jimeng Sun
LM&MA
AI4MH
22
2
0
25 Apr 2024
Hippocrates: An Open-Source Framework for Advancing Large Language
  Models in Healthcare
Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare
Emre Can Acikgoz
Osman Batur .Ince
Rayene Bench
Arda Anil Boz
.Ilker Kesen
Aykut Erdem
Erkut Erdem
LM&MA
27
9
0
25 Apr 2024
LLM-Based Section Identifiers Excel on Open Source but Stumble in Real
  World Applications
LLM-Based Section Identifiers Excel on Open Source but Stumble in Real World Applications
Saranya Krishnamoorthy
Ayush Singh
Shabnam Tafreshi
24
0
0
25 Apr 2024
Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs:
  Full-Parameter vs. Parameter-Efficient Approaches
Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches
Clément Christophe
Praveen K Kanithi
Prateek Munjal
Tathagata Raha
Nasir Hayat
...
Charles Chen
Natalia Vassilieva
Boulbaba Ben Amor
Marco AF Pimentel
Shadab Khan
AI4MH
LM&MA
33
28
0
23 Apr 2024
WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical
  Error Detection and Correction
WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction
Augustin Toma
Ronald Xie
Steven Palayew
Patrick R. Lawler
Bo Wang
30
3
0
22 Apr 2024
Holistic Safety and Responsibility Evaluations of Advanced AI Models
Holistic Safety and Responsibility Evaluations of Advanced AI Models
Laura Weidinger
Joslyn Barnhart
Jenny Brennan
Christina Butterfield
Susie Young
...
Sebastian Farquhar
Lewis Ho
Iason Gabriel
Allan Dafoe
William S. Isaac
ELM
24
8
0
22 Apr 2024
MedThink: Explaining Medical Visual Question Answering via Multimodal
  Decision-Making Rationale
MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale
Xiaotang Gai
Chenyi Zhou
Jiaxiang Liu
Yang Feng
Jian Wu
Zuo-Qiang Liu
MedIm
36
6
0
18 Apr 2024
emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0
  Framework, Enriched with emrQA Medical Information
emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information
Jimenez Eladio
Hao Wu
20
2
0
18 Apr 2024
LLMs in Biomedicine: A study on clinical Named Entity Recognition
LLMs in Biomedicine: A study on clinical Named Entity Recognition
Masoud Monajatipoor
Jiaxin Yang
Joel Stremmel
Melika Emami
Fazlolah Mohaghegh
Mozhdeh Rouhsedaghat
Kai-Wei Chang
LM&MA
27
5
0
10 Apr 2024
MedRG: Medical Report Grounding with Multi-modal Large Language Model
MedRG: Medical Report Grounding with Multi-modal Large Language Model
K. Zou
Yang Bai
Zhihao Chen
Yang Zhou
Yidi Chen
Kai Ren
Meng Wang
Xuedong Yuan
Xiaojing Shen
Huazhu Fu
MedIm
31
3
0
10 Apr 2024
CausalBench: A Comprehensive Benchmark for Causal Learning Capability of
  Large Language Models
CausalBench: A Comprehensive Benchmark for Causal Learning Capability of Large Language Models
Yu Zhou
Xingyu Wu
Beichen Huang
Jibin Wu
Liang Feng
Kay Chen Tan
ELM
CML
40
2
0
09 Apr 2024
Elephants Never Forget: Memorization and Learning of Tabular Data in
  Large Language Models
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Sebastian Bordt
Harsha Nori
Vanessa Rodrigues
Besmira Nushi
Rich Caruana
36
12
0
09 Apr 2024
MedExpQA: Multilingual Benchmarking of Large Language Models for Medical
  Question Answering
MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering
Inigo Alonso
Maite Oronoz
Rodrigo Agerri
AI4MH
LM&MA
ELM
47
14
1
08 Apr 2024
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving
  Complex Mathematical Problems
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems
Bin Lei
LLMAG
AI4CE
28
11
0
06 Apr 2024
Effects of Different Prompts on the Quality of GPT-4 Responses to
  Dementia Care Questions
Effects of Different Prompts on the Quality of GPT-4 Responses to Dementia Care Questions
Zhuochun Li
Bo Xie
Robin Hilsabeck
Alyssa Aguirre
Ning Zou
Zhimeng Luo
Daqing He
36
0
0
05 Apr 2024
Conversational Disease Diagnosis via External Planner-Controlled Large
  Language Models
Conversational Disease Diagnosis via External Planner-Controlled Large Language Models
Zhoujian Sun
Cheng Luo
Ziyi Liu
Zheng-Wei Huang
LM&MA
35
3
0
04 Apr 2024
CSEPrompts: A Benchmark of Introductory Computer Science Prompts
CSEPrompts: A Benchmark of Introductory Computer Science Prompts
Md. Nishat Raihan
Dhiman Goswami
Sadiya Sayara Chowdhury Puspo
Christian D. Newman
Tharindu Ranasinghe
Marcos Zampieri
ELM
21
2
0
03 Apr 2024
TWIN-GPT: Digital Twins for Clinical Trials via Large Language Model
TWIN-GPT: Digital Twins for Clinical Trials via Large Language Model
Yue Wang
Tianfan Fu
Yinlong Xu
Zihan Ma
Hongxia Xu
Yingzhou Lu
Bang Du
Hong-Yan Gao
Jian Wu
LM&MA
35
26
0
01 Apr 2024
Small Language Models Learn Enhanced Reasoning Skills from Medical
  Textbooks
Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks
Hyunjae Kim
Hyeon Hwang
Jiwoo Lee
Sihyeon Park
Dain Kim
Taewhoo Lee
Chanwoong Yoon
Jiwoong Sohn
Donghee Choi
Jaewoo Kang
ELM
AI4MH
LRM
43
16
0
30 Mar 2024
Can LLMs Correct Physicians, Yet? Investigating Effective Interaction
  Methods in the Medical Domain
Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain
Burcu Sayin
Pasquale Minervini
Jacopo Staiano
Andrea Passerini
LM&MA
AI4MH
24
8
0
29 Mar 2024
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Elliot Bolton
Abhinav Venigalla
Michihiro Yasunaga
David Leo Wright Hall
Betty Xiong
...
R. Daneshjou
Jonathan Frankle
Percy Liang
Michael Carbin
Christopher D. Manning
LM&MA
MedIm
32
51
0
27 Mar 2024
Construction of a Japanese Financial Benchmark for Large Language Models
Construction of a Japanese Financial Benchmark for Large Language Models
Masanori Hirano
18
10
0
22 Mar 2024
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao
Xuemei Dong
Wenyi Xu
Yunjun Gao
Bin Wei
Ying Zhang
28
9
0
21 Mar 2024
VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual
  Navigation
VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation
Hao Wang
Jiayou Qin
Ashish Bastola
Xiwen Chen
John Suchanek
Zihao Gong
Abolfazl Razi
30
14
0
19 Mar 2024
Embracing the Generative AI Revolution: Advancing Tertiary Education in
  Cybersecurity with GPT
Embracing the Generative AI Revolution: Advancing Tertiary Education in Cybersecurity with GPT
Raza Nowrozy
David Jam
AI4CE
20
1
0
18 Mar 2024
Previous
12345678
Next