Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.13375
Cited By
Capabilities of GPT-4 on Medical Challenge Problems
20 March 2023
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
LM&MA
ELM
AI4MH
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Capabilities of GPT-4 on Medical Challenge Problems"
50 / 370 papers shown
Title
Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks
Zhaofeng Wu
Linlu Qiu
Alexis Ross
Ekin Akyürek
Boyuan Chen
Bailin Wang
Najoung Kim
Jacob Andreas
Yoon Kim
LRM
ReLM
35
191
0
05 Jul 2023
Transformers in Healthcare: A Survey
Subhash Nerella
S. Bandyopadhyay
Jiaqing Zhang
Miguel Contreras
Scott Siegel
...
Jessica Sena
B. Shickel
A. Bihorac
Kia Khezeli
Parisa Rashidi
MedIm
AI4CE
19
24
0
30 Jun 2023
SummQA at MEDIQA-Chat 2023:In-Context Learning with GPT-4 for Medical Summarization
Yash Mathur
Sanketh Rangreji
Raghav Kapoor
Medha Palavalli
Amanda Bertsch
Matthew R. Gormley
AI4MH
38
13
0
30 Jun 2023
From Query Tools to Causal Architects: Harnessing Large Language Models for Advanced Causal Discovery from Data
Taiyu Ban
Lyvzhou Chen
Xiangyu Wang
Huanhuan Chen
ELM
22
58
0
29 Jun 2023
A negation detection assessment of GPTs: analysis with the xNot360 dataset
Nguyen Ha Thanh
Randy Goebel
Francesca Toni
Kostas Stathis
Ken Satoh
17
9
0
29 Jun 2023
Pareto Optimal Learning for Estimating Large Language Model Errors
Theodore Zhao
Mu-Hsin Wei
J. S. Preston
Hoifung Poon
14
6
0
28 Jun 2023
OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue
Weihao Gao
Zhuo Deng
Zhiyuan Niu
Fuju Rong
Chucheng Chen
...
Fangjun Li
Zhenjie Cao
Zhaoyi Ma
Wenbin Wei
Lan Ma
LM&MA
21
33
0
21 Jun 2023
Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications
Saed Rezayi
Zheng Liu
Zihao Wu
Chandra Dhakal
Bao Ge
...
Gengchen Mai
Ninghao Liu
Chen Zhen
Tianming Liu
Sheng R. Li
19
31
0
20 Jun 2023
Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health
Shubo Tian
Qiao Jin
Lana Yeganova
Po-Ting Lai
Qingqing Zhu
...
Donald C. Comeau
R. Islamaj
Aadit Kapoor
Xin Gao
Zhiyong Lu
LM&MA
MedIm
AI4MH
103
207
0
15 Jun 2023
Customizing General-Purpose Foundation Models for Medical Report Generation
Bang-ju Yang
Asif Raza
Yuexian Zou
Tong Zhang
MedIm
11
11
0
09 Jun 2023
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Junling Liu
Peilin Zhou
Yining Hua
Dading Chong
Zhongyu Tian
...
Helin Wang
Chenyu You
Zhenhua Guo
Lei Zhu
Michael Lingzhi Li
LM&MA
ELM
17
63
0
05 Jun 2023
Beyond Generating Code: Evaluating GPT on a Data Visualization Course
Zhutian Chen
Chenyang Zhang
Qianwen Wang
J. Troidl
Simon Warchol
Johanna Beyer
Nils Gehlenborg
Hanspeter Pfister
17
29
0
05 Jun 2023
On Optimal Caching and Model Multiplexing for Large Model Inference
Banghua Zhu
Ying Sheng
Lianmin Zheng
Clark W. Barrett
Michael I. Jordan
Jiantao Jiao
18
17
0
03 Jun 2023
Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today
Zhuo Wang
R. Li
Bowen Dong
Jie Wang
Xiuxing Li
...
C. Mao
Wei Zhang
L. Dong
Jing Gao
Jianyong Wang
LM&MA
ELM
AI4MH
14
18
0
02 Jun 2023
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Chunyuan Li
Cliff Wong
Sheng Zhang
Naoto Usuyama
Haotian Liu
Jianwei Yang
Tristan Naumann
Hoifung Poon
Jianfeng Gao
LM&MA
MedIm
51
674
0
01 Jun 2023
Deliberate then Generate: Enhanced Prompting Framework for Text Generation
Bei Li
Rui Wang
Junliang Guo
Kaitao Song
Xuejiao Tan
Hany Hassan
Arul Menezes
Tong Xiao
Jiang Bian
JingBo Zhu
8
14
0
31 May 2023
GPT4GEO: How a Language Model Sees the World's Geography
Jonathan Roberts
Timo Lüddecke
Sowmen Das
Kai Han
Samuel Albanie
19
57
0
30 May 2023
The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification
Linhao Qu
X. Luo
Kexue Fu
Manning Wang
Zhijian Song
22
21
0
29 May 2023
Large Language Models, scientific knowledge and factuality: A systematic analysis in antibiotic discovery
Magdalena Wysocka
Oskar Wysocki
Maxime Delmas
V. Mutel
André Freitas
LM&MA
25
6
0
28 May 2023
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks
Minki Kang
Seanie Lee
Jinheon Baek
Kenji Kawaguchi
Sung Ju Hwang
ALM
LRM
47
56
0
28 May 2023
What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks
Taicheng Guo
Kehan Guo
B. Nan
Zhengwen Liang
Zhichun Guo
Nitesh V. Chawla
Olaf Wiest
Xiangliang Zhang
ELM
36
124
0
27 May 2023
Large language models improve Alzheimer's disease diagnosis using multi-modality data
Yingjie Feng
Jun Wang
Xianfeng Gu
Xiaoyin Xu
M. Zhang
LM&MA
16
10
0
26 May 2023
SciMON: Scientific Inspiration Machines Optimized for Novelty
Qingyun Wang
Doug Downey
Heng Ji
Tom Hope
LLMAG
21
60
0
23 May 2023
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Sewon Min
Kalpesh Krishna
Xinxi Lyu
M. Lewis
Wen-tau Yih
Pang Wei Koh
Mohit Iyyer
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM
ALM
19
595
0
23 May 2023
BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance
Karel DÓosterlinck
François Remy
Johannes Deleu
Thomas Demeester
Chris Develder
Klim Zaporojets
Aneiss Ghodsi
Simon Ellershaw
Jack R. Collins
Christopher Potts
48
10
0
22 May 2023
Fairness of ChatGPT
Yunqi Li
Lanjing Zhang
Yongfeng Zhang
20
21
0
22 May 2023
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables
Xinyuan Lu
Liangming Pan
Qian Liu
Preslav Nakov
Min-Yen Kan
LMTD
28
24
0
22 May 2023
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities
Yuqi Zhu
Xiaohan Wang
Jing Chen
Shuofei Qiao
Yixin Ou
Yunzhi Yao
Shumin Deng
Huajun Chen
Ningyu Zhang
LLMAG
33
109
0
22 May 2023
Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
Augustin Toma
Patrick R. Lawler
Jimmy Ba
Rahul G. Krishnan
Barry Rubin
Bo Wang
LM&MA
AI4MH
ELM
20
29
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
31
356
0
19 May 2023
PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering
Xiaoman Zhang
Chaoyi Wu
Ziheng Zhao
Weixiong Lin
Ya-Qin Zhang
Yanfeng Wang
Weidi Xie
LM&MA
29
152
0
17 May 2023
Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents
Sergio Pelaez
Gaurav Verma
Barbara Ribeiro
P. Shapira
6
13
0
17 May 2023
Large Language Models Leverage External Knowledge to Extend Clinical Insight Beyond Language Boundaries
Jiageng Wu
X. Wu
Zhaopeng Qiu
Minghui Li
Yingying Zhang
Yefeng Zheng
Changzheng Yuan
Jie Yang
LM&MA
ELM
AI4MH
19
16
0
17 May 2023
Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study
Yaxin Fan
Feng Jiang
Peifeng Li
Haizhou Li
ELM
17
18
0
15 May 2023
Improving Small Language Models on PubMedQA via Generative Data Augmentation
Zhen Guo
Peiqi Wang
Yanwei Wang
Shangdi Yu
LM&MA
MedIm
18
10
0
12 May 2023
Generative Pre-trained Transformer: A Comprehensive Review on Enabling Technologies, Potential Applications, Emerging Challenges, and Future Directions
Gokul Yenduri
M. Ramalingam
G. C. Selvi
Y. Supriya
Gautam Srivastava
...
Rutvij H. Jhaveri
B. Prabadevi
Weizheng Wang
Athanasios V. Vasilakos
Thippa Reddy Gadekallu
AI4CE
LM&MA
8
158
0
11 May 2023
The Case Records of ChatGPT: Language Models and Complex Clinical Questions
T. Poterucha
P. Elias
C. Haggerty
ELM
LM&MA
11
0
0
09 May 2023
Professional Certification Benchmark Dataset: The First 500 Jobs For Large Language Models
David A. Noever
Matt Ciolino
ELM
34
4
0
07 May 2023
Causal Reasoning and Large Language Models: Opening a New Frontier for Causality
Emre Kıcıman
Robert Osazuwa Ness
Amit Sharma
Chenhao Tan
LRM
ELM
24
258
0
28 Apr 2023
Prompt Engineering for Healthcare: Methodologies and Applications
Jiaqi Wang
Enze Shi
Sigang Yu
Zihao Wu
Chong Ma
...
Dajiang Zhu
Yixuan Yuan
Dinggang Shen
Tianming Liu
Shu Zhang
LM&MA
42
106
0
28 Apr 2023
PMC-LLaMA: Towards Building Open-source Language Models for Medicine
Chaoyi Wu
Weixiong Lin
Xiaoman Zhang
Ya-Qin Zhang
Yanfeng Wang
Weidi Xie
LM&MA
AI4MH
86
75
0
27 Apr 2023
Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery
Debadutta Dash
Rahul Thapa
Juan M. Banda
Akshay Swaminathan
Morgan Cheatham
...
Garret K. Morris
H. Magon
M. Lungren
Eric Horvitz
N. Shah
ELM
LM&MA
AI4MH
68
49
0
26 Apr 2023
Fundamental Limitations of Alignment in Large Language Models
Yotam Wolf
Noam Wies
Oshri Avnery
Yoav Levine
Amnon Shashua
ALM
6
137
0
19 Apr 2023
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information
Qiao Jin
Yifan Yang
Qingyu Chen
Zhiyong Lu
LM&MA
LLMAG
14
143
0
19 Apr 2023
A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models
Qianqian Xie
Zheheng Luo
Benyou Wang
Sophia Ananiadou
LM&MA
VLM
24
8
0
18 Apr 2023
Low-code LLM: Graphical User Interface over Large Language Models
Yuzhe Cai
Shaoguang Mao
Wenshan Wu
Zehua Wang
Yaobo Liang
...
Ting Song
Yan Xia
Jonathan Tien
Nan Duan
Furu Wei
26
13
0
17 Apr 2023
Improving Patient Pre-screening for Clinical Trials: Assisting Physicians with Large Language Models
D. Hamer
P. Schoor
T. Polak
Daniel Kapitan
LRM
LM&MA
24
14
0
14 Apr 2023
Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language Understanding
Yuqing Wang
Yun Zhao
Linda R. Petzold
AI4MH
LM&MA
ELM
11
50
0
09 Apr 2023
Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI Models
Pengfei Li
Jianyi Yang
M. A. Islam
Shaolei Ren
78
115
0
06 Apr 2023
GPT-4 to GPT-3.5: 'Hold My Scalpel' -- A Look at the Competency of OpenAI's GPT on the Plastic Surgery In-Service Training Exam
J. Freedman
Ian Nappier
LM&MA
ELM
MedIm
6
6
0
04 Apr 2023
Previous
1
2
3
4
5
6
7
8
Next