Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2201.12888
Cited By
A Dataset for Medical Instructional Video Classification and Question Answering
Scientific Data (Sci Data), 2022
30 January 2022
D. Gupta
Kush Attal
Dina Demner-Fushman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Dataset for Medical Instructional Video Classification and Question Answering"
18 / 18 papers shown
Title
EyePCR: A Comprehensive Benchmark for Fine-Grained Perception, Knowledge Comprehension and Clinical Reasoning in Ophthalmic Surgery
Gui Wang
Yang Wennuo
Xusen Ma
Zehao Zhong
Zhuoru Wu
Ende Wu
Rong Qu
W. Cheah
Jianfeng Ren
Linlin Shen
127
0
0
19 Sep 2025
MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning
Suhao Yu
Haojin Wang
Juncheng Wu
Cihang Xie
Yuyin Zhou
159
10
0
22 May 2025
Enhancing the Learning Experience: Using Vision-Language Models to Generate Questions for Educational Videos
International Conference on Artificial Intelligence in Education (AIED), 2025
Markos Stamatakis
Joshua Berger
Christian Wartena
Ralph Ewerth
Anett Hoppe
AI4Ed
273
1
0
03 May 2025
Ask2Loc: Learning to Locate Instructional Visual Answers by Asking Questions
Chang Zong
Bin Li
Shoujun Zhou
Jian Wan
Lei Zhang
816
1
0
22 Apr 2025
How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?
Rahul Thapa
Andrew Li
Qingyang Wu
Bryan He
Yuki Sahashi
...
Angela Zhang
Ben Athiwaratkun
Shuaiwen Leon Song
David Ouyang
James Zou
LM&MA
384
2
0
19 Apr 2025
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
Biomedical Engineering Letters (Biomed Eng Lett), 2025
Lukas Buess
Matthias Keicher
Nassir Navab
Andreas Maier
Soroosh Tayebi Arasteh
LM&MA
825
6
0
13 Feb 2025
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
Computer Vision and Pattern Recognition (CVPR), 2025
Wenyi Hong
Yean Cheng
Zhiyong Yang
Weihan Wang
Lefan Wang
Xiaotao Gu
Xiaotao Gu
Yuxiao Dong
J. Tang
CoGe
VLM
217
22
0
06 Jan 2025
Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track
D. Gupta
Dina Demner-Fushman
LM&MA
182
2
0
15 Dec 2024
Large Language Model Benchmarks in Medical Tasks
Lawrence K. Q. Yan
Ming Li
Yujiao Shi
Cheng Fei
Cheng Fei
...
Junyu Liu
Xinyuan Song
Riyang Bao
Zekun Jiang
Ziyuan Qin
LM&MA
AI4MH
519
16
0
28 Oct 2024
FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment
Jinglin Xu
Sibo Yin
Guohao Zhao
Zishuo Wang
Yuxin Peng
238
31
0
11 May 2024
Multimodal Transformer With a Low-Computational-Cost Guarantee
Sungjin Park
Edward Choi
122
2
0
23 Feb 2024
NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding
Ming Hu
Lin Wang
Siyuan Yan
Don Ma
Qingli Ren
Peng Xia
Wei Feng
Peibo Duan
Lie Ju
Zongyuan Ge
252
21
0
20 Oct 2023
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches
International Conference on Language Resources and Evaluation (LREC), 2023
Deepak Gupta
Kush Attal
Dina Demner-Fushman
LM&MA
122
4
0
21 Sep 2023
Human in the loop approaches in multi-modal conversational task guidance system development
R. Manuvinakurike
Sovan Biswas
G. Raffa
R. Beckwith
A. Rhodes
Meng Shi
Gesem Gudino Mejia
Saurav Sahay
L. Nachman
149
2
0
03 Nov 2022
Visual Answer Localization with Cross-modal Mutual Knowledge Transfer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yixuan Weng
Bin Li
246
11
0
26 Oct 2022
Learning to Locate Visual Answer in Video Corpus Using Question
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Bin Li
Yixuan Weng
Bin Sun
Shutao Li
276
8
0
11 Oct 2022
Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features
Social Science Research Network (SSRN), 2022
Deepa Gupta
R. Loane
Soumya Gayen
Dina Demner-Fushman
MedIm
149
14
0
05 Oct 2022
Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Bin Li
Yixuan Weng
Bin Sun
Shutao Li
495
63
0
13 Mar 2022
1