ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.05949
  4. Cited By
General surgery vision transformer: A video pre-trained foundation model
  for general surgery

General surgery vision transformer: A video pre-trained foundation model for general surgery

9 March 2024
Samuel Schmidgall
Ji Woong Kim
Jeffery Jopling
Axel Krieger
    ViT
    MedIm
ArXivPDFHTML

Papers citing "General surgery vision transformer: A video pre-trained foundation model for general surgery"

9 / 9 papers shown
Title
SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence
Chang Han Low
Ziyue Wang
Tianyi Zhang
Zhitao Zeng
Zhu Zhuo
E. Mazomenos
Yueming Jin
LRM
46
1
0
13 Mar 2025
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic
  Surgical Video-Language Pretraining
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Ming Hu
Kun Yuan
Yaling Shen
Feilong Tang
Xiaohao Xu
...
Jin Ye
N. Padoy
Nassir Navab
Junjun He
Zongyuan Ge
VLM
CLIP
85
10
0
23 Nov 2024
VidLPRO: A $\underline{Vid}$eo-$\underline{L}$anguage
  $\underline{P}$re-training Framework for $\underline{Ro}$botic and
  Laparoscopic Surgery
VidLPRO: A Vid‾\underline{Vid}Vid​eo-L‾\underline{L}L​anguage P‾\underline{P}P​re-training Framework for Ro‾\underline{Ro}Ro​botic and Laparoscopic Surgery
Mohammadmahdi Honarmand
Muhammad Abdullah Jamal
Omid Mohareri
58
1
0
07 Sep 2024
GP-VLS: A general-purpose vision language model for surgery
GP-VLS: A general-purpose vision language model for surgery
Samuel Schmidgall
Joseph Cho
C. Zakka
W. Hiesinger
LM&MA
44
5
0
27 Jul 2024
Language models are susceptible to incorrect patient self-diagnosis in
  medical applications
Language models are susceptible to incorrect patient self-diagnosis in medical applications
Rojin Ziaei
Samuel Schmidgall
ELM
LM&MA
23
8
0
17 Sep 2023
LoViT: Long Video Transformer for Surgical Phase Recognition
LoViT: Long Video Transformer for Surgical Phase Recognition
Yang Liu
Maxence Boels
Luis C. García-Peraza-Herrera
Tom Kamiel Magda Vercauteren
P. Dasgupta
Alejandro Granados
Sebastien Ourselin
36
30
0
15 May 2023
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
100
110
0
23 Jun 2022
Large Language Models are Few-Shot Clinical Information Extractors
Large Language Models are Few-Shot Clinical Information Extractors
Monica Agrawal
S. Hegselmann
Hunter Lang
Yoon Kim
David Sontag
BDL
LM&MA
154
327
0
25 May 2022
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic
  Videos
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos
A. P. Twinanda
S. Shehata
Didier Mutter
J. Marescaux
M. de Mathelin
N. Padoy
168
828
0
09 Feb 2016
1