Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1602.03012
Cited By
v1
v2 (latest)
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos
9 February 2016
A. P. Twinanda
S. Shehata
Didier Mutter
J. Marescaux
M. de Mathelin
N. Padoy
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos"
50 / 324 papers shown
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Zhang
Robby T. Tan
Mamba
293
3
0
04 Apr 2025
Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence
Anita Rau
Mark Endo
Josiah Aklilu
Jaewoo Heo
Khaled Saab
Alberto Paderno
Jeffrey Jopling
F. C. Holsinger
Serena Yeung-Levy
268
2
0
03 Apr 2025
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
Ivan Sviridov
Amina Miftakhova
Artemiy Tereshchenko
Galina Zubkova
Pavel Blinov
Andrey Savchenko
LM&MA
347
5
0
26 Mar 2025
fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models
Saurav Sharma
Didier Mutter
N. Padoy
VLM
MedIm
227
0
0
25 Mar 2025
End-to-End Deep Learning for Real-Time Neuroimaging-Based Assessment of Bimanual Motor Skills
Aseem Subedi
Rahul Rahul
Lora Cavuoto
Steven D. Schwaitzberg
Matthew Hackett
Jack Norfleet
S. De
135
1
0
21 Mar 2025
Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding
International Journal of Computer Assisted Radiology and Surgery (IJCARS), 2025
David Gastager
Ghazal Ghazaei
Constantin Patsch
220
2
0
14 Mar 2025
CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition
Kaixiang Yang
Xin Li
Qiang Li
Zhiwei Wang
270
0
0
13 Mar 2025
SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence
Linghan Cai
Ziyue Wang
Tianyi Zhang
Zhitao Zeng
Zhu Zhuo
E. Mazomenos
Yueming Jin
LRM
228
10
0
13 Mar 2025
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models
Wei Dai
Peilin Chen
Malinda Lu
Daniel Li
Haowen Wei
Hejie Cui
Paul Pu Liang
LM&MA
319
12
0
09 Mar 2025
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments
Computer Vision and Pattern Recognition (CVPR), 2025
Ege Özsoy
Chantal Pellegrini
Tobias Czempiel
Felix Tristram
Kun Yuan
David Bani-Harouni
U. Eck
Benjamin Busam
Matthias Keicher
Nassir Navab
374
13
0
04 Mar 2025
MoSFormer: Augmenting Temporal Context with Memory of Surgery for Surgical Phase Recognition
Hao Ding
Xu Lian
Mathias Unberath
203
1
0
02 Mar 2025
Revisiting the Evaluation Bias Introduced by Frame Sampling Strategies in Surgical Video Segmentation Using SAM2
Utku Ozbulak
Seyed Amir Mousavi
Francesca Tozzi
Nikdokht Rashidian
W. Willaert
W. D. Neve
J. Vankerschaver
217
1
0
28 Feb 2025
Hierarchical Context Transformer for Multi-level Semantic Scene Understanding
Luoying Hao
Yan Hu
Yang Yue
Li Wu
Huazhu Fu
Yanfu Zhang
Jiang Liu
222
3
0
24 Feb 2025
SurgPLAN++: Universal Surgical Phase Localization Network for Online and Offline Inference
IEEE International Conference on Robotics and Automation (ICRA), 2024
Daming Gao
Xingjian Luo
Jinlin Wu
Long Bai
Zhen Lei
Hongliang Ren
Sebastien Ourselin
Hongbin Liu
348
4
0
17 Feb 2025
Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review
Ufaq Khan
Umair Nawaz
A. Qayyum
Shazad Ashraf
Yutong Xie
Muhammad Haris Khan
Muhammad Bilal
Junaid Qadir
470
5
0
16 Feb 2025
Early Operative Difficulty Assessment in Laparoscopic Cholecystectomy via Snapshot-Centric Video Analysis
International Journal of Computer Assisted Radiology and Surgery (IJCARS), 2025
Saurav Sharma
Maria Vannucci
Leonardo Pestana Legori
Mario Scaglia
Giovanni Guglielmo Laracca
Didier Mutter
Sergio Alfieri
Pietro Mascagni
N. Padoy
225
2
0
10 Feb 2025
Dual Invariance Self-training for Reliable Semi-supervised Surgical Phase Recognition
IEEE International Symposium on Biomedical Imaging (ISBI), 2025
Sahar Nasirihaghighi
Negin Ghamsarian
Raphael Sznitman
Klaus Schoeffmann
301
4
0
29 Jan 2025
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
International Conference on Learning Representations (ICLR), 2025
Jiajie Li
Brian R Quaranto
Chenhui Xu
Ishan Mishra
Ruiyang Qin
Dancheng Liu
Peter C W Kim
Jinjun Xiong
422
2
0
25 Jan 2025
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Guiqiu Liao
M. Jogan
Marcel Hussing
Kenta Nakahashi
Kazuhiro Yasufuku
Amin Madani
Eric Eaton
Daniel A. Hashimoto
1.2K
2
0
21 Jan 2025
Identifying Surgical Instruments in Pedagogical Cataract Surgery Videos through an Optimized Aggregation Network
International Conference on Image Processing, Applications and Systems (ICIPAS), 2025
Sanya Sinha
Michal Balazia
Francois Bremond
285
1
0
05 Jan 2025
Expanded Comprehensive Robotic Cholecystectomy Dataset (CRCD)
K. Oh
Leonardo Borgioli
Alberto Mangano
Valentina Valle
Marco Di Pangrazio
...
Luciano Ambrosini
Alvaro Ducas
Milos Zefran
Liaohai Chen
P. Giulianotti
270
1
0
16 Dec 2024
Machine Learning-Based Automated Assessment of Intracorporeal Suturing in Laparoscopic Fundoplication
Global Surgical Education - Journal of the Association for Surgical Education (JSE), 2024
Shekhar Madhav Khairnar
Huu Phong Nguyen
Alexis Desir
Carla Holcomb
Daniel J. Scott
Ganesh Sankaranarayanan
337
3
0
16 Dec 2024
Adaptive Graph Learning from Spatial Information for Surgical Workflow Anticipation
IEEE Transactions on Medical Robotics and Bionics (TMRB), 2024
Francis Xiatian Zhang
Jingjing Deng
Robert Lieck
Hubert P. H. Shum
262
5
0
09 Dec 2024
Memory-Augmented Multimodal LLMs for Surgical VQA via Self-Contained Inquiry
Wenjun Hou
Yi Cheng
Kaishuai Xu
Yan Hu
Wenjie Li
Jiang-Dong Liu
234
4
0
17 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
402
3
0
12 Nov 2024
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation
Zhenbin Wang
Lei Zhang
Lituan Wang
Minjuan Zhu
Zhenwei Zhang
VGen
MedIm
317
6
0
03 Nov 2024
Large Language Model Benchmarks in Medical Tasks
Lawrence K. Q. Yan
Ming Li
Yujiao Shi
Cheng Fei
Cheng Fei
...
Junyu Liu
Xinyuan Song
Riyang Bao
Zekun Jiang
Ziyuan Qin
LM&MA
AI4MH
695
21
0
28 Oct 2024
Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin Representation
Hao Ding
Yuqian Zhang
Hongchao Shu
Xu Lian
Ji Woong Kim
Axel Krieger
Mathias Unberath
Ji Woong Kim
Axel Krieger
Mathias Unberath
MedIm
298
5
0
26 Oct 2024
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
401
11
0
22 Oct 2024
Surgical-LLaVA: Toward Surgical Scenario Understanding via Large Language and Vision Models
Juseong Jin
Chang Wook Jeong
260
8
0
13 Oct 2024
A Bayesian Approach to Weakly-supervised Laparoscopic Image Segmentation
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Zhou Zheng
Y. Hayashi
M. Oda
T. Kitasaka
K. Mori
UQCV
107
2
0
11 Oct 2024
SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition
IEEE Transactions on Medical Imaging (IEEE TMI), 2024
Shu Yang
Zhiyuan Cai
Luyang Luo
Ning Ma
Shuchang Xu
Hao Chen
223
1
0
30 Sep 2024
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
Neural Information Processing Systems (NeurIPS), 2024
Kun Yuan
V. Srivastav
Nassir Navab
N. Padoy
399
23
0
30 Sep 2024
MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation
Lijian Xu
Hao Sun
Ziyu Ni
Jiaming Song
Shaoting Zhang
LM&MA
232
6
0
29 Sep 2024
Leveraging Surgical Activity Grammar for Primary Intention Prediction in Laparoscopy Procedures
IEEE International Conference on Robotics and Automation (ICRA), 2024
Jie Zhang
Song Zhou
Yiwei Wang
Chidan Wan
Huan Zhao
Xiong Cai
Han Ding
275
1
0
29 Sep 2024
The FIX Benchmark: Extracting Features Interpretable to eXperts
Helen Jin
Shreya Havaldar
Chaehyeon Kim
Anton Xue
Weiqiu You
...
Bhuvnesh Jain
Amin Madani
M. Sako
Lyle Ungar
Eric Wong
387
3
0
20 Sep 2024
SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba
Xiangning Zhang
Qingwei Zhang
Jinnan Chen
Shilun Cai
Shilun Cai
XiaoBo Li
Xiaobo Li
Dahong Qian
Mamba
415
2
0
18 Sep 2024
DACAT: Dual-stream Adaptive Clip-aware Time Modeling for Robust Online Surgical Phase Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Kaixiang Yang
Qiang Li
Zhiwei Wang
206
3
0
10 Sep 2024
VidLPRO: A
V
i
d
‾
\underline{Vid}
Vi
d
eo-
L
‾
\underline{L}
L
anguage
P
‾
\underline{P}
P
re-training Framework for
R
o
‾
\underline{Ro}
R
o
botic and Laparoscopic Surgery
Mohammadmahdi Honarmand
Muhammad Abdullah Jamal
Omid Mohareri
352
5
0
07 Sep 2024
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery
Adrito Das
Danyal Z. Khan
Dimitrios Psychogyios
Yitong Zhang
John G. Hanrahan
...
Santiago Rodriguez
Pablo Arbelaez
Danail Stoyanov
Hani J. Marcus
Sophia Bano
212
11
0
02 Sep 2024
Revisiting Surgical Instrument Segmentation Without Human Intervention: A Graph Partitioning View
Mingyu Sheng
Jianan Fan
Dongnan Liu
Ron Kikinis
Weidong Cai
255
4
0
27 Aug 2024
SurGen: Text-Guided Diffusion Model for Surgical Video Generation
Joseph Cho
Samuel Schmidgall
C. Zakka
Mrudang Mathur
Dhamanpreet Kaur
R. Shad
W. Hiesinger
VGen
MedIm
308
18
0
26 Aug 2024
Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resections with Pringle Maneuver
AAAI Conference on Artificial Intelligence (AAAI), 2024
Diandian Guo
Weixin Si
Zhixi Li
Jialun Pei
Pheng-Ann Heng
219
7
0
20 Aug 2024
SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models
Danush Kumar Venkatesh
Dominik Rivoir
Micha Pfeiffer
Stefanie Speidel
MedIm
309
6
0
19 Aug 2024
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learning
Jiajie Li
Garrett C Skinner
Gene Yang
Brian R Quaranto
Steven D. Schwaitzberg
Peter C W Kim
Jinjun Xiong
236
21
0
15 Aug 2024
Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Shu Yang
Luyang Luo
Qiong Wang
Hao Chen
MedIm
162
23
0
07 Aug 2024
SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction
cCaughan Koksal
Ghazal Ghazaei
Felix Holm
Azade Farshad
Nassir Navab
MedIm
225
9
0
29 Jul 2024
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision
Tim J. M. Jaspers
Ronald L.P.D. de Jong
Yasmina Alkhalil
Tijn Zeelenberg
C. H. Kusters
...
Franciscus Hendericus Aäron Bakker
J P Ruurda
Willem M. Brinkman
Peter H. N. de With
Fons van der Sommen
286
9
0
25 Jul 2024
MuST: Multi-Scale Transformers for Surgical Phase Recognition
Alejandra Pérez
Santiago Rodríguez
Nicolás Ayobi
Nicolás Aparicio
Eugénie Dessevres
Pablo Arbelaez
MedIm
224
7
0
24 Jul 2024
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video
Guiqiu Liao
M. Jogan
Sai Koushik
Eric Eaton
Daniel A. Hashimoto
VOS
389
3
0
22 Jul 2024
Previous
1
2
3
4
5
6
7
Next