Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1602.03012
Cited By
v1
v2 (latest)
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos
9 February 2016
A. P. Twinanda
S. Shehata
Didier Mutter
J. Marescaux
M. de Mathelin
N. Padoy
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos"
50 / 317 papers shown
Title
Video Dataset for Surgical Phase, Keypoint, and Instrument Recognition in Laparoscopic Surgery (PhaKIR)
Tobias Rueckert
Raphaela Maerkl
D. Rauber
Leonard Klausmann
Max Gutbrod
Daniel Rueckert
Hubertus Feussner
Dirk Wilhelm
Christoph Palm
4
0
0
09 Nov 2025
SurgiATM: A Physics-Guided Plug-and-Play Model for Deep Learning-Based Smoke Removal in Laparoscopic Surgery
Mingyu Sheng
Jianan Fan
Dongnan Liu
Guoyan Zheng
Ron Kikinis
Weidong (Tom) Cai
32
0
0
07 Nov 2025
T-FIX: Text-Based Explanations with Features Interpretable to eXperts
Shreya Havaldar
Helen Jin
Chaehyeon Kim
Anton Xue
Weiqiu You
...
Rajat Deo
Sameed Ahmed M. Khatana
Gary E. Weissman
Lyle Ungar
Eric Wong
24
0
0
06 Nov 2025
Adaptive transfer learning for surgical tool presence detection in laparoscopic videos through gradual freezing fine-tuning
Ana Davila
Jacinto Colan
Y. Hasegawa
48
0
0
17 Oct 2025
Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model
Danush Kumar Venkatesh
Adam Schmidt
Muhammad Abdullah Jamal
Omid Mohareri
VGen
MedIm
36
0
0
07 Oct 2025
Decoding the Surgical Scene: A Scoping Review of Scene Graphs in Surgery
Angelo Henriques
Korab Hoxha
Daniel Zapp
Peter Charbel Issa
Nassir Navab
M. A. Nasseri
40
0
0
25 Sep 2025
Surgical Video Understanding with Label Interpolation
Garam Kim
Tae Kyeong Jeong
Juyoun Park
52
0
0
23 Sep 2025
Multi-scale Temporal Prediction via Incremental Generation and Multi-agent Collaboration
Zhitao Zeng
Guojian Yuan
Junyuan Mao
Yuxuan Wang
Xiaoshuang Jia
Yueming Jin
130
0
0
22 Sep 2025
EyePCR: A Comprehensive Benchmark for Fine-Grained Perception, Knowledge Comprehension and Clinical Reasoning in Ophthalmic Surgery
Gui Wang
Yang Wennuo
Xusen Ma
Zehao Zhong
Zhuoru Wu
Ende Wu
Rong Qu
W. Cheah
Jianfeng Ren
Linlin Shen
87
0
0
19 Sep 2025
Leveraging Generic Foundation Models for Multimodal Surgical Data Analysis
Simon Pezold
Jérôme A. Kurylec
Jan S. Liechti
Beat P. Müller
Joël L. Lavanchy
16
0
0
08 Sep 2025
SurgLLM: A Versatile Large Multimodal Model with Spatial Focus and Temporal Awareness for Surgical Video Understanding
Zhen Chen
Xingjian Luo
Kun Yuan
J. Wu
Danny Tat Ming Chan
Nassir Navab
Hongbin Liu
Zhen Lei
Jiebo Luo
88
1
0
30 Aug 2025
Identifying Surgical Instruments in Laparoscopy Using Deep Learning Instance Segmentation
International Conference on Content-Based Multimedia Indexing (CBMI), 2019
Sabrina Kletz
Klaus Schoeffmann
Jenny Benois-Pineau
Heinrich Husslein
48
39
0
29 Aug 2025
GLENDA: Gynecologic Laparoscopy Endometriosis Dataset
Conference on Multimedia Modeling (MMM), 2019
Andreas Leibetseder
Sabrina Kletz
Klaus Schoeffmann
Simon Keckstein
Jörg Keckstein
44
30
0
29 Aug 2025
ROBUST-MIPS: A Combined Skeletal Pose and Instance Segmentation Dataset for Laparoscopic Surgical Instruments
Zhe Han
Charlie Budd
Gongyu Zhang
Huanyu Tian
Christos Bergeles
Tom Vercauteren
66
1
0
27 Aug 2025
OctreeNCA: Single-Pass 184 MP Segmentation on Consumer Hardware
Nick Lemke
John Kalkhof
Niklas Babendererde
Anirban Mukhopadhyay
36
0
0
09 Aug 2025
Object Recognition Datasets and Challenges: A Review
Aria Salari
Abtin Djavadifar
Xiangrui Liu
Homayoun Najjaran
ObjD
114
66
0
30 Jul 2025
StepAL: Step-aware Active Learning for Cataract Surgical Videos
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Nisarg A. Shah
Bardia Safaei
S. Sikder
S. Vedula
Vishal M. Patel
84
1
0
29 Jul 2025
Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning
Ruizhe Chen
Zhiting Fan
Tianze Luo
Heqing Zou
Zhaopeng Feng
Guiyang Xie
Hansheng Zhang
Zhuochen Wang
Zuozhu Liu
Huaijian Zhang
AI4TS
53
5
0
24 Jul 2025
CPKD: Clinical Prior Knowledge-Constrained Diffusion Models for Surgical Phase Recognition in Endoscopic Submucosal Dissection
Xiangning Zhang
Jinnan Chen
Qingwei Zhang
Yaqi Wang
Chengfeng Zhou
XiaoBo Li
Dahong Qian
MedIm
123
0
0
04 Jul 2025
SurgiSR4K: A High-Resolution Endoscopic Video Dataset for Robotic-Assisted Minimally Invasive Procedures
Fengyi Jiang
Xiaorui Zhang
Lingbo Jin
Ruixing Liang
Yuxin Chen
...
Wenqing Sun
Cong Gao
Hallie McNamara
Jingpei Lu
Omid Mohareri
96
0
0
30 Jun 2025
SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model
Guankun Wang
Junyi Wang
Wenjin Mo
Long Bai
Kun Yuan
...
N. Padoy
Zhen Lei
Hongbin Liu
Nassir Navab
Hongliang Ren
94
1
0
22 Jun 2025
orGAN: A Synthetic Data Augmentation Pipeline for Simultaneous Generation of Surgical Images and Ground Truth Labels
Niran Nataraj
Maina Sogabe
Kenji Kawashima
MedIm
101
0
0
17 Jun 2025
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis
Jianhui Wei
Zikai Xiao
Danyu Sun
Luqi Gong
Zongxin Yang
Zuozhu Liu
Jian Wu
104
3
0
09 Jun 2025
Challenging Vision-Language Models with Surgical Data: A New Dataset and Broad Benchmarking Study
Leon D. Mayer
Tim Radsch
Dominik Michael
Lucas Luttner
Amine Yamlahi
...
Patrick Godau
Marcel Knopp
Annika Reinke
Fiona Kolbinger
Lena Maier-Hein
150
0
0
06 Jun 2025
SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence
Zhitao Zeng
Zhu Zhuo
Xiaojun Jia
Erli Zhang
Junde Wu
...
Xiaochun Cao
Yutong Ban
Qi Dou
Yang Liu
Yueming Jin
VLM
218
6
0
03 Jun 2025
FORLA: Federated Object-centric Representation Learning with Slot Attention
Guiqiu Liao
M. Jogan
Eric Eaton
Daniel A. Hashimoto
FedML
141
1
0
03 Jun 2025
SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Ssharvien Kumar Sivakumar
Yannik Frisch
Ghazal Ghazaei
Anirban Mukhopadhyay
VGen
199
1
0
03 Jun 2025
Large-scale Self-supervised Video Foundation Model for Intelligent Surgery
Shu Yang
F. Zhou
Leon D. Mayer
Fuxiang Huang
Yiliang Chen
...
Zheng Li
Jing Qin
J. Teoh
Lena Maier-Hein
Hao-tao Chen
170
3
0
03 Jun 2025
SemiVT-Surge: Semi-Supervised Video Transformer for Surgical Phase Recognition
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Yiping Li
Ronald L.P.D. de Jong
Sahar Nasirihaghighi
Tim J. M. Jaspers
Romy van Jaarsveld
...
Richard van Hillegersberg
Fons van der Sommen
J P Ruurda
M. Breeuwer
Yasmina al Khalil
MedIm
140
3
0
02 Jun 2025
ProstaTD: Bridging Surgical Triplet from Classification to Fully Supervised Detection
Yiliang Chen
Zhixi Li
Cheng Xu
Alex Qinyang Liu
Ruize Cui
Xuemiao Xu
J. Teoh
Shengfeng He
Jing Qin
195
0
0
01 Jun 2025
EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding
Ege Özsoy
Arda Mamur
Felix Tristram
Chantal Pellegrini
Magdalena Wysocki
Benjamin Busam
Nassir Navab
95
3
0
30 May 2025
Lightweight Relational Embedding in Task-Interpolated Few-Shot Networks for Enhanced Gastrointestinal Disease Classification
Conference on Algebraic Informatics (AI), 2024
Xinliu Zhong
Leo Hwa Liang
Angela S. Koh
Yeo Si Yong
217
1
0
30 May 2025
EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
Shengyuan Liu
Boyun Zheng
Wenting Chen
Zhihao Peng
Zhenfei Yin
Jing Shao
Jiancong Hu
Yixuan Yuan
ELM
211
6
0
29 May 2025
Specialized Foundation Models for Intelligent Operating Rooms
Ege Özsoy
Chantal Pellegrini
David Bani-Harouni
Kun Yuan
Matthias Keicher
Nassir Navab
135
1
0
19 May 2025
ReSW-VL: Representation Learning for Surgical Workflow Analysis Using Vision-Language Model
Satoshi Kondo
90
0
0
19 May 2025
Surgical Foundation Model Leveraging Compression and Entropy Maximization for Image-Guided Surgical Assistance
Lianhao Yin
O. Meireles
Guy Rosman
Daniela Rus
97
0
0
16 May 2025
You Are Your Best Teacher: Semi-Supervised Surgical Point Tracking with Cycle-Consistent Self-Distillation
Valay Bundele
Mehran Hosseinzadeh
Hendrik Lensch
196
0
0
09 May 2025
Sim2Real in endoscopy segmentation with a novel structure aware image translation
Clara Tomasini
L. Riazuelo
Ana C. Murillo
MedIm
158
0
0
05 May 2025
Multimodal Graph Representation Learning for Robust Surgical Workflow Recognition with Adversarial Feature Disentanglement
Information Fusion (Inf. Fusion), 2025
Long Bai
Boyi Ma
Ruohan Wang
Guankun Wang
Beilei Cui
...
Mobarakol Islam
Zhe Min
Jiewen Lai
Nassir Navab
Hongliang Ren
194
2
0
03 May 2025
Multi-Stage Boundary-Aware Transformer Network for Action Segmentation in Untrimmed Surgical Videos
Computer Vision and Image Understanding (CVIU), 2025
Rezowan Shuvo
M S Mekala
Eyad Elyan
MedIm
702
1
0
26 Apr 2025
Surgeons vs. Computer Vision: A comparative analysis on surgical phase recognition capabilities
International Journal of Computer Assisted Radiology and Surgery (IJCARS), 2025
Marco Mezzina
Pieter De Backer
Tom Vercauteren
Matthew B. Blaschko
Alexandre Mottrie
Tinne Tuytelaars
92
1
0
26 Apr 2025
Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections
Max Kirchner
Alexander C. Jenke
S. Bodenstedt
Fiona Kolbinger
Oliver Saldanha
Jakob N. Kather
M. Wagner
Stefanie Speidel
FedML
MedIm
261
3
0
23 Apr 2025
Temporal Propagation of Asymmetric Feature Pyramid for Surgical Scene Segmentation
Cheng Yuan
Yutong Ban
MedIm
198
0
0
18 Apr 2025
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Zhang
Robby T. Tan
Mamba
218
12
0
04 Apr 2025
Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence
Anita Rau
Mark Endo
Josiah Aklilu
Jaewoo Heo
Khaled Saab
Alberto Paderno
Jeffrey Jopling
F. C. Holsinger
Serena Yeung-Levy
174
2
0
03 Apr 2025
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
Ivan Sviridov
Amina Miftakhova
Artemiy Tereshchenko
Galina Zubkova
Pavel Blinov
Andrey Savchenko
LM&MA
229
3
0
26 Mar 2025
fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models
Saurav Sharma
Didier Mutter
N. Padoy
VLM
MedIm
141
0
0
25 Mar 2025
End-to-End Deep Learning for Real-Time Neuroimaging-Based Assessment of Bimanual Motor Skills
Aseem Subedi
Rahul Rahul
Lora Cavuoto
Steven D. Schwaitzberg
Matthew Hackett
Jack Norfleet
S. De
79
1
0
21 Mar 2025
Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding
International Journal of Computer Assisted Radiology and Surgery (IJCARS), 2025
David Gastager
Ghazal Ghazaei
Constantin Patsch
159
1
0
14 Mar 2025
CoStoDet-DDPM: Collaborative Training of Stochastic and Deterministic Models Improves Surgical Workflow Anticipation and Recognition
Kaixiang Yang
Xin Li
Qiang Li
Zhiwei Wang
194
0
0
13 Mar 2025
1
2
3
4
5
6
7
Next