2205.02805
Cited By
An Empirical Study on Activity Recognition in Long Surgical Videos
5 May 2022
Zhuohong He
A. Mottaghi
Aidean Sharghi
Muhammad Abdullah Jamal
Omid Mohareri
Papers citing
"An Empirical Study on Activity Recognition in Long Surgical Videos"
12 / 12 papers shown
Title
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Boyu Chen
Zhengrong Yue
Siran Chen
Z. Wang
Yang Liu
Peng Li
Y. Wang
VLM
145
0
0
13 Mar 2025
Friends Across Time: Multi-Scale Action Segmentation Transformer for Surgical Phase Recognition
Bokai Zhang
Jiayuan Meng
Bin Cheng
Dean Biskup
Svetlana Petculescu
Angela Chapman
ViT
MedIm
21
0
0
22 Jan 2024
ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room
Idris Hamoud
Muhammad Abdullah Jamal
V. Srivastav
Didier Mutter
N. Padoy
Omid Mohareri
21
2
0
19 Dec 2023
Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers
Sahar Nasirihaghighi
Negin Ghamsarian
Heinrich Husslein
Klaus Schoeffmann
12
2
0
01 Dec 2023
M³3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
16
1
0
26 Sep 2023
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Krell
Stefanie Speidel
18
3
0
19 Jul 2023
Self-Knowledge Distillation for Surgical Phase Recognition
Jinglu Zhang
S. Barbarisi
A. Kadkhodamohammadi
Danail Stoyanov
Imanol Luengo
25
4
0
15 Jun 2023
Metrics Matter in Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Speidel
25
8
0
23 May 2023
SurgMAE: Masked Autoencoders for Long Surgical Video Analysis
Muhammad Abdullah Jamal
Omid Mohareri
23
5
0
19 May 2023
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
Dominik Rivoir
Isabel Funke
Stefanie Speidel
19
15
0
15 Mar 2022
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,981
0
09 Feb 2021
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos
A. P. Twinanda
S. Shehata
Didier Mutter
J. Marescaux
M. de Mathelin
N. Padoy
170
840
0
09 Feb 2016