An Empirical Study on Activity Recognition in Long Surgical Videos

An Empirical Study on Activity Recognition in Long Surgical Videos

5 May 2022

Muhammad Abdullah Jamal

Omid Mohareri

Papers citing "An Empirical Study on Activity Recognition in Long Surgical Videos"

12 / 12 papers shown

Title
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents Boyu Chen Zhengrong Yue Siran Chen Z. Wang Yang Liu Peng Li Y. Wang VLM 145 0 0 13 Mar 2025
Friends Across Time: Multi-Scale Action Segmentation Transformer for Surgical Phase Recognition Bokai Zhang Jiayuan Meng Bin Cheng Dean Biskup Svetlana Petculescu Angela Chapman ViT MedIm 21 0 0 22 Jan 2024
ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room Idris Hamoud Muhammad Abdullah Jamal V. Srivastav Didier Mutter N. Padoy Omid Mohareri 21 2 0 19 Dec 2023
Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers Sahar Nasirihaghighi Negin Ghamsarian Heinrich Husslein Klaus Schoeffmann 12 2 0 01 Dec 2023
$M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding$ M $^{3}$ 3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding Muhammad Abdullah Jamal Omid Mohareri 3DPC 16 1 0 26 Sep 2023
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase Recognition Isabel Funke Dominik Rivoir Stefanie Krell Stefanie Speidel 18 3 0 19 Jul 2023
Self-Knowledge Distillation for Surgical Phase Recognition Jinglu Zhang S. Barbarisi A. Kadkhodamohammadi Danail Stoyanov Imanol Luengo 25 4 0 15 Jun 2023
Metrics Matter in Surgical Phase Recognition Isabel Funke Dominik Rivoir Stefanie Speidel 25 8 0 23 May 2023
SurgMAE: Masked Autoencoders for Long Surgical Video Analysis Muhammad Abdullah Jamal Omid Mohareri 23 5 0 19 May 2023
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis Dominik Rivoir Isabel Funke Stefanie Speidel 19 15 0 15 Mar 2022
Is Space-Time Attention All You Need for Video Understanding? Gedas Bertasius Heng Wang Lorenzo Torresani ViT 280 1,981 0 09 Feb 2021
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos A. P. Twinanda S. Shehata Didier Mutter J. Marescaux M. de Mathelin N. Padoy 170 840 0 09 Feb 2016