ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.02805
  4. Cited By
An Empirical Study on Activity Recognition in Long Surgical Videos

An Empirical Study on Activity Recognition in Long Surgical Videos

5 May 2022
Zhuohong He
A. Mottaghi
Aidean Sharghi
Muhammad Abdullah Jamal
Omid Mohareri
ArXivPDFHTML

Papers citing "An Empirical Study on Activity Recognition in Long Surgical Videos"

12 / 12 papers shown
Title
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Boyu Chen
Zhengrong Yue
Siran Chen
Z. Wang
Yang Liu
Peng Li
Y. Wang
VLM
145
0
0
13 Mar 2025
Friends Across Time: Multi-Scale Action Segmentation Transformer for
  Surgical Phase Recognition
Friends Across Time: Multi-Scale Action Segmentation Transformer for Surgical Phase Recognition
Bokai Zhang
Jiayuan Meng
Bin Cheng
Dean Biskup
Svetlana Petculescu
Angela Chapman
ViT
MedIm
21
0
0
22 Jan 2024
ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition
  in the Operating Room
ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room
Idris Hamoud
Muhammad Abdullah Jamal
V. Srivastav
Didier Mutter
N. Padoy
Omid Mohareri
21
2
0
19 Dec 2023
Event Recognition in Laparoscopic Gynecology Videos with Hybrid
  Transformers
Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers
Sahar Nasirihaghighi
Negin Ghamsarian
Heinrich Husslein
Klaus Schoeffmann
12
2
0
01 Dec 2023
M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for
  2D image and video understanding
M3^{3}33D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
16
1
0
26 Sep 2023
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical
  Phase Recognition
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Krell
Stefanie Speidel
18
3
0
19 Jul 2023
Self-Knowledge Distillation for Surgical Phase Recognition
Self-Knowledge Distillation for Surgical Phase Recognition
Jinglu Zhang
S. Barbarisi
A. Kadkhodamohammadi
Danail Stoyanov
Imanol Luengo
25
4
0
15 Jun 2023
Metrics Matter in Surgical Phase Recognition
Metrics Matter in Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Speidel
25
8
0
23 May 2023
SurgMAE: Masked Autoencoders for Long Surgical Video Analysis
SurgMAE: Masked Autoencoders for Long Surgical Video Analysis
Muhammad Abdullah Jamal
Omid Mohareri
23
5
0
19 May 2023
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A
  Study on Surgical Workflow Analysis
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
Dominik Rivoir
Isabel Funke
Stefanie Speidel
19
15
0
15 Mar 2022
Is Space-Time Attention All You Need for Video Understanding?
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,981
0
09 Feb 2021
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic
  Videos
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos
A. P. Twinanda
S. Shehata
Didier Mutter
J. Marescaux
M. de Mathelin
N. Padoy
170
840
0
09 Feb 2016
1