ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.21304
25
1

VideoSAM: A Large Vision Foundation Model for High-Speed Video Segmentation

22 October 2024
Chika Maduabuchi
Ericmoore Jossou
Matteo Bucci
ArXivPDFHTML
Abstract

High-speed video (HSV) segmentation is essential for analyzing dynamic physical processes in scientific and industrial applications, such as boiling heat transfer. Existing models like U-Net struggle with generalization and accurately segmenting complex bubble formations. We present VideoSAM, a specialized adaptation of the Segment Anything Model (SAM), fine-tuned on a diverse HSV dataset for phase detection. Through diverse experiments, VideoSAM demonstrates superior performance across four fluid environments -- Water, FC-72, Nitrogen, and Argon -- significantly outperforming U-Net in complex segmentation tasks. In addition to introducing VideoSAM, we contribute an open-source HSV segmentation dataset designed for phase detection, enabling future research in this domain. Our findings underscore VideoSAM's potential to set new standards in robust and accurate HSV segmentation. The code and dataset used in this study are available online atthis https URL.

View on arXiv
@article{maduabuchi2025_2410.21304,
  title={ VideoSAM: A Large Vision Foundation Model for High-Speed Video Segmentation },
  author={ Chika Maduabuchi and Ericmoore Jossou and Matteo Bucci },
  journal={arXiv preprint arXiv:2410.21304},
  year={ 2025 }
}
Comments on this paper