1510.08893
A Deep Siamese Network for Scene Detection in Broadcast Videos
29 October 2015
Lorenzo Baraldi
C. Grana
Rita Cucchiara
Papers citing
"A Deep Siamese Network for Scene Detection in Broadcast Videos"
25 / 25 papers shown
Parameter-free Video Segmentation for Vision and Language Understanding
Louis Mahon
Mirella Lapata
VLM
76
2
0
03 Mar 2025
Modality-Aware Shot Relating and Comparing for Video Scene Detection
Jiawei Tan
Hongxing Wang
Kang Dang
Jiaxin Li
Zhilong Ou
65
0
0
23 Dec 2024
Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation
Linzi Xing
Quan Tran
Fabian Caba
Franck Dernoncourt
Seunghyun Yoon
Zhaowen Wang
Trung Bui
Giuseppe Carenini
104
1
0
30 Nov 2023
MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation
Najmeh Sadoughi
Xinyu Li
Avijit Vajpayee
D. Fan
Bing Shuai
H. Santos-Villalobos
Vimal Bhat
M. Rohith
75
4
0
22 Aug 2023
AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection
Wentao Zhu
Yufang Huang
Xi Xie
Wenxian Liu
Jincan Deng
Debing Zhang
Zhangyang Wang
Ji Liu
68
17
0
12 Apr 2023
Efficient Movie Scene Detection using State-Space Transformers
Md. Mohaiminul Islam
Mahmudul Hasan
Kishan Athrey
Tony Braskich
Gedas Bertasius
ViT
68
45
0
29 Dec 2022
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
79
2
0
09 Dec 2022
OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Ye Liu
Lingfeng Qiao
Di Yin
Zhuoxuan Jiang
Xinghua Jiang
Deqiang Jiang
Bo Ren
52
7
0
04 Jul 2022
AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism
Yang Zhao
Xuan Lin
Wenqiang Xu
Maozong Zheng
Zhengyong Liu
Zhou Zhao
104
2
0
10 Jun 2022
Learnable Optimal Sequential Grouping for Video Scene Detection
Daniel Rotman
Yevgeny Yaroker
Elad Amrani
Udi Barzelay
Rami Ben-Ari
33
10
0
17 May 2022
Scene Consistency Representation Learning for Video Scene Segmentation
Haoqian Wu
Keyu Chen
Yanan Luo
Ruizhi Qiao
Bo Ren
Haozhe Liu
Weicheng Xie
Linlin Shen
SSL
83
16
0
11 May 2022
Movie Genre Classification by Language Augmentation and Shot Sampling
Zhongping Zhang
Yiwen Gu
Bryan A. Plummer
Xin Miao
Jiayi Liu
Huayan Wang
VLM
CLIP
61
1
0
24 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
98
15
0
01 Mar 2022
Boundary-aware Self-supervised Learning for Video Scene Segmentation
Jonghwan Mun
Minchul Shin
Gunsoo Han
Sangho Lee
S. Ha
Joonseok Lee
Eun-Sol Kim
SSL
98
20
0
14 Jan 2022
Overview of Tencent Multi-modal Ads Video Understanding Challenge
Zhenzhi Wang
Liyu Wu
Zhimin Li
Jiangfeng Xiong
Qinglin Lu
58
4
0
16 Sep 2021
Shot Contrastive Self-Supervised Learning for Scene Boundary Detection
Shixing Chen
Xiaohan Nie
David D. Fan
Dongqing Zhang
Vimal Bhat
Raffay Hamid
SSL
77
62
0
28 Apr 2021
TransNet V2: An effective deep network architecture for fast shot transition detection
Tomás Soucek
Jakub Lokoč
93
124
0
11 Aug 2020
MovieNet: A Holistic Dataset for Movie Understanding
Qingqiu Huang
Yu Xiong
Anyi Rao
Jiaze Wang
Dahua Lin
VGen
109
244
0
21 Jul 2020
Motion2Vec: Semi-Supervised Representation Learning from Surgical Videos
A. Tanwani
P. Sermanet
Andy Yan
Raghav V. Anand
Mariano Phielipp
Ken Goldberg
SSL
68
36
0
31 May 2020
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Anyi Rao
Linning Xu
Yu Xiong
Guodong Xu
Qingqiu Huang
Bolei Zhou
Dahua Lin
122
112
0
06 Apr 2020
Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset
Arpan Gupta
S. Muthiah
46
6
0
10 Jan 2019
Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks
Ahmed Hassanien
Mohamed A. Elgharib
Ahmed A. S. Seleim
Sung-Ho Bae
M. Hefeeda
Wojciech Matusik
64
51
0
09 May 2017
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
82
192
0
28 Nov 2016
Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks
Lorenzo Baraldi
C. Grana
Rita Cucchiara
76
49
0
05 Oct 2016
Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features
Lorenzo Baraldi
C. Grana
Rita Cucchiara
42
9
0
09 Apr 2016