ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.04838
  4. Cited By
TransNet V2: An effective deep network architecture for fast shot
  transition detection

TransNet V2: An effective deep network architecture for fast shot transition detection

11 August 2020
Tomás Soucek
Jakub Lokoč
ArXiv (abs)PDFHTML

Papers citing "TransNet V2: An effective deep network architecture for fast shot transition detection"

50 / 60 papers shown
Title
Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
Boyang Wang
Xuweiyi Chen
Matheus Gadelha
Zezhou Cheng
DiffMVGen
74
0
0
27 May 2025
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations
Ziqiao Peng
Yanbo Fan
Haoyu Wu
Xuan Wang
Hongyan Liu
Jun He
Zhaoxin Fan
35
2
0
23 May 2025
DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation
Junhao Chen
Mingjin Chen
Jianjin Xu
Xiang Li
Junting Dong
...
Hongxiang Li
Yuhang Yang
Hao Zhao
Xiaoxiao Long
Ruqi Huang
DiffMVGen
66
0
0
23 May 2025
Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios
Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios
Huafeng Shi
Jianzhong Liang
Rongchang Xie
Xian Wu
Cheng Chen
Chang Liu
VGen
85
0
0
14 May 2025
Towards Understanding Camera Motions in Any Video
Towards Understanding Camera Motions in Any Video
Zhiqiu Lin
Siyuan Cen
Daniel Jiang
Jay Karhade
Hewei Wang
...
Rushikesh Zawar
Xue Bai
Yilun Du
Chuang Gan
Deva Ramanan
VGen
101
3
0
21 Apr 2025
Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform
Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform
Xianpan Zhou
VGen
104
0
0
21 Apr 2025
SkyReels-V2: Infinite-length Film Generative Model
SkyReels-V2: Infinite-length Film Generative Model
Guibin Chen
D. Lin
Jiangping Yang
Chunze Lin
J. Zhu
...
Di Qiu
Debang Li
Zhengcong Fei
Yang Li
Yahui Zhou
DiffMVGen
119
10
0
17 Apr 2025
A Lightweight Moment Retrieval System with Global Re-Ranking and Robust Adaptive Bidirectional Temporal Search
A Lightweight Moment Retrieval System with Global Re-Ranking and Robust Adaptive Bidirectional Temporal Search
Tinh-Anh Nguyen-Nhu
H. Tran
Nguyen-Khang Le
Minh-Nhat Nguyen
T. Nguyen
...
Huu-Phong Phan-Nguyen
Huy-Thach Pham
Quan Nguyen
Hoang M. Le
Quang-Vinh Dinh
99
0
0
12 Apr 2025
Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking
Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking
H. Tran
Tinh-Anh Nguyen-Nhu
Huu-Phong Phan-Nguyen
T. Nguyen
Nhat-Minh Nguyen-Dich
Anh Dao
Huy-Duc Do
Quan Nguyen
Hoang M. Le
Quang-Vinh Dinh
73
0
0
11 Apr 2025
FMNV: A Dataset of Media-Published News Videos for Fake News Detection
FMNV: A Dataset of Media-Published News Videos for Fake News Detection
Yihao Wang
Zhong Qian
Peifeng Li
69
0
0
10 Apr 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
148
1
0
10 Apr 2025
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
Boyuan Wang
Xiaofeng Wang
Chaojun Ni
Guosheng Zhao
Zhiqin Yang
...
Yukun Zhou
Xinze Chen
Guan Huang
Lihong Liu
Xingang Wang
VGen
111
3
0
31 Mar 2025
Parameter-free Video Segmentation for Vision and Language Understanding
Louis Mahon
Mirella Lapata
VLM
76
2
0
03 Mar 2025
Faster than real-time detection of shot boundaries, sampling structure and dynamic keyframes in video
Faster than real-time detection of shot boundaries, sampling structure and dynamic keyframes in video
Hannes Fassold
126
0
0
13 Feb 2025
Multi-subject Open-set Personalization in Video Generation
Multi-subject Open-set Personalization in Video Generation
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Yuwei Fang
Kwot Sin Lee
Ivan Skorokhodov
Kfir Aberman
Jun-Yan Zhu
Ming-Hsuan Yang
Sergey Tulyakov
DiffMVGen
192
13
0
10 Jan 2025
SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model
  with Transparent Explanations
SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations
Zhiwen Chen
Francesco Pinto
Minzhou Pan
Bo Li
107
5
0
09 Dec 2024
GameGen-X: Interactive Open-world Game Video Generation
GameGen-X: Interactive Open-world Game Video Generation
Haoxuan Che
Xuanhua He
Quande Liu
Cheng Jin
Hao Chen
VGen
135
25
0
01 Nov 2024
Pseudo Dataset Generation for Out-of-Domain Multi-Camera View
  Recommendation
Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation
Kuan-Ying Lee
Qian Zhou
Klara Nahrstedt
76
0
0
17 Oct 2024
ScreenWriter: Automatic Screenplay Generation and Movie Summarisation
ScreenWriter: Automatic Screenplay Generation and Movie Summarisation
Louis Mahon
Mirella Lapata
68
3
0
17 Oct 2024
FakingRecipe: Detecting Fake News on Short Video Platforms from the
  Perspective of Creative Process
FakingRecipe: Detecting Fake News on Short Video Platforms from the Perspective of Creative Process
Yuyan Bu
Qiang Sheng
Juan Cao
Peng Qi
Danding Wang
Jintao Li
DiffM
75
13
0
23 Jul 2024
An Empirical Comparison of Video Frame Sampling Methods for Multi-Modal
  RAG Retrieval
An Empirical Comparison of Video Frame Sampling Methods for Multi-Modal RAG Retrieval
Mahesh Kandhare
Thibault Gisselbrecht
81
5
0
22 Jul 2024
Multilingual Synopses of Movie Narratives: A Dataset for Story
  Understanding
Multilingual Synopses of Movie Narratives: A Dataset for Story Understanding
Yidan Sun
Jianfei Yu
Boyang Li
104
0
0
18 Jun 2024
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision
  Models For Video Captioning and Summarization
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization
Richard Luo
Austin Peng
Adithya Vasudev
Rishabh Jain
44
2
0
31 May 2024
An Integrated Framework for Multi-Granular Explanation of Video
  Summarization
An Integrated Framework for Multi-Granular Explanation of Video Summarization
K. Tsigos
Evlampios Apostolidis
Vasileios Mezaris
59
1
0
16 May 2024
LLM-AD: Large Language Model based Audio Description System
LLM-AD: Large Language Model based Audio Description System
Peng Chu
Jiang Wang
Andre Abrantes
59
4
0
02 May 2024
Towards Automated Movie Trailer Generation
Towards Automated Movie Trailer Generation
Dawit Mureja Argaw
Mattia Soldan
Alejandro Pardo
Chen Zhao
Fabian Caba Heilbron
Joon Son Chung
Guohao Li
ViT
123
6
0
04 Apr 2024
TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial
  Creation on Physical Tasks
TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial Creation on Physical Tasks
Yuexi Chen
Vlad I. Morariu
Anh Truong
Zhicheng Liu
DiffMVGen
72
5
0
12 Mar 2024
Find the Cliffhanger: Multi-Modal Trailerness in Soap Operas
Find the Cliffhanger: Multi-Modal Trailerness in Soap Operas
Carlo Bretti
Pascal Mettes
Hendrik Vincent Koops
Daan Odijk
Nanne van Noord
71
4
0
29 Jan 2024
Large Model based Sequential Keyframe Extraction for Video Summarization
Large Model based Sequential Keyframe Extraction for Video Summarization
Kailong Tan
Yuxiang Zhou
Qianchen Xia
Rui Liu
Yong Chen
67
8
0
10 Jan 2024
Facilitating the Production of Well-tailored Video Summaries for Sharing
  on Social Media
Facilitating the Production of Well-tailored Video Summaries for Sharing on Social Media
Evlampios Apostolidis
Konstantinos Apostolidis
Vasileios Mezaris
76
1
0
05 Dec 2023
Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain
  Adaptation
Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation
Linzi Xing
Quan Tran
Fabian Caba
Franck Dernoncourt
Seunghyun Yoon
Zhaowen Wang
Trung Bui
Giuseppe Carenini
104
1
0
30 Nov 2023
Latent Wander: an Alternative Interface for Interactive and
  Serendipitous Discovery of Large AV Archives
Latent Wander: an Alternative Interface for Interactive and Serendipitous Discovery of Large AV Archives
Yuchen Yang
Linyida Zhang
55
2
0
09 Oct 2023
MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic
  Video Segmentation
MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation
Najmeh Sadoughi
Xinyu Li
Avijit Vajpayee
D. Fan
Bing Shuai
H. Santos-Villalobos
Vimal Bhat
M. Rohith
75
4
0
22 Aug 2023
Long-range Multimodal Pretraining for Movie Understanding
Long-range Multimodal Pretraining for Movie Understanding
Dawit Mureja Argaw
Joon-Young Lee
Markus Woodson
In So Kweon
Fabian Caba Heilbron
VLM
77
9
0
18 Aug 2023
Meta-Personalizing Vision-Language Models to Find Named Instances in
  Video
Meta-Personalizing Vision-Language Models to Find Named Instances in Video
Chun-Hsiao Yeh
Bryan C. Russell
Josef Sivic
Fabian Caba Heilbron
Simon Jenni
VLMMLLM
101
11
0
16 Jun 2023
AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary
  Detection
AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection
Wentao Zhu
Yufang Huang
Xi Xie
Wenxian Liu
Jincan Deng
Debing Zhang
Zhangyang Wang
Ji Liu
68
17
0
12 Apr 2023
Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for
  Multi-modal Highlight Detection in Movies
Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Bei Gan
Xiujun Shu
Ruizhi Qiao
Haoqian Wu
Keyun Chen
Hanjun Li
Bohan Ren
53
5
0
26 Mar 2023
Bridging the Emotional Semantic Gap via Multimodal Relevance Estimation
Bridging the Emotional Semantic Gap via Multimodal Relevance Estimation
Chuan Zhang
Daoxin Zhang
Ruixiu Zhang
Jiawei Li
Jianke Zhu
78
1
0
03 Feb 2023
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene
  Segmentation
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
79
2
0
09 Dec 2022
Zero-shot Video Moment Retrieval With Off-the-Shelf Models
Zero-shot Video Moment Retrieval With Off-the-Shelf Models
Anuj Diwan
Puyuan Peng
Raymond J. Mooney
VLM
67
3
0
03 Nov 2022
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long
  Livestream Videos
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
Jielin Qiu
Franck Dernoncourt
Trung Bui
Zhaowen Wang
Ding Zhao
Hailin Jin
AI4TS
65
5
0
12 Oct 2022
Match Cutting: Finding Cuts with Smooth Visual Transitions
Match Cutting: Finding Cuts with Smooth Visual Transitions
Boris Chen
Amir Ziai
Rebecca Tucker
Yuchen Xie
VGen
100
14
0
11 Oct 2022
Analysing the Memorability of a Procedural Crime-Drama TV Series, CSI
Analysing the Memorability of a Procedural Crime-Drama TV Series, CSI
Sean Cummins
Lorin Sweeney
Alan F. Smeaton
69
1
0
06 Aug 2022
The Anatomy of Video Editing: A Dataset and Benchmark Suite for
  AI-Assisted Video Editing
The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing
Dawit Mureja Argaw
Fabian Caba Heilbron
Joon-Young Lee
Markus Woodson
In So Kweon
VGen
100
25
0
20 Jul 2022
Pixel-level Correspondence for Self-Supervised Learning from Video
Pixel-level Correspondence for Self-Supervised Learning from Video
Yash Sharma
Yi Zhu
Chris Russell
Thomas Brox
SSL
49
4
0
08 Jul 2022
OS-MSL: One Stage Multimodal Sequential Link Framework for Scene
  Segmentation and Classification
OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Ye Liu
Lingfeng Qiao
Di Yin
Zhuoxuan Jiang
Xinghua Jiang
Deqiang Jiang
Bo Ren
52
7
0
04 Jul 2022
ClothFormer:Taming Video Virtual Try-on in All Module
ClothFormer:Taming Video Virtual Try-on in All Module
Jianbin Jiang
Tan Wang
He Yan
Junhui Liu
86
28
0
26 Apr 2022
Movie Genre Classification by Language Augmentation and Shot Sampling
Movie Genre Classification by Language Augmentation and Shot Sampling
Zhongping Zhang
Yiwen Gu
Bryan A. Plummer
Xin Miao
Jiayi Liu
Huayan Wang
VLMCLIP
61
1
0
24 Mar 2022
PACS: A Dataset for Physical Audiovisual CommonSense Reasoning
PACS: A Dataset for Physical Audiovisual CommonSense Reasoning
Samuel Yu
Peter Wu
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
LRM
117
16
0
21 Mar 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story
  Understanding
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
Yidan Sun
Qin Chao
Yangfeng Ji
Boyang Albert Li
VGen
79
11
0
11 Mar 2022
12
Next