Condensed Movies: Story Based Retrieval with Contextual Embeddings

8 May 2020

Papers citing "Condensed Movies: Story Based Retrieval with Contextual Embeddings"

20 / 70 papers shown

Title
Hierarchical Self-supervised Representation Learning for Movie Understanding Fanyi Xiao Kaustav Kundu Joseph Tighe Davide Modolo SSL 37 24 0 06 Apr 2022
Long Movie Clip Classification with State-Space Video Models Md. Mohaiminul Islam Gedas Bertasius VLM 36 101 0 04 Apr 2022
Learning Audio-Video Modalities from Image Captions Arsha Nagrani Paul Hongsuck Seo Bryan Seybold Anja Hauth Santiago Manén Chen Sun Cordelia Schmid CLIP 11 82 0 01 Apr 2022
Movie Genre Classification by Language Augmentation and Shot Sampling Zhongping Zhang Yiwen Gu Bryan A. Plummer Xin Miao Jiayi Liu Huayan Wang VLM CLIP 9 1 0 24 Mar 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding Yidan Sun Qin Chao Yangfeng Ji Boyang Albert Li VGen 22 10 0 11 Mar 2022
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge A. Brown Jaesung Huh Joon Son Chung Arsha Nagrani Daniel Garcia-Romero Andrew Zisserman 21 40 0 12 Jan 2022
Masking Modalities for Cross-modal Video Retrieval Valentin Gabeur Arsha Nagrani Chen Sun Alahari Karteek Cordelia Schmid 11 29 0 01 Nov 2021
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition Alejandro Pardo Fabian Caba Heilbron Juan Carlos León Alcázar Ali K. Thabet Bernard Ghanem VGen 29 28 0 12 Sep 2021
Learning to Cut by Watching Movies Alejandro Pardo Fabian Caba Heilbron Juan Carlos León Alcázar Ali K. Thabet Bernard Ghanem VGen 43 20 0 09 Aug 2021
Towards Long-Form Video Understanding Chaoxia Wu Philipp Krahenbuhl VLM ViT 36 165 0 21 Jun 2021
Face, Body, Voice: Video Person-Clustering with Multiple Modalities Andrew Brown Vicky Kalogeiton Andrew Zisserman CVBM 15 29 0 20 May 2021
Visual Semantic Role Labeling for Video Understanding Arka Sadhu Tanmay Gupta Mark Yatskar Ram Nevatia Aniruddha Kembhavi VLM 12 68 0 02 Apr 2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval Max Bain Arsha Nagrani Gül Varol Andrew Zisserman VGen 34 1,124 0 01 Apr 2021
On Semantic Similarity in Video Retrieval Michael Wray Hazel Doughty Dima Damen 16 66 0 18 Mar 2021
Automated Video Labelling: Identifying Faces by Corroborative Evidence Andrew Brown Ernesto Coto Andrew Zisserman CVBM 13 15 0 10 Feb 2021
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge Arsha Nagrani Joon Son Chung Jaesung Huh Andrew Brown Ernesto Coto Weidi Xie Mitchell McLaren D. Reynolds Andrew Zisserman 11 74 0 12 Dec 2020
Look Before you Speak: Visually Contextualized Utterances Paul Hongsuck Seo Arsha Nagrani Cordelia Schmid 11 66 0 10 Dec 2020
QuerYD: A video dataset with high-quality text and audio narrations Andreea-Maria Oncescu João F. Henriques Yang Liu Andrew Zisserman Samuel Albanie VGen 6 11 0 22 Nov 2020
Playing a Part: Speaker Verification at the Movies A. Brown Jaesung Huh Arsha Nagrani Joon Son Chung Andrew Zisserman 8 23 0 29 Oct 2020
Learning Interactions and Relationships between Movie Characters Anna Kukleva Makarand Tapaswi Ivan Laptev 36 51 0 29 Mar 2020