Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.04208
Cited By
Condensed Movies: Story Based Retrieval with Contextual Embeddings
8 May 2020
Max Bain
Arsha Nagrani
A. Brown
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Condensed Movies: Story Based Retrieval with Contextual Embeddings"
20 / 70 papers shown
Title
Hierarchical Self-supervised Representation Learning for Movie Understanding
Fanyi Xiao
Kaustav Kundu
Joseph Tighe
Davide Modolo
SSL
37
24
0
06 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
36
101
0
04 Apr 2022
Learning Audio-Video Modalities from Image Captions
Arsha Nagrani
Paul Hongsuck Seo
Bryan Seybold
Anja Hauth
Santiago Manén
Chen Sun
Cordelia Schmid
CLIP
11
82
0
01 Apr 2022
Movie Genre Classification by Language Augmentation and Shot Sampling
Zhongping Zhang
Yiwen Gu
Bryan A. Plummer
Xin Miao
Jiayi Liu
Huayan Wang
VLM
CLIP
9
1
0
24 Mar 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
Yidan Sun
Qin Chao
Yangfeng Ji
Boyang Albert Li
VGen
22
10
0
11 Mar 2022
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge
A. Brown
Jaesung Huh
Joon Son Chung
Arsha Nagrani
Daniel Garcia-Romero
Andrew Zisserman
21
40
0
12 Jan 2022
Masking Modalities for Cross-modal Video Retrieval
Valentin Gabeur
Arsha Nagrani
Chen Sun
Alahari Karteek
Cordelia Schmid
11
29
0
01 Nov 2021
MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Bernard Ghanem
VGen
29
28
0
12 Sep 2021
Learning to Cut by Watching Movies
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Bernard Ghanem
VGen
43
20
0
09 Aug 2021
Towards Long-Form Video Understanding
Chaoxia Wu
Philipp Krahenbuhl
VLM
ViT
36
165
0
21 Jun 2021
Face, Body, Voice: Video Person-Clustering with Multiple Modalities
Andrew Brown
Vicky Kalogeiton
Andrew Zisserman
CVBM
15
29
0
20 May 2021
Visual Semantic Role Labeling for Video Understanding
Arka Sadhu
Tanmay Gupta
Mark Yatskar
Ram Nevatia
Aniruddha Kembhavi
VLM
12
68
0
02 Apr 2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
VGen
34
1,124
0
01 Apr 2021
On Semantic Similarity in Video Retrieval
Michael Wray
Hazel Doughty
Dima Damen
16
66
0
18 Mar 2021
Automated Video Labelling: Identifying Faces by Corroborative Evidence
Andrew Brown
Ernesto Coto
Andrew Zisserman
CVBM
13
15
0
10 Feb 2021
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge
Arsha Nagrani
Joon Son Chung
Jaesung Huh
Andrew Brown
Ernesto Coto
Weidi Xie
Mitchell McLaren
D. Reynolds
Andrew Zisserman
11
74
0
12 Dec 2020
Look Before you Speak: Visually Contextualized Utterances
Paul Hongsuck Seo
Arsha Nagrani
Cordelia Schmid
11
66
0
10 Dec 2020
QuerYD: A video dataset with high-quality text and audio narrations
Andreea-Maria Oncescu
João F. Henriques
Yang Liu
Andrew Zisserman
Samuel Albanie
VGen
6
11
0
22 Nov 2020
Playing a Part: Speaker Verification at the Movies
A. Brown
Jaesung Huh
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
8
23
0
29 Oct 2020
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
36
51
0
29 Mar 2020
Previous
1
2