1801.03150
Cited By
Moments in Time Dataset: one million videos for event understanding
9 January 2018
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
Tom Yan
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
Papers citing
"Moments in Time Dataset: one million videos for event understanding"
50 / 268 papers shown
Title
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
9
319
0
14 Apr 2020
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Anyi Rao
Linning Xu
Yu Xiong
Guodong Xu
Qingqiu Huang
Bolei Zhou
Dahua Lin
6
111
0
06 Apr 2020
Speech2Action: Cross-modal Supervision for Action Recognition
Arsha Nagrani
Chen Sun
David A. Ross
Rahul Sukthankar
Cordelia Schmid
Andrew Zisserman
25
54
0
30 Mar 2020
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
36
51
0
29 Mar 2020
Omni-sourced Webly-supervised Learning for Video Recognition
Haodong Duan
Yue Zhao
Yuanjun Xiong
Wentao Liu
Dahua Lin
VLM
8
88
0
29 Mar 2020
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Zhuoyuan Wang
Tingyu Lin
Cheng Wu
SSL
28
18
0
16 Mar 2020
Top-1 Solution of Multi-Moments in Time Challenge 2019
Manyuan Zhang
Hao Shao
Guanglu Song
Yu Liu
Junjie Yan
17
3
0
12 Mar 2020
Evolving Losses for Unsupervised Video Representation Learning
A. Piergiovanni
A. Angelova
Michael S. Ryoo
SSL
6
138
0
26 Feb 2020
Bottom-Up Temporal Action Localization with Mutual Regularization
Peisen Zhao
Lingxi Xie
Chen Ju
Ya-Qin Zhang
Yanfeng Wang
Qi Tian
10
1
0
18 Feb 2020
ERA: A Dataset and Deep Learning Benchmark for Event Recognition in Aerial Videos
Lichao Mou
Yuansheng Hua
P. Jin
Xiaoxiang Zhu
AI4TS
15
44
0
30 Jan 2020
EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video
Jennifer J. Sun
Ting Liu
Alan S. Cowen
Florian Schroff
Hartwig Adam
Gautam Prasad
9
7
0
15 Jan 2020
Few-shot Action Recognition with Permutation-invariant Attention
Hongguang Zhang
Li Zhang
Xiaojuan Qi
Hongdong Li
Philip H. S. Torr
Piotr Koniusz
6
3
0
12 Jan 2020
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong
S. Schwarcz
Peng-Tao Xu
Davide D'Ambrosio
Juhana Kangaspunta
A. Angelova
Huong Phan
Navdeep Jaitly
6
7
0
13 Dec 2019
Identity Preserve Transform: Understand What Activity Classification Models Have Learnt
Jialing Lyu
Weichao Qiu
Xinyue Wei
Yi Zhang
Alan Yuille
Zhengjun Zha
VLM
14
3
0
13 Dec 2019
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
16
9
0
09 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
21
126
0
02 Dec 2019
Learning Efficient Video Representation with Video Shuffle Networks
Pingchuan Ma
Yao Zhou
Yu Lu
Wayne Zhang
11
7
0
26 Nov 2019
Oops! Predicting Unintentional Action in Video
Dave Epstein
Boyuan Chen
Carl Vondrick
11
99
0
25 Nov 2019
Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition
Yang Liu
Zhaoyang Lu
Jing Li
Tao Yang
Chao Yao
8
51
0
25 Nov 2019
Cross-Class Relevance Learning for Temporal Concept Localization
Junwei Ma
S. Gorti
M. Volkovs
I. Stanevich
Guangwei Yu
15
7
0
19 Nov 2019
Cross-modal supervised learning for better acoustic representations
Shaoyong Jia
Xin Shu
Yang Yang
Dawei Liang
Qiyue Liu
Junhui Liu
SSL
DRL
AI4TS
17
2
0
15 Nov 2019
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding
Mathew Monfort
Bowen Pan
K. Ramakrishnan
A. Andonian
Barry A. McNamara
A. Lascelles
Quanfu Fan
Dan Gutfreund
Rogerio Feris
A. Oliva
VLM
6
68
0
01 Nov 2019
Transformer-based Cascaded Multimodal Speech Translation
Zixiu "Alex" Wu
Ozan Caglayan
Julia Ive
Josiah Wang
Lucia Specia
25
7
0
29 Oct 2019
Predictive Coding Networks Meet Action Recognition
Xia Huang
Hossein Mousavi
Gemma Roig
6
1
0
22 Oct 2019
Adaptive and Iteratively Improving Recurrent Lateral Connections
Barak Battash
Lior Wolf
9
2
0
16 Oct 2019
Tiny Video Networks
A. Piergiovanni
A. Angelova
Michael S. Ryoo
20
46
0
15 Oct 2019
Retro-Actions: Learning 'Close' by Time-Reversing 'Open' Videos
Will Price
Dima Damen
13
6
0
20 Sep 2019
Explainable Deep Learning for Video Recognition Tasks: A Framework & Recommendations
Liam Hiley
Alun D. Preece
Y. Hicks
XAI
6
15
0
07 Sep 2019
Discriminative Video Representation Learning Using Support Vector Classifiers
Jue Wang
A. Cherian
20
5
0
05 Sep 2019
Out the Window: A Crowd-Sourced Dataset for Activity Classification in Security Video
Greg Castañón
N. Shnidman
Tim Anderson
J. Byrne
14
1
0
28 Aug 2019
Three Branches: Detecting Actions With Richer Features
Jinchao Xia
Jiajun Tang
Cewu Lu
14
8
0
13 Aug 2019
Predicting Actions to Help Predict Translations
Zixiu "Alex" Wu
Julia Ive
Josiah Wang
Pranava Madhyastha
Lucia Specia
9
7
0
05 Aug 2019
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
Sebastian Agethen
Winston H. Hsu
HAI
17
25
0
30 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
27
75
0
19 Jul 2019
Unsupervised predictive coding models may explain visual brain representation
Marcio Fonseca
MedIm
SSL
11
0
0
30 Jun 2019
GANalyze: Toward Visual Definitions of Cognitive Image Properties
L. Goetschalckx
A. Andonian
A. Oliva
Phillip Isola
FAtt
GAN
4
313
0
24 Jun 2019
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical Bayes
R. Krishnan
Mahesh Subedar
Omesh Tickoo
BDL
15
46
0
12 Jun 2019
Identifying Visible Actions in Lifestyle Vlogs
Oana Ignat
Laura Burdick
Jia Deng
Rada Mihalcea
10
14
0
10 Jun 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures
Michael S. Ryoo
A. Piergiovanni
Mingxing Tan
A. Angelova
12
102
0
30 May 2019
Unsupervised Learning from Video with Deep Neural Embeddings
Chengxu Zhuang
Tianwei She
A. Andonian
Max Sobol Mark
Daniel L. K. Yamins
SSL
4
55
0
28 May 2019
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
Noureldien Hussein
E. Gavves
A. Smeulders
12
77
0
13 May 2019
AI Enabling Technologies: A Survey
V. Gadepally
Justin A. Goodwin
J. Kepner
Albert Reuther
Hayley Reynolds
S. Samsi
Jonathan Su
David Martinez
19
24
0
08 May 2019
Representation Similarity Analysis for Efficient Task taxonomy & Transfer Learning
Kshitij Dwivedi
Gemma Roig
9
149
0
26 Apr 2019
Early Detection of Injuries in MLB Pitchers from Video
A. Piergiovanni
Michael S. Ryoo
MedIm
9
11
0
18 Apr 2019
Relational Action Forecasting
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Rahul Sukthankar
Kevin Patrick Murphy
Cordelia Schmid
28
79
0
08 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM
SSL
6
1,232
0
03 Apr 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
C. Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
8
82
0
04 Mar 2019
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
Longlong Jing
Yingli Tian
SSL
8
1,686
0
16 Feb 2019
Coupled Recurrent Network (CRN)
Lin Sun
K. Jia
Yuejia Shen
Silvio Savarese
Dit-Yan Yeung
Bertram E. Shi
21
4
0
25 Dec 2018
Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction
Longlong Jing
Xiaodong Yang
Jingen Liu
Yingli Tian
12
154
0
28 Nov 2018