Moments in Time Dataset: one million videos for event understanding

9 January 2018

Carl Vondrick

Papers citing "Moments in Time Dataset: one million videos for event understanding"

50 / 268 papers shown

Title
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding Dian Shao Yue Zhao Bo Dai Dahua Lin 9 319 0 14 Apr 2020
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation Anyi Rao Linning Xu Yu Xiong Guodong Xu Qingqiu Huang Bolei Zhou Dahua Lin 6 111 0 06 Apr 2020
Speech2Action: Cross-modal Supervision for Action Recognition Arsha Nagrani Chen Sun David A. Ross Rahul Sukthankar Cordelia Schmid Andrew Zisserman 25 54 0 30 Mar 2020
Learning Interactions and Relationships between Movie Characters Anna Kukleva Makarand Tapaswi Ivan Laptev 36 51 0 29 Mar 2020
Omni-sourced Webly-supervised Learning for Video Recognition Haodong Duan Yue Zhao Yuanjun Xiong Wentao Liu Dahua Lin VLM 8 88 0 29 Mar 2020
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning Wenjie Shi Gao Huang Shiji Song Zhuoyuan Wang Tingyu Lin Cheng Wu SSL 28 18 0 16 Mar 2020
Top-1 Solution of Multi-Moments in Time Challenge 2019 Manyuan Zhang Hao Shao Guanglu Song Yu Liu Junjie Yan 17 3 0 12 Mar 2020
Evolving Losses for Unsupervised Video Representation Learning A. Piergiovanni A. Angelova Michael S. Ryoo SSL 6 138 0 26 Feb 2020
Bottom-Up Temporal Action Localization with Mutual Regularization Peisen Zhao Lingxi Xie Chen Ju Ya-Qin Zhang Yanfeng Wang Qi Tian 10 1 0 18 Feb 2020
ERA: A Dataset and Deep Learning Benchmark for Event Recognition in Aerial Videos Lichao Mou Yuansheng Hua P. Jin Xiaoxiang Zhu AI4TS 15 44 0 30 Jan 2020
EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video Jennifer J. Sun Ting Liu Alan S. Cowen Florian Schroff Hartwig Adam Gautam Prasad 9 7 0 15 Jan 2020
Few-shot Action Recognition with Permutation-invariant Attention Hongguang Zhang Li Zhang Xiaojuan Qi Hongdong Li Philip H. S. Torr Piotr Koniusz 6 3 0 12 Jan 2020
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong S. Schwarcz Peng-Tao Xu Davide D'Ambrosio Juhana Kangaspunta A. Angelova Huong Phan Navdeep Jaitly 6 7 0 13 Dec 2019
Identity Preserve Transform: Understand What Activity Classification Models Have Learnt Jialing Lyu Weichao Qiu Xinyue Wei Yi Zhang Alan Yuille Zhengjun Zha VLM 14 3 0 13 Dec 2019
Video action detection by learning graph-based spatio-temporal interactions Matteo Tomei Lorenzo Baraldi Simone Calderara Simone Bronzin Rita Cucchiara 16 9 0 09 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation Quanfu Fan Chun-Fu Chen Hilde Kuehne Marco Pistoia David D. Cox 21 126 0 02 Dec 2019
Learning Efficient Video Representation with Video Shuffle Networks Pingchuan Ma Yao Zhou Yu Lu Wayne Zhang 11 7 0 26 Nov 2019
Oops! Predicting Unintentional Action in Video Dave Epstein Boyuan Chen Carl Vondrick 11 99 0 25 Nov 2019
Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition Yang Liu Zhaoyang Lu Jing Li Tao Yang Chao Yao 8 51 0 25 Nov 2019
Cross-Class Relevance Learning for Temporal Concept Localization Junwei Ma S. Gorti M. Volkovs I. Stanevich Guangwei Yu 15 7 0 19 Nov 2019
Cross-modal supervised learning for better acoustic representations Shaoyong Jia Xin Shu Yang Yang Dawei Liang Qiyue Liu Junhui Liu SSL DRL AI4TS 17 2 0 15 Nov 2019
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding Mathew Monfort Bowen Pan K. Ramakrishnan A. Andonian Barry A. McNamara A. Lascelles Quanfu Fan Dan Gutfreund Rogerio Feris A. Oliva VLM 6 68 0 01 Nov 2019
Transformer-based Cascaded Multimodal Speech Translation Zixiu "Alex" Wu Ozan Caglayan Julia Ive Josiah Wang Lucia Specia 25 7 0 29 Oct 2019
Predictive Coding Networks Meet Action Recognition Xia Huang Hossein Mousavi Gemma Roig 6 1 0 22 Oct 2019
Adaptive and Iteratively Improving Recurrent Lateral Connections Barak Battash Lior Wolf 9 2 0 16 Oct 2019
Tiny Video Networks A. Piergiovanni A. Angelova Michael S. Ryoo 20 46 0 15 Oct 2019
Retro-Actions: Learning 'Close' by Time-Reversing 'Open' Videos Will Price Dima Damen 13 6 0 20 Sep 2019
Explainable Deep Learning for Video Recognition Tasks: A Framework & Recommendations Liam Hiley Alun D. Preece Y. Hicks XAI 6 15 0 07 Sep 2019
Discriminative Video Representation Learning Using Support Vector Classifiers Jue Wang A. Cherian 20 5 0 05 Sep 2019
Out the Window: A Crowd-Sourced Dataset for Activity Classification in Security Video Greg Castañón N. Shnidman Tim Anderson J. Byrne 14 1 0 28 Aug 2019
Three Branches: Detecting Actions With Richer Features Jinchao Xia Jiajun Tang Cewu Lu 14 8 0 13 Aug 2019
Predicting Actions to Help Predict Translations Zixiu "Alex" Wu Julia Ive Josiah Wang Pranava Madhyastha Lucia Specia 9 7 0 05 Aug 2019
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos Sebastian Agethen Winston H. Hsu HAI 17 25 0 30 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling Laura Sevilla-Lara Shengxin Cindy Zha Zhicheng Yan Vedanuj Goswami Matt Feiszli Lorenzo Torresani 27 75 0 19 Jul 2019
Unsupervised predictive coding models may explain visual brain representation Marcio Fonseca MedIm SSL 11 0 0 30 Jun 2019
GANalyze: Toward Visual Definitions of Cognitive Image Properties L. Goetschalckx A. Andonian A. Oliva Phillip Isola FAtt GAN 4 313 0 24 Jun 2019
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical Bayes R. Krishnan Mahesh Subedar Omesh Tickoo BDL 15 46 0 12 Jun 2019
Identifying Visible Actions in Lifestyle Vlogs Oana Ignat Laura Burdick Jia Deng Rada Mihalcea 10 14 0 10 Jun 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures Michael S. Ryoo A. Piergiovanni Mingxing Tan A. Angelova 12 102 0 30 May 2019
Unsupervised Learning from Video with Deep Neural Embeddings Chengxu Zhuang Tianwei She A. Andonian Max Sobol Mark Daniel L. K. Yamins SSL 4 55 0 28 May 2019
VideoGraph: Recognizing Minutes-Long Human Activities in Videos Noureldien Hussein E. Gavves A. Smeulders 12 77 0 13 May 2019
AI Enabling Technologies: A Survey V. Gadepally Justin A. Goodwin J. Kepner Albert Reuther Hayley Reynolds S. Samsi Jonathan Su David Martinez 19 24 0 08 May 2019
Representation Similarity Analysis for Efficient Task taxonomy & Transfer Learning Kshitij Dwivedi Gemma Roig 9 149 0 26 Apr 2019
Early Detection of Injuries in MLB Pitchers from Video A. Piergiovanni Michael S. Ryoo MedIm 9 11 0 18 Apr 2019
Relational Action Forecasting Chen Sun Abhinav Shrivastava Carl Vondrick Rahul Sukthankar Kevin Patrick Murphy Cordelia Schmid 28 79 0 08 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning Chen Sun Austin Myers Carl Vondrick Kevin Patrick Murphy Cordelia Schmid VLM SSL 6 1,232 0 03 Apr 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition C. Li Qiaoyong Zhong Di Xie Shiliang Pu 8 82 0 04 Mar 2019
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey Longlong Jing Yingli Tian SSL 8 1,686 0 16 Feb 2019
Coupled Recurrent Network (CRN) Lin Sun K. Jia Yuejia Shen Silvio Savarese Dit-Yan Yeung Bertram E. Shi 21 4 0 25 Dec 2018
Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction Longlong Jing Xiaodong Yang Jingen Liu Yingli Tian 12 154 0 28 Nov 2018