ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.03150
  4. Cited By
Moments in Time Dataset: one million videos for event understanding

Moments in Time Dataset: one million videos for event understanding

9 January 2018
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
Tom Yan
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
ArXivPDFHTML

Papers citing "Moments in Time Dataset: one million videos for event understanding"

50 / 268 papers shown
Title
FineGym: A Hierarchical Video Dataset for Fine-grained Action
  Understanding
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
9
319
0
14 Apr 2020
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Anyi Rao
Linning Xu
Yu Xiong
Guodong Xu
Qingqiu Huang
Bolei Zhou
Dahua Lin
6
111
0
06 Apr 2020
Speech2Action: Cross-modal Supervision for Action Recognition
Speech2Action: Cross-modal Supervision for Action Recognition
Arsha Nagrani
Chen Sun
David A. Ross
Rahul Sukthankar
Cordelia Schmid
Andrew Zisserman
25
54
0
30 Mar 2020
Learning Interactions and Relationships between Movie Characters
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
36
51
0
29 Mar 2020
Omni-sourced Webly-supervised Learning for Video Recognition
Omni-sourced Webly-supervised Learning for Video Recognition
Haodong Duan
Yue Zhao
Yuanjun Xiong
Wentao Liu
Dahua Lin
VLM
8
88
0
29 Mar 2020
Self-Supervised Discovering of Interpretable Features for Reinforcement
  Learning
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Zhuoyuan Wang
Tingyu Lin
Cheng Wu
SSL
28
18
0
16 Mar 2020
Top-1 Solution of Multi-Moments in Time Challenge 2019
Top-1 Solution of Multi-Moments in Time Challenge 2019
Manyuan Zhang
Hao Shao
Guanglu Song
Yu Liu
Junjie Yan
17
3
0
12 Mar 2020
Evolving Losses for Unsupervised Video Representation Learning
Evolving Losses for Unsupervised Video Representation Learning
A. Piergiovanni
A. Angelova
Michael S. Ryoo
SSL
6
138
0
26 Feb 2020
Bottom-Up Temporal Action Localization with Mutual Regularization
Bottom-Up Temporal Action Localization with Mutual Regularization
Peisen Zhao
Lingxi Xie
Chen Ju
Ya-Qin Zhang
Yanfeng Wang
Qi Tian
10
1
0
18 Feb 2020
ERA: A Dataset and Deep Learning Benchmark for Event Recognition in
  Aerial Videos
ERA: A Dataset and Deep Learning Benchmark for Event Recognition in Aerial Videos
Lichao Mou
Yuansheng Hua
P. Jin
Xiaoxiang Zhu
AI4TS
15
44
0
30 Jan 2020
EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video
EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video
Jennifer J. Sun
Ting Liu
Alan S. Cowen
Florian Schroff
Hartwig Adam
Gautam Prasad
9
7
0
15 Jan 2020
Few-shot Action Recognition with Permutation-invariant Attention
Few-shot Action Recognition with Permutation-invariant Attention
Hongguang Zhang
Li Zhang
Xiaojuan Qi
Hongdong Li
Philip H. S. Torr
Piotr Koniusz
6
3
0
12 Jan 2020
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and
  Action Recognition in Ping Pong
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong
S. Schwarcz
Peng-Tao Xu
Davide D‘Ambrosio
Juhana Kangaspunta
A. Angelova
Huong Phan
Navdeep Jaitly
6
7
0
13 Dec 2019
Identity Preserve Transform: Understand What Activity Classification
  Models Have Learnt
Identity Preserve Transform: Understand What Activity Classification Models Have Learnt
Jialing Lyu
Weichao Qiu
Xinyue Wei
Yi Zhang
Alan Yuille
Zhengjun Zha
VLM
14
3
0
13 Dec 2019
Video action detection by learning graph-based spatio-temporal
  interactions
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
16
9
0
09 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little
  Network and Depthwise Temporal Aggregation
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
21
126
0
02 Dec 2019
Learning Efficient Video Representation with Video Shuffle Networks
Learning Efficient Video Representation with Video Shuffle Networks
Pingchuan Ma
Yao Zhou
Yu Lu
Wayne Zhang
11
7
0
26 Nov 2019
Oops! Predicting Unintentional Action in Video
Oops! Predicting Unintentional Action in Video
Dave Epstein
Boyuan Chen
Carl Vondrick
11
99
0
25 Nov 2019
Deep Image-to-Video Adaptation and Fusion Networks for Action
  Recognition
Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition
Yang Liu
Zhaoyang Lu
Jing Li
Tao Yang
Chao Yao
8
51
0
25 Nov 2019
Cross-Class Relevance Learning for Temporal Concept Localization
Cross-Class Relevance Learning for Temporal Concept Localization
Junwei Ma
S. Gorti
M. Volkovs
I. Stanevich
Guangwei Yu
15
7
0
19 Nov 2019
Cross-modal supervised learning for better acoustic representations
Cross-modal supervised learning for better acoustic representations
Shaoyong Jia
Xin Shu
Yang Yang
Dawei Liang
Qiyue Liu
Junhui Liu
SSL
DRL
AI4TS
17
2
0
15 Nov 2019
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action
  Video Understanding
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding
Mathew Monfort
Bowen Pan
K. Ramakrishnan
A. Andonian
Barry A. McNamara
A. Lascelles
Quanfu Fan
Dan Gutfreund
Rogerio Feris
A. Oliva
VLM
6
68
0
01 Nov 2019
Transformer-based Cascaded Multimodal Speech Translation
Transformer-based Cascaded Multimodal Speech Translation
Zixiu "Alex" Wu
Ozan Caglayan
Julia Ive
Josiah Wang
Lucia Specia
25
7
0
29 Oct 2019
Predictive Coding Networks Meet Action Recognition
Predictive Coding Networks Meet Action Recognition
Xia Huang
Hossein Mousavi
Gemma Roig
6
1
0
22 Oct 2019
Adaptive and Iteratively Improving Recurrent Lateral Connections
Adaptive and Iteratively Improving Recurrent Lateral Connections
Barak Battash
Lior Wolf
9
2
0
16 Oct 2019
Tiny Video Networks
Tiny Video Networks
A. Piergiovanni
A. Angelova
Michael S. Ryoo
20
46
0
15 Oct 2019
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Will Price
Dima Damen
13
6
0
20 Sep 2019
Explainable Deep Learning for Video Recognition Tasks: A Framework &
  Recommendations
Explainable Deep Learning for Video Recognition Tasks: A Framework & Recommendations
Liam Hiley
Alun D. Preece
Y. Hicks
XAI
6
15
0
07 Sep 2019
Discriminative Video Representation Learning Using Support Vector
  Classifiers
Discriminative Video Representation Learning Using Support Vector Classifiers
Jue Wang
A. Cherian
20
5
0
05 Sep 2019
Out the Window: A Crowd-Sourced Dataset for Activity Classification in
  Security Video
Out the Window: A Crowd-Sourced Dataset for Activity Classification in Security Video
Greg Castañón
N. Shnidman
Tim Anderson
J. Byrne
14
1
0
28 Aug 2019
Three Branches: Detecting Actions With Richer Features
Three Branches: Detecting Actions With Richer Features
Jinchao Xia
Jiajun Tang
Cewu Lu
14
8
0
13 Aug 2019
Predicting Actions to Help Predict Translations
Predicting Actions to Help Predict Translations
Zixiu "Alex" Wu
Julia Ive
Josiah Wang
Pranava Madhyastha
Lucia Specia
9
7
0
05 Aug 2019
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based
  Mechanism for Videos
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
Sebastian Agethen
Winston H. Hsu
HAI
17
25
0
30 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
27
75
0
19 Jul 2019
Unsupervised predictive coding models may explain visual brain
  representation
Unsupervised predictive coding models may explain visual brain representation
Marcio Fonseca
MedIm
SSL
11
0
0
30 Jun 2019
GANalyze: Toward Visual Definitions of Cognitive Image Properties
GANalyze: Toward Visual Definitions of Cognitive Image Properties
L. Goetschalckx
A. Andonian
A. Oliva
Phillip Isola
FAtt
GAN
4
313
0
24 Jun 2019
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical
  Bayes
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical Bayes
R. Krishnan
Mahesh Subedar
Omesh Tickoo
BDL
15
46
0
12 Jun 2019
Identifying Visible Actions in Lifestyle Vlogs
Identifying Visible Actions in Lifestyle Vlogs
Oana Ignat
Laura Burdick
Jia Deng
Rada Mihalcea
10
14
0
10 Jun 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video
  Architectures
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures
Michael S. Ryoo
A. Piergiovanni
Mingxing Tan
A. Angelova
12
102
0
30 May 2019
Unsupervised Learning from Video with Deep Neural Embeddings
Unsupervised Learning from Video with Deep Neural Embeddings
Chengxu Zhuang
Tianwei She
A. Andonian
Max Sobol Mark
Daniel L. K. Yamins
SSL
4
55
0
28 May 2019
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
Noureldien Hussein
E. Gavves
A. Smeulders
12
77
0
13 May 2019
AI Enabling Technologies: A Survey
AI Enabling Technologies: A Survey
V. Gadepally
Justin A. Goodwin
J. Kepner
Albert Reuther
Hayley Reynolds
S. Samsi
Jonathan Su
David Martinez
19
24
0
08 May 2019
Representation Similarity Analysis for Efficient Task taxonomy &
  Transfer Learning
Representation Similarity Analysis for Efficient Task taxonomy & Transfer Learning
Kshitij Dwivedi
Gemma Roig
9
149
0
26 Apr 2019
Early Detection of Injuries in MLB Pitchers from Video
Early Detection of Injuries in MLB Pitchers from Video
A. Piergiovanni
Michael S. Ryoo
MedIm
9
11
0
18 Apr 2019
Relational Action Forecasting
Relational Action Forecasting
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Rahul Sukthankar
Kevin Patrick Murphy
Cordelia Schmid
28
79
0
08 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM
SSL
6
1,232
0
03 Apr 2019
Collaborative Spatio-temporal Feature Learning for Video Action
  Recognition
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
C. Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
8
82
0
04 Mar 2019
Self-supervised Visual Feature Learning with Deep Neural Networks: A
  Survey
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
Longlong Jing
Yingli Tian
SSL
8
1,686
0
16 Feb 2019
Coupled Recurrent Network (CRN)
Coupled Recurrent Network (CRN)
Lin Sun
K. Jia
Yuejia Shen
Silvio Savarese
Dit-Yan Yeung
Bertram E. Shi
21
4
0
25 Dec 2018
Self-Supervised Spatiotemporal Feature Learning via Video Rotation
  Prediction
Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction
Longlong Jing
Xiaodong Yang
Jingen Liu
Yingli Tian
12
154
0
28 Nov 2018
Previous
123456
Next