Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.08675
Cited By
YouTube-8M: A Large-Scale Video Classification Benchmark
27 September 2016
Sami Abu-El-Haija
Nisarg Kothari
Joonseok Lee
Apostol Natsev
G. Toderici
Balakrishnan Varadarajan
Sudheendra Vijayanarasimhan
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"YouTube-8M: A Large-Scale Video Classification Benchmark"
50 / 170 papers shown
Title
TinyVIRAT: Low-resolution Video Action Recognition
Ugur Demir
Y. S. Rawat
M. Shah
25
36
0
14 Jul 2020
AViD Dataset: Anonymized Videos from Diverse Countries
A. Piergiovanni
Michael S. Ryoo
25
35
0
10 Jul 2020
Self-Supervised MultiModal Versatile Networks
Jean-Baptiste Alayrac
Adrià Recasens
R. Schneider
Relja Arandjelović
Jason Ramapuram
J. Fauw
Lucas Smaira
Sander Dieleman
Andrew Zisserman
SSL
40
371
0
29 Jun 2020
Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation
Liang-Chieh Chen
Raphael Gontijo-Lopes
Bowen Cheng
Maxwell D. Collins
E. D. Cubuk
Barret Zoph
Hartwig Adam
Jonathon Shlens
23
76
0
20 May 2020
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Hirokatsu Kataoka
Tenga Wakamiya
Kensho Hara
Y. Satoh
3DPC
20
87
0
10 Apr 2020
Gradient Centralization: A New Optimization Technique for Deep Neural Networks
Hongwei Yong
Jianqiang Huang
Xiansheng Hua
Lei Zhang
ODL
13
184
0
03 Apr 2020
M2m: Imbalanced Classification via Major-to-minor Translation
Jaehyung Kim
Jongheon Jeong
Jinwoo Shin
13
220
0
01 Apr 2020
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
36
51
0
29 Mar 2020
Watching the World Go By: Representation Learning from Unlabeled Videos
Daniel Gordon
Kiana Ehsani
D. Fox
Ali Farhadi
SSL
AI4TS
16
87
0
18 Mar 2020
Automatic Shortcut Removal for Self-Supervised Representation Learning
Matthias Minderer
Olivier Bachem
N. Houlsby
Michael Tschannen
SSL
8
73
0
20 Feb 2020
Deep Audio-Visual Learning: A Survey
Hao Zhu
Mandi Luo
Rui Wang
A. Zheng
R. He
29
156
0
14 Jan 2020
Neural Data Server: A Large-Scale Search Engine for Transfer Learning Data
Xi Yan
David Acuna
Sanja Fidler
24
42
0
09 Jan 2020
Multi-attention Networks for Temporal Localization of Video-level Labels
Lijun Zhang
Srinath Nizampatnam
Ahana Gangopadhyay
Marcos V. Conde
17
7
0
15 Nov 2019
Moviescope: Large-scale Analysis of Movies using Multiple Modalities
Paola Cascante-Bonilla
Kalpathy Sitaraman
Mengjia Luo
Vicente Ordonez
22
39
0
08 Aug 2019
Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
14
19
0
22 Jun 2019
Two-Stream Region Convolutional 3D Network for Temporal Activity Detection
Huijuan Xu
Abir Das
Kate Saenko
3DPC
4
46
0
05 Jun 2019
Hallucinating Optical Flow Features for Video Classification
Yongyi Tang
Lin Ma
Lianqiang Zhou
11
19
0
28 May 2019
A Compressive Sensing Video dataset using Pixel-wise coded exposure
Sathyaprakash Narayanan
Y. Bethi
Chetan Singh Thakur
11
4
0
24 May 2019
Decentralized Learning of Generative Adversarial Networks from Non-iid Data
Ryo Yonetani
Tomohiro Takahashi
Atsushi Hashimoto
Yoshitaka Ushiku
37
24
0
23 May 2019
Large Scale Holistic Video Understanding
Ali Diba
Mohsen Fayyaz
Vivek Sharma
Manohar Paluri
Jurgen Gall
Rainer Stiefelhagen
Luc Van Gool
24
35
0
25 Apr 2019
Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN
Ya-Liang Chang
Zhe-Yu Liu
Kuan-Ying Lee
Winston H. Hsu
DiffM
10
172
0
23 Apr 2019
Self-Supervised Learning via Conditional Motion Propagation
Xiaohang Zhan
Xingang Pan
Ziwei Liu
Dahua Lin
Chen Change Loy
SSL
34
47
0
27 Mar 2019
Less is More: Learning Highlight Detection from Video Duration
Bo Xiong
Yannis Kalantidis
Deepti Ghadiyaram
Kristen Grauman
6
108
0
03 Mar 2019
Efficient Video Classification Using Fewer Frames
S. Bhardwaj
Mukundhan Srinivasan
Mitesh M. Khapra
33
88
0
27 Feb 2019
Single-frame Regularization for Temporally Stable CNNs
Gabriel Eilertsen
Rafał K. Mantiuk
Jonas Unger
11
43
0
27 Feb 2019
Understanding and Training Deep Diagonal Circulant Neural Networks
Alexandre Araujo
Benjamin Négrevergne
Y. Chevaleyre
Jamal Atif
19
4
0
29 Jan 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
19
54
0
26 Jan 2019
Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset
Arpan Gupta
S. Muthiah
19
6
0
10 Jan 2019
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
28
702
0
06 Dec 2018
Non-local NetVLAD Encoding for Video Classification
Yongyi Tang
Xing Zhang
Jingwen Wang
Shaoxiang Chen
Lin Ma
Yu-Gang Jiang
11
41
0
29 Sep 2018
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification
Jinlai Liu
Zehuan Yuan
Changhu Wang
16
9
0
16 Sep 2018
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark
N. Xu
L. Yang
Yuchen Fan
Dingcheng Yue
Yuchen Liang
Jianchao Yang
Thomas Huang
VOS
11
521
0
06 Sep 2018
ARBEE: Towards Automated Recognition of Bodily Expression of Emotion In the Wild
Yu Luo
Jianbo Ye
Reginald B. Adams
Jia Li
M. Newman
J. Z. Wang
48
86
0
28 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
67
79
0
03 Aug 2018
Competitive Analysis System for Theatrical Movie Releases Based on Movie Trailer Deep Video Representation
Miguel Campo
C. Hsieh
Matt Nickens
J. J. Espinoza
Abhinav Taliyan
J. Rieger
Jean Ho
Bettina Sherick
HAI
13
8
0
12 Jul 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
18
181
0
19 Jun 2018
Mining for meaning: from vision to language through multiple networks consensus
Iulia Duta
Andrei Liviu Nicolicioiu
Simion-Vlad Bogolin
Marius Leordeanu
13
3
0
05 Jun 2018
Gradient-Leaks: Understanding and Controlling Deanonymization in Federated Learning
Tribhuvanesh Orekondy
Seong Joon Oh
Yang Zhang
Bernt Schiele
Mario Fritz
PICV
FedML
334
37
0
15 May 2018
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
F. I. F. Richard Yu
Haofeng Chen
Xin Wang
Wenqi Xian
Yingying Chen
Fangchen Liu
Vashisht Madhavan
Trevor Darrell
VLM
40
2,090
0
12 May 2018
I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames
S. Bhardwaj
Mitesh M. Khapra
18
3
0
12 May 2018
Weakly-supervised Visual Instrument-playing Action Detection in Videos
Jen-Yu Liu
Yi-Hsuan Yang
Shyh-Kang Jeng
21
13
0
05 May 2018
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos
Silvio Giancola
Mohieddine Amine
Tarek Dghaily
Bernard Ghanem
AI4TS
19
193
0
12 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Dima Damen
Hazel Doughty
G. Farinella
Sanja Fidler
Antonino Furnari
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
25
995
0
08 Apr 2018
FaceForensics: A Large-scale Video Dataset for Forgery Detection in Human Faces
Andreas Rossler
D. Cozzolino
L. Verdoliva
Christian Riess
Justus Thies
Matthias Nießner
PICV
AAML
CVBM
8
375
0
24 Mar 2018
Towards Universal Representation for Unseen Action Recognition
Yi Zhu
Yang Long
Yu Guan
Shawn D. Newsam
Ling Shao
AI4TS
17
100
0
22 Mar 2018
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan
Wuwei Lin
Xiaolin Fang
Chaoqin Huang
Bolei Zhou
Cewu Lu
ObjD
20
33
0
27 Feb 2018
Fine-Grained Land Use Classification at the City Scale Using Ground-Level Images
Yi Zhu
XueQing Deng
Shawn D. Newsam
26
51
0
07 Feb 2018
DeepType: Multilingual Entity Linking by Neural Type System Evolution
Jonathan Raiman
O. Raiman
BDL
HAI
117
183
0
03 Feb 2018
Moments in Time Dataset: one million videos for event understanding
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
...
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
45
538
0
09 Jan 2018
Cross-modal Embeddings for Video and Audio Retrieval
Dídac Surís
A. Duarte
Amaia Salvador
Jordi Torres
Xavier Giró-i-Nieto
SSL
16
69
0
07 Jan 2018
Previous
1
2
3
4
Next