YouTube-8M: A Large-Scale Video Classification Benchmark

27 September 2016

Joonseok Lee

Balakrishnan Varadarajan

Sudheendra Vijayanarasimhan

VLM

Papers citing "YouTube-8M: A Large-Scale Video Classification Benchmark"

50 / 170 papers shown

TinyVIRAT: Low-resolution Video Action Recognition Ugur Demir Y. S. Rawat M. Shah 25 36 0 14 Jul 2020
AViD Dataset: Anonymized Videos from Diverse Countries A. Piergiovanni Michael S. Ryoo 25 35 0 10 Jul 2020
Self-Supervised MultiModal Versatile Networks Jean-Baptiste Alayrac Adrià Recasens R. Schneider Relja Arandjelović Jason Ramapuram J. Fauw Lucas Smaira Sander Dieleman Andrew Zisserman SSL 40 371 0 29 Jun 2020
Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation Liang-Chieh Chen Raphael Gontijo-Lopes Bowen Cheng Maxwell D. Collins E. D. Cubuk Barret Zoph Hartwig Adam Jonathon Shlens 23 76 0 20 May 2020
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs? Hirokatsu Kataoka Tenga Wakamiya Kensho Hara Y. Satoh 3DPC 20 87 0 10 Apr 2020
Gradient Centralization: A New Optimization Technique for Deep Neural Networks Hongwei Yong Jianqiang Huang Xiansheng Hua Lei Zhang ODL 13 184 0 03 Apr 2020
M2m: Imbalanced Classification via Major-to-minor Translation Jaehyung Kim Jongheon Jeong Jinwoo Shin 13 220 0 01 Apr 2020
Learning Interactions and Relationships between Movie Characters Anna Kukleva Makarand Tapaswi Ivan Laptev 36 51 0 29 Mar 2020
Watching the World Go By: Representation Learning from Unlabeled Videos Daniel Gordon Kiana Ehsani D. Fox Ali Farhadi SSL AI4TS 16 87 0 18 Mar 2020
Automatic Shortcut Removal for Self-Supervised Representation Learning Matthias Minderer Olivier Bachem N. Houlsby Michael Tschannen SSL 8 73 0 20 Feb 2020
Deep Audio-Visual Learning: A Survey Hao Zhu Mandi Luo Rui Wang A. Zheng R. He 29 156 0 14 Jan 2020
Neural Data Server: A Large-Scale Search Engine for Transfer Learning Data Xi Yan David Acuna Sanja Fidler 24 42 0 09 Jan 2020
Multi-attention Networks for Temporal Localization of Video-level Labels Lijun Zhang Srinath Nizampatnam Ahana Gangopadhyay Marcos V. Conde 17 7 0 15 Nov 2019
Moviescope: Large-scale Analysis of Movies using Multiple Modalities Paola Cascante-Bonilla Kalpathy Sitaraman Mengjia Luo Vicente Ordonez 22 39 0 08 Aug 2019
Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019 Xiaohan Wang Yu Wu Linchao Zhu Yi Yang 14 19 0 22 Jun 2019
Two-Stream Region Convolutional 3D Network for Temporal Activity Detection Huijuan Xu Abir Das Kate Saenko 3DPC 4 46 0 05 Jun 2019
Hallucinating Optical Flow Features for Video Classification Yongyi Tang Lin Ma Lianqiang Zhou 11 19 0 28 May 2019
A Compressive Sensing Video dataset using Pixel-wise coded exposure Sathyaprakash Narayanan Y. Bethi Chetan Singh Thakur 11 4 0 24 May 2019
Decentralized Learning of Generative Adversarial Networks from Non-iid Data Ryo Yonetani Tomohiro Takahashi Atsushi Hashimoto Yoshitaka Ushiku 37 24 0 23 May 2019
Large Scale Holistic Video Understanding Ali Diba Mohsen Fayyaz Vivek Sharma Manohar Paluri Jurgen Gall Rainer Stiefelhagen Luc Van Gool 24 35 0 25 Apr 2019
Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN Ya-Liang Chang Zhe-Yu Liu Kuan-Ying Lee Winston H. Hsu DiffM 10 172 0 23 Apr 2019
Self-Supervised Learning via Conditional Motion Propagation Xiaohang Zhan Xingang Pan Ziwei Liu Dahua Lin Chen Change Loy SSL 34 47 0 27 Mar 2019
Less is More: Learning Highlight Detection from Video Duration Bo Xiong Yannis Kalantidis Deepti Ghadiyaram Kristen Grauman 6 108 0 03 Mar 2019
Efficient Video Classification Using Fewer Frames S. Bhardwaj Mukundhan Srinivasan Mitesh M. Khapra 33 88 0 27 Feb 2019
Single-frame Regularization for Temporally Stable CNNs Gabriel Eilertsen Rafał K. Mantiuk Jonas Unger 11 43 0 27 Feb 2019
Understanding and Training Deep Diagonal Circulant Neural Networks Alexandre Araujo Benjamin Négrevergne Y. Chevaleyre Jamal Atif 19 4 0 29 Jan 2019
DistInit: Learning Video Representations Without a Single Labeled Video Rohit Girdhar Du Tran Lorenzo Torresani Deva Ramanan 19 54 0 26 Jan 2019
Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset Arpan Gupta S. Muthiah 19 6 0 10 Jan 2019
Video Action Transformer Network Rohit Girdhar João Carreira Carl Doersch Andrew Zisserman ViT 28 702 0 06 Dec 2018
Non-local NetVLAD Encoding for Video Classification Yongyi Tang Xing Zhang Jingwen Wang Shaoxiang Chen Lin Ma Yu-Gang Jiang 11 41 0 29 Sep 2018
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification Jinlai Liu Zehuan Yuan Changhu Wang 16 9 0 16 Sep 2018
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark N. Xu L. Yang Yuchen Fan Dingcheng Yue Yuchen Liang Jianchao Yang Thomas Huang VOS 11 521 0 06 Sep 2018
ARBEE: Towards Automated Recognition of Bodily Expression of Emotion In the Wild Yu Luo Jianbo Ye Reginald B. Adams Jia Li M. Newman J. Z. Wang 48 86 0 28 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification Yang Du Chunfen Yuan Bing Li Lili Zhao Yangxi Li Weiming Hu 67 79 0 03 Aug 2018
Competitive Analysis System for Theatrical Movie Releases Based on Movie Trailer Deep Video Representation Miguel Campo C. Hsieh Matt Nickens J. J. Espinoza Abhinav Taliyan J. Rieger Jean Ho Bettina Sherick HAI 13 8 0 12 Jul 2018
Spatio-Temporal Channel Correlation Networks for Action Classification Ali Diba Mohsen Fayyaz Vivek Sharma M. M. Arzani Rahman Yousefzadeh Juergen Gall Luc Van Gool 3DPC 18 181 0 19 Jun 2018
Mining for meaning: from vision to language through multiple networks consensus Iulia Duta Andrei Liviu Nicolicioiu Simion-Vlad Bogolin Marius Leordeanu 13 3 0 05 Jun 2018
Gradient-Leaks: Understanding and Controlling Deanonymization in Federated Learning Tribhuvanesh Orekondy Seong Joon Oh Yang Zhang Bernt Schiele Mario Fritz PICV FedML 334 37 0 15 May 2018
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning Fisher Yu Haofeng Chen Xin Wang Wenqi Xian Yingying Chen Fangchen Liu Vashisht Madhavan Trevor Darrell VLM 40 2,090 0 12 May 2018
I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames S. Bhardwaj Mitesh M. Khapra 18 3 0 12 May 2018
Weakly-supervised Visual Instrument-playing Action Detection in Videos Jen-Yu Liu Yi-Hsuan Yang Shyh-Kang Jeng 21 13 0 05 May 2018
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos Silvio Giancola Mohieddine Amine Tarek Dghaily Bernard Ghanem AI4TS 19 193 0 12 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset Dima Damen Hazel Doughty G. Farinella Sanja Fidler Antonino Furnari ... Davide Moltisanti Jonathan Munro Toby Perrett Will Price Michael Wray EgoV 25 995 0 08 Apr 2018
FaceForensics: A Large-scale Video Dataset for Forgery Detection in Human Faces Andreas Rossler D. Cozzolino L. Verdoliva Christian Riess Justus Thies Matthias Nießner PICV AAML CVBM 8 375 0 24 Mar 2018
Towards Universal Representation for Unseen Action Recognition Yi Zhu Yang Long Yu Guan Shawn D. Newsam Ling Shao AI4TS 17 100 0 22 Mar 2018
Recurrent Residual Module for Fast Inference in Videos Bowen Pan Wuwei Lin Xiaolin Fang Chaoqin Huang Bolei Zhou Cewu Lu ObjD 20 33 0 27 Feb 2018
Fine-Grained Land Use Classification at the City Scale Using Ground-Level Images Yi Zhu XueQing Deng Shawn D. Newsam 26 51 0 07 Feb 2018
DeepType: Multilingual Entity Linking by Neural Type System Evolution Jonathan Raiman O. Raiman BDL HAI 117 183 0 03 Feb 2018
Moments in Time Dataset: one million videos for event understanding Mathew Monfort A. Andonian Bolei Zhou K. Ramakrishnan Sarah Adel Bargal ... L. Brown Quanfu Fan Dan Gutfreund Carl Vondrick A. Oliva 45 538 0 09 Jan 2018
Cross-modal Embeddings for Video and Audio Retrieval Dídac Surís A. Duarte Amaia Salvador Jordi Torres Xavier Giró-i-Nieto SSL 16 69 0 07 Jan 2018