Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.11248
Cited By
A Closer Look at Spatiotemporal Convolutions for Action Recognition
30 November 2017
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Closer Look at Spatiotemporal Convolutions for Action Recognition"
50 / 1,270 papers shown
Title
Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing
Florian Dubost
Erin Hong
Nandita Bhaskhar
Siyi Tang
D. Rubin
Christopher Lee-Messer
NoLa
16
0
0
28 Nov 2020
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
24
0
0
25 Nov 2020
A3D: Adaptive 3D Networks for Video Action Recognition
Sijie Zhu
Taojiannan Yang
Matías Mendieta
Chong Chen
3DH
32
12
0
24 Nov 2020
Play Fair: Frame Attributions in Video Models
Will Price
Dima Damen
FAtt
31
5
0
24 Nov 2020
KShapeNet: Riemannian network on Kendall shape space for Skeleton based Action Recognition
Racha Friji
Hassen Drira
F. Chaieb
S. Kurtek
Hamza Kchok
3DPC
22
2
0
24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
33
123
0
23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning
Zehua Zhang
David J. Crandall
AI4TS
SSL
28
23
0
23 Nov 2020
The complementarity of a diverse range of deep learning features extracted from video content for video recommendation
A. Almeida
J. D. Villiers
A. Freitas
Mergandran Velayudan
19
16
0
21 Nov 2020
DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets
Jianpeng Zhang
Yutong Xie
Yong-quan Xia
Chunhua Shen
22
155
0
20 Nov 2020
Master Thesis: Neural Sign Language Translation by Learning Tokenization
Alptekin Orbay
SLR
12
0
0
18 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions
Mohsen Fayyaz
Emad Bahrami Rad
Ali Diba
M. Noroozi
Ehsan Adeli
Luc Van Gool
Juergen Gall
3DPC
24
30
0
17 Nov 2020
Audio-Visual Event Recognition through the lens of Adversary
Juncheng Li
Kaixin Ma
Shuhui Qu
Po-Yao (Bernie) Huang
Florian Metze
AAML
8
9
0
15 Nov 2020
ActBERT: Learning Global-Local Video-Text Representations
Linchao Zhu
Yi Yang
ViT
49
417
0
14 Nov 2020
Adding Knowledge to Unsupervised Algorithms for the Recognition of Intent
Stuart Synakowski
Qianli Feng
Aleix M. Martinez
OCL
14
6
0
12 Nov 2020
Ontology-driven Event Type Classification in Images
Eric Müller-Budack
Matthias Springstein
Sherzod Hakimov
Kevin Mrutzek
Ralph Ewerth
19
9
0
09 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
29
1
0
08 Nov 2020
Predictive Process Model Monitoring using Recurrent Neural Networks
Johannes De Smedt
Jochen De Weerdt
25
0
0
05 Nov 2020
Mutual Modality Learning for Video Action Classification
Stepan Alekseevich Komkov
Maksim Dzabraev
Aleksandr Petiushko
27
9
0
04 Nov 2020
Learning Representations from Audio-Visual Spatial Alignment
Pedro Morgado
Yi Li
Nuno Vasconcelos
SSL
27
121
0
03 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition
Zihao Wang
Chen Lin
Lu Sheng
Junjie Yan
Jing Shao
ViT
17
7
0
02 Nov 2020
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning
L. Tao
Xueting Wang
T. Yamasaki
VLM
SSL
23
14
0
29 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching
Haoyuan Zhang
Yonghong Hou
Pichao Wang
Zihui Guo
Wanqing Li
32
15
0
29 Oct 2020
Spatio-temporal Features for Generalized Detection of Deepfake Videos
Ipek Ganiyusufoglu
L. Ngô
N. Savov
Sezer Karaoglu
Theo Gevers
32
41
0
22 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
23
95
0
22 Oct 2020
Extraction of Discrete Spectra Modes from Video Data Using a Deep Convolutional Koopman Network
S. Leask
V. McDonell
11
1
0
19 Oct 2020
Hierarchical Conditional Relation Networks for Multimodal Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
BDL
24
22
0
18 Oct 2020
VolumeNet: A Lightweight Parallel Network for Super-Resolution of Medical Volumetric Data
Yinhao Li
Yutaro Iwamoto
Lanfen Lin
R. Xu
Yenwei Chen
SupR
29
38
0
16 Oct 2020
Pose And Joint-Aware Action Recognition
Anshul B. Shah
Shlok Kumar Mishra
Ankan Bansal
Jun-Cheng Chen
Ramalingam Chellappa
Abhinav Shrivastava
44
33
0
16 Oct 2020
Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning
Xinyu Yang
Majid Mirmehdi
T. Burghardt
27
4
0
14 Oct 2020
Video Action Understanding
Matthew Hutchinson
V. Gadepally
43
20
0
13 Oct 2020
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain
Francesco Ragusa
Antonino Furnari
S. Livatino
G. Farinella
EgoV
24
99
0
12 Oct 2020
Reconfigurable Cyber-Physical System for Lifestyle Video-Monitoring via Deep Learning
Daniel Deniz
Francisco Barranco
J. Isern
Eduardo Ros
9
7
0
07 Oct 2020
Support-set bottlenecks for video-text representation learning
Mandela Patrick
Po-Yao (Bernie) Huang
Yuki M. Asano
Florian Metze
Alexander G. Hauptmann
João Henriques
Andrea Vedaldi
22
244
0
06 Oct 2020
Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing
Okan Kopuklu
Stefan Hormann
Fabian Herzog
Hakan Çevikalp
Gerhard Rigoll
3DPC
23
15
0
30 Sep 2020
Score-level Multi Cue Fusion for Sign Language Recognition
Çagri Gökçe
Ogulcan Özdemir
A. Kındıroglu
L. Akarun
SLR
19
23
0
29 Sep 2020
PERF-Net: Pose Empowered RGB-Flow Net
Yinxiao Li
Zhichao Lu
Xuehan Xiong
Jonathan Huang
3DH
40
17
0
28 Sep 2020
Online Learnable Keyframe Extraction in Videos and its Application with Semantic Word Vector in Action Recognition
G. Elahi
Herbert Yang
25
25
0
25 Sep 2020
On the spatiotemporal behavior in biology-mimicking computing systems
J. Végh
Ádám-József Berki
22
6
0
18 Sep 2020
Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks
Iulia Duta
Andrei Liviu Nicolicioiu
Marius Leordeanu
26
6
0
17 Sep 2020
Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations
Yanyi Zhang
Xinyu Li
I. Marsic
HAI
28
23
0
16 Sep 2020
Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning
Jinpeng Wang
Yuting Gao
Ke Li
Yiqi Lin
A. J. Ma
Hao Cheng
Pai Peng
Feiyue Huang
Rongrong Ji
Xing Sun
SSL
54
96
0
12 Sep 2020
Online Spatiotemporal Action Detection and Prediction via Causal Representations
Gurkirt Singh
3DPC
CML
24
0
0
31 Aug 2020
Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics
Jiangliu Wang
Jianbo Jiao
Linchao Bao
Shengfeng He
Wei Liu
Yunhui Liu
SSL
AI4TS
21
55
0
31 Aug 2020
All About Knowledge Graphs for Actions
P. Ghosh
Nirat Saini
L. Davis
Abhinav Shrivastava
24
31
0
28 Aug 2020
DMD: A Large-Scale Multi-Modal Driver Monitoring Dataset for Attention and Alertness Analysis
J. Ortega
Neslihan Köse
P. Cañas
Min-An Chao
A. Unnervik
Marcos Nieto
Oihana Otaegui
L. Salgado
27
91
0
27 Aug 2020
Self-Supervised Human Activity Recognition by Augmenting Generative Adversarial Networks
Mohammad Zaki Zadeh
Ashwin Ramesh Babu
Ashish Jaiswal
F. Makedon
14
16
0
26 Aug 2020
Making a Case for 3D Convolutions for Object Segmentation in Videos
Sabarinath Mahadevan
A. Athar
Aljosa Osep
Sebastian Hennen
Laura Leal-Taixé
Bastian Leibe
VOS
21
87
0
26 Aug 2020
Effective Action Recognition with Embedded Key Point Shifts
Haozhi Cao
Yuecong Xu
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
15
7
0
26 Aug 2020
Discriminability Distillation in Group Representation Learning
Manyuan Zhang
Guanglu Song
Hang Zhou
Yu Liu
FedML
17
18
0
25 Aug 2020
Quantitative Survey of the State of the Art in Sign Language Recognition
Oscar Koller
SLR
27
94
0
22 Aug 2020
Previous
1
2
3
...
19
20
21
...
24
25
26
Next