A Closer Look at Spatiotemporal Convolutions for Action Recognition


30 November 2017
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri

Papers citing "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

50 / 1,270 papers shown
Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing
Florian Dubost
Erin Hong
Nandita Bhaskhar
Siyi Tang
D. Rubin
Christopher Lee-Messer
NoLa
16
0
0
28 Nov 2020
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
24
0
0
25 Nov 2020
A3D: Adaptive 3D Networks for Video Action Recognition
Sijie Zhu
Taojiannan Yang
Matías Mendieta
Chong Chen
3DH
32
12
0
24 Nov 2020
Play Fair: Frame Attributions in Video Models
Will Price
Dima Damen
FAtt
31
5
0
24 Nov 2020
KShapeNet: Riemannian network on Kendall shape space for Skeleton based Action Recognition
Racha Friji
Hassen Drira
F. Chaieb
S. Kurtek
Hamza Kchok
3DPC
22
2
0
24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
33
123
0
23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning
Zehua Zhang
David J. Crandall
AI4TS
SSL
28
23
0
23 Nov 2020
The complementarity of a diverse range of deep learning features extracted from video content for video recommendation
A. Almeida
J. D. Villiers
A. Freitas
Mergandran Velayudan
19
16
0
21 Nov 2020
DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets
Jianpeng Zhang
Yutong Xie
Yong-quan Xia
Chunhua Shen
22
155
0
20 Nov 2020
Master Thesis: Neural Sign Language Translation by Learning Tokenization
Alptekin Orbay
SLR
12
0
0
18 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions
Mohsen Fayyaz
Emad Bahrami Rad
Ali Diba
M. Noroozi
Ehsan Adeli
Luc Van Gool
Juergen Gall
3DPC
24
30
0
17 Nov 2020
Audio-Visual Event Recognition through the lens of Adversary
Juncheng Li
Kaixin Ma
Shuhui Qu
Po-Yao (Bernie) Huang
Florian Metze
AAML
8
9
0
15 Nov 2020
ActBERT: Learning Global-Local Video-Text Representations
Linchao Zhu
Yi Yang
ViT
49
417
0
14 Nov 2020
Adding Knowledge to Unsupervised Algorithms for the Recognition of Intent
Stuart Synakowski
Qianli Feng
Aleix M. Martinez
OCL
14
6
0
12 Nov 2020
Ontology-driven Event Type Classification in Images
Eric Müller-Budack
Matthias Springstein
Sherzod Hakimov
Kevin Mrutzek
Ralph Ewerth
19
9
0
09 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
29
1
0
08 Nov 2020
Predictive Process Model Monitoring using Recurrent Neural Networks
Johannes De Smedt
Jochen De Weerdt
25
0
0
05 Nov 2020
Mutual Modality Learning for Video Action Classification
Stepan Alekseevich Komkov
Maksim Dzabraev
Aleksandr Petiushko
27
9
0
04 Nov 2020
Learning Representations from Audio-Visual Spatial Alignment
Pedro Morgado
Yi Li
Nuno Vasconcelos
SSL
27
121
0
03 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition
Zihao Wang
Chen Lin
Lu Sheng
Junjie Yan
Jing Shao
ViT
17
7
0
02 Nov 2020
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning
L. Tao
Xueting Wang
T. Yamasaki
VLM
SSL
23
14
0
29 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching
Haoyuan Zhang
Yonghong Hou
Pichao Wang
Zihui Guo
Wanqing Li
32
15
0
29 Oct 2020
Spatio-temporal Features for Generalized Detection of Deepfake Videos
Ipek Ganiyusufoglu
L. Ngô
N. Savov
Sezer Karaoglu
Theo Gevers
32
41
0
22 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
23
95
0
22 Oct 2020
Extraction of Discrete Spectra Modes from Video Data Using a Deep Convolutional Koopman Network
S. Leask
V. McDonell
11
1
0
19 Oct 2020
Hierarchical Conditional Relation Networks for Multimodal Video Question Answering
T. Le
Vuong Le
Svetha Venkatesh
T. Tran
BDL
24
22
0
18 Oct 2020
VolumeNet: A Lightweight Parallel Network for Super-Resolution of Medical Volumetric Data
Yinhao Li
Yutaro Iwamoto
Lanfen Lin
R. Xu
Yenwei Chen
SupR
29
38
0
16 Oct 2020
Pose And Joint-Aware Action Recognition
Anshul B. Shah
Shlok Kumar Mishra
Ankan Bansal
Jun-Cheng Chen
Ramalingam Chellappa
Abhinav Shrivastava
44
33
0
16 Oct 2020
Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning
Xinyu Yang
Majid Mirmehdi
T. Burghardt
27
4
0
14 Oct 2020
Video Action Understanding
Matthew Hutchinson
V. Gadepally
43
20
0
13 Oct 2020
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain
Francesco Ragusa
Antonino Furnari
S. Livatino
G. Farinella
EgoV
24
99
0
12 Oct 2020
Reconfigurable Cyber-Physical System for Lifestyle Video-Monitoring via Deep Learning
Daniel Deniz
Francisco Barranco
J. Isern
Eduardo Ros
9
7
0
07 Oct 2020
Support-set bottlenecks for video-text representation learning
Mandela Patrick
Po-Yao (Bernie) Huang
Yuki M. Asano
Florian Metze
Alexander G. Hauptmann
João Henriques
Andrea Vedaldi
22
244
0
06 Oct 2020
Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing
Okan Kopuklu
Stefan Hormann
Fabian Herzog
Hakan Çevikalp
Gerhard Rigoll
3DPC
23
15
0
30 Sep 2020
Score-level Multi Cue Fusion for Sign Language Recognition
Çagri Gökçe
Ogulcan Özdemir
A. Kındıroglu
L. Akarun
SLR
19
23
0
29 Sep 2020
PERF-Net: Pose Empowered RGB-Flow Net
Yinxiao Li
Zhichao Lu
Xuehan Xiong
Jonathan Huang
3DH
40
17
0
28 Sep 2020
Online Learnable Keyframe Extraction in Videos and its Application with Semantic Word Vector in Action Recognition
G. Elahi
Herbert Yang
25
25
0
25 Sep 2020
On the spatiotemporal behavior in biology-mimicking computing systems
J. Végh
Ádám-József Berki
22
6
0
18 Sep 2020
Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks
Iulia Duta
Andrei Liviu Nicolicioiu
Marius Leordeanu
26
6
0
17 Sep 2020
Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations
Yanyi Zhang
Xinyu Li
I. Marsic
HAI
28
23
0
16 Sep 2020
Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning
Jinpeng Wang
Yuting Gao
Ke Li
Yiqi Lin
A. J. Ma
Hao Cheng
Pai Peng
Feiyue Huang
Rongrong Ji
Xing Sun
SSL
54
96
0
12 Sep 2020
Online Spatiotemporal Action Detection and Prediction via Causal Representations
Gurkirt Singh
3DPC
CML
24
0
0
31 Aug 2020
Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics
Jiangliu Wang
Jianbo Jiao
Linchao Bao
Shengfeng He
Wei Liu
Yunhui Liu
SSL
AI4TS
21
55
0
31 Aug 2020
All About Knowledge Graphs for Actions
P. Ghosh
Nirat Saini
L. Davis
Abhinav Shrivastava
24
31
0
28 Aug 2020
DMD: A Large-Scale Multi-Modal Driver Monitoring Dataset for Attention and Alertness Analysis
J. Ortega
Neslihan Köse
P. Cañas
Min-An Chao
A. Unnervik
Marcos Nieto
Oihana Otaegui
L. Salgado
27
91
0
27 Aug 2020
Self-Supervised Human Activity Recognition by Augmenting Generative Adversarial Networks
Mohammad Zaki Zadeh
Ashwin Ramesh Babu
Ashish Jaiswal
F. Makedon
14
16
0
26 Aug 2020
Making a Case for 3D Convolutions for Object Segmentation in Videos
Sabarinath Mahadevan
A. Athar
Aljosa Osep
Sebastian Hennen
Laura Leal-Taixé
Bastian Leibe
VOS
21
87
0
26 Aug 2020
Effective Action Recognition with Embedded Key Point Shifts
Haozhi Cao
Yuecong Xu
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
15
7
0
26 Aug 2020
Discriminability Distillation in Group Representation Learning
Manyuan Zhang
Guanglu Song
Hang Zhou
Yu Liu
FedML
17
18
0
25 Aug 2020
Quantitative Survey of the State of the Art in Sign Language Recognition
Oscar Koller
SLR
27
94
0
22 Aug 2020