v1v2 (latest)

Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification

13 December 2017

Papers citing "Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification"

50 / 675 papers shown

Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video ClassificationIEEE Access (IEEE Access), 2020

257

01 Dec 2020

Recent Progress in Appearance-based Action Recognition

J. Humphreys

Zhe Chen

Dacheng Tao

170

25 Nov 2020

A3D: Adaptive 3D Networks for Video Action Recognition

176

24 Nov 2020

Play Fair: Frame Attributions in Video ModelsAsian Conference on Computer Vision (ACCV), 2020

Will Price

Dima Damen

FAtt

119

24 Nov 2020

QuerYD: A video dataset with high-quality text and audio narrations

Andreea-Maria Oncescu

João F. Henriques

Yang Liu

Andrew Zisserman

Samuel Albanie

VGen

172

22 Nov 2020

$We don't Need Thousand Proposals$\colon$ Single Shot Actor-Action Detection in Videos$

We don't Need Thousand Proposals

\colon

Single Shot Actor-Action Detection in VideosIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

A. J. Rana

Yogesh S Rawat

ViT

137

22 Nov 2020

3D CNNs with Adaptive Temporal Feature Resolutions

Luc Van Gool

Juergen Gall

3DPC

222

17 Nov 2020

ActBERT: Learning Global-Local Video-Text RepresentationsComputer Vision and Pattern Recognition (CVPR), 2020

Linchao Zhu

Yi Yang

ViT

324

451

14 Nov 2020

Multimodal Pretraining for Dense Video Captioning

180

101

10 Nov 2020

Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition

171

10 Nov 2020

Mutual Modality Learning for Video Action Classification

Stepan Alekseevich Komkov

Maksim Dzabraev

Aleksandr Petiushko

158

04 Nov 2020

PV-NAS: Practical Neural Architecture Search for Video Recognition

304

02 Nov 2020

Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning

249

29 Oct 2020

Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition

291

116

22 Oct 2020

Pose And Joint-Aware Action Recognition

328

16 Oct 2020

Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning

Xinyu Yang

Majid Mirmehdi

T. Burghardt

391

14 Oct 2020

Boosting Continuous Sign Language Recognition via Cross Modality AugmentationACM Multimedia (ACM MM), 2020

182

126

11 Oct 2020

Contrastive Representation Learning: A Framework and ReviewIEEE Access (IEEE Access), 2020

588

848

10 Oct 2020

Support-set bottlenecks for video-text representation learning

Mandela Patrick

Po-Yao (Bernie) Huang

Yuki M. Asano

Florian Metze

Alexander G. Hauptmann

João Henriques

Andrea Vedaldi

342

260

06 Oct 2020

Hierarchical Domain-Adapted Feature Learning for Video Saliency PredictionInternational Journal of Computer Vision (IJCV), 2020

Giovanni Bellitto

Federica Proietto Salanitri

363

02 Oct 2020

PERF-Net: Pose Empowered RGB-Flow NetIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020

272

28 Sep 2020

On the spatiotemporal behavior in biology-mimicking computing systems

J. Végh

Ádám-József Berki

134

18 Sep 2020

Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural NetworksNeural Information Processing Systems (NeurIPS), 2020

Iulia Duta

Andrei Liviu Nicolicioiu

Marius Leordeanu

326

17 Sep 2020

Multi-Label Activity Recognition using Activity-specific Features and Activity CorrelationsComputer Vision and Pattern Recognition (CVPR), 2020

157

16 Sep 2020

Online Spatiotemporal Action Detection and Prediction via Causal Representations

Gurkirt Singh

3DPC CML

181

31 Aug 2020

Self-supervised Video Representation Learning by Uncovering Spatio-temporal StatisticsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

Wei Liu

199

31 Aug 2020

DMD: A Large-Scale Multi-Modal Driver Monitoring Dataset for Attention and Alertness Analysis

188

120

27 Aug 2020

Making a Case for 3D Convolutions for Object Segmentation in VideosBritish Machine Vision Conference (BMVC), 2020

Laura Leal-Taixé

322

26 Aug 2020

Effective Action Recognition with Embedded Key Point ShiftsPattern Recognition (Pattern Recognit.), 2020

Yuecong Xu

147

26 Aug 2020

Global-local Enhancement Network for NMFs-aware Sign Language Recognition

242

24 Aug 2020

AssembleNet++: Assembling Modality Representations via Attention Connections

169

18 Aug 2020

Self-supervised Video Representation Learning by Pace Prediction

248

251

13 Aug 2020

Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning

Rui Feng

269

117

13 Aug 2020

TransNet V2: An effective deep network architecture for fast shot transition detectionACM Multimedia (ACM MM), 2020

Tomás Soucek

Jakub Lokoč

301

181

11 Aug 2020

Spatiotemporal Contrastive Video Representation LearningComputer Vision and Pattern Recognition (CVPR), 2020

Ming-Hsuan Yang

409

543

09 Aug 2020

PAN: Towards Fast Action Recognition via Learning Persistence of Appearance

156

08 Aug 2020

Exploring Relations in Untrimmed Videos for Self-Supervised Learning

223

06 Aug 2020

Self-supervised Video Representation Learning Using Inter-intra Contrastive FrameworkACM Multimedia (ACM MM), 2020

336

114

06 Aug 2020

Late Temporal Modeling in 3D CNN Architectures with BERT for Action Recognition

198

154

03 Aug 2020

Residual Frames with Efficient Pseudo-3D CNN for Human Action Recognition

Jiawei Chen

Jenson Hsiao

C. Ho

200

03 Aug 2020

The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)

Yang Liu

...

168

03 Aug 2020

Learning Video Representations from Textual Web Supervision

244

29 Jul 2020

Approximated Bilinear Modules for Temporal ModelingIEEE International Conference on Computer Vision (ICCV), 2019

124

25 Jul 2020

AttentionNAS: Spatiotemporal Attention Cell Search for Video ClassificationEuropean Conference on Computer Vision (ECCV), 2020

294

23 Jul 2020

Perceptron Synthesis Network: Rethinking the Action Scale Variances in Videos

Yuan Tian

Guangtao Zhai

Zhiyong Gao

157

22 Jul 2020

Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human Action Recognition

334

22 Jul 2020

Directional Temporal Modeling for Action Recognition

Xinyu Li

Bing Shuai

Joseph Tighe

123

21 Jul 2020

Multi-modal Transformer for Video Retrieval

1.1K

675

21 Jul 2020

Hierarchical Contrastive Motion Learning for Video Action Recognition

290

20 Jul 2020

MotionSqueeze: Neural Motion Feature Learning for Video Understanding

165

143

20 Jul 2020