Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification

13 December 2017

Papers citing "Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification"

50 / 675 papers shown

Model-agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition

Kazuki Omi

Jun Kimata

Toru Tamaki

235

15 Apr 2022

Learning Pixel-Level Distinctions for Video Highlight Detection

Computer Vision and Pattern Recognition (CVPR), 2022

150

10 Apr 2022

Self-Supervised Video Representation Learning with Motion-Contrastive Perception

IEEE International Conference on Multimedia and Expo (ICME), 2022

Rui Feng

198

10 Apr 2022

Probabilistic Representations for Video Contrastive Learning

Computer Vision and Pattern Recognition (CVPR), 2022

312

08 Apr 2022

Frequency Selective Augmentation for Video Representation Learning

AAAI Conference on Artificial Intelligence (AAAI), 2022

208

08 Apr 2022

Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level Representations

IEEE Access, 2022

Shaobo Min

Hongfa Wang

Wei Liu

335

07 Apr 2022

Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency

Computer Vision and Pattern Recognition (CVPR), 2022

242

06 Apr 2022

An Empirical Study of End-to-End Temporal Action Detection

Computer Vision and Pattern Recognition (CVPR), 2022

Xiaolong Liu

S. Bai

Xiang Bai

218

06 Apr 2022

Exploiting Temporal Relations on Radar Perception for Autonomous Driving

Computer Vision and Pattern Recognition (CVPR), 2022

273

03 Apr 2022

Deformable Video Transformer

Computer Vision and Pattern Recognition (CVPR), 2022

Jue Wang

Lorenzo Torresani

ViT

198

31 Mar 2022

Video-Text Representation Learning via Differentiable Weak Temporal Alignment

Computer Vision and Pattern Recognition (CVPR), 2022

168

31 Mar 2022

Controllable Augmentations for Video Representation Learning

216

30 Mar 2022

Interpretable Prediction of Pulmonary Hypertension in Newborns using Echocardiograms

German Conference on Pattern Recognition (GCPR), 2022

183

24 Mar 2022

Facial Expression Analysis Using Decomposed Multiscale Spatiotemporal Networks

Expert Systems with Applications (ESWA), 2022

W. Melo

Mohammadhadi Shateri

Miguel Bordallo López

CVBM

167

21 Mar 2022

DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

Computer Vision and Pattern Recognition (CVPR), 2022

212

19 Mar 2022

Group Contextualization for Video Recognition

Computer Vision and Pattern Recognition (CVPR), 2022

Y. Hao

Haotong Zhang

Chong-Wah Ngo

Xiangnan He

145

18 Mar 2022

Gate-Shift-Fuse for Video Action Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Swathikiran Sudhakaran

Sergio Escalera

Oswald Lanz

263

16 Mar 2022

Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding

443

11 Mar 2022

A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation

Computer Vision and Pattern Recognition (CVPR), 2022

222

134

08 Mar 2022

End-to-End Semi-Supervised Learning for Video Action Detection

Computer Vision and Pattern Recognition (CVPR), 2022

Akash Kumar

Yogesh S Rawat

236

08 Mar 2022

Behavior Recognition Based on the Integration of Multigranular Motion Features

07 Mar 2022

Motion-driven Visual Tempo Learning for Video-based Action Recognition

IEEE Transactions on Image Processing (IEEE TIP), 2022

Yuanzhong Liu

Junsong Yuan

Zhigang Tu

211

24 Feb 2022

VLP: A Survey on Vision-Language Pre-training

Machine Intelligence Research (MIR), 2022

Minglun Han

393

287

18 Feb 2022

Shift-Memory Network for Temporal Scene Segmentation

Guo Cheng

J. Zheng

256

17 Feb 2022

Should I take a walk? Estimating Energy Expenditure from Video Data

Kailun Yang

176

01 Feb 2022

vCLIMB: A Novel Video Class Incremental Learning Benchmark

Computer Vision and Pattern Recognition (CVPR), 2022

Andrés Villa

Kumail Alhamoud

Juan Carlos León Alcázar

416

23 Jan 2022

Self-supervised Video Representation Learning with Cascade Positive Retrieval

334

20 Jan 2022

Action Keypoint Network for Efficient Video Recognition

IEEE Transactions on Image Processing (IEEE TIP), 2022

Yi Yang

245

17 Jan 2022

Multiview Transformers for Video Recognition

Computer Vision and Pattern Recognition (CVPR), 2022

433

269

12 Jan 2022

Motion-Focused Contrastive Learning of Video Representations

IEEE International Conference on Computer Vision (ICCV), 2021

Yiheng Zhang

Tao Mei

193

11 Jan 2022

Representing Videos as Discriminative Sub-graphs for Action Recognition

Computer Vision and Pattern Recognition (CVPR), 2021

Yingwei Pan

Tao Mei

223

11 Jan 2022

Boosting Video Representation Learning with Multi-Faceted Integration

Computer Vision and Pattern Recognition (CVPR), 2021

Chong-Wah Ngo

Tao Mei

176

11 Jan 2022

Condensing a Sequence to One Informative Frame for Video Recognition

IEEE International Conference on Computer Vision (ICCV), 2021

Zhaofan Qiu

Ting Yao

Y. Shu

Chong-Wah Ngo

Tao Mei

227

11 Jan 2022

Optimization Planning for 3D ConvNets

International Conference on Machine Learning (ICML), 2022

Zhaofan Qiu

Ting Yao

Chong-Wah Ngo

Tao Mei

3DPC 3DH

208

11 Jan 2022

Discrete and continuous representations and processing in deep learning: Looking forward

AI Open (AO), 2022

300

04 Jan 2022

Fine-grained Multi-Modal Self-Supervised Learning

British Machine Vision Conference (BMVC), 2021

Duo Wang

S. Karout

SSL

116

22 Dec 2021

Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action Recognition

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

209

22 Dec 2021

Max-Margin Contrastive Learning

AAAI Conference on Artificial Intelligence (AAAI), 2021

149

21 Dec 2021

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Computer Vision and Pattern Recognition (CVPR), 2021

177

17 Dec 2021

Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

249

16 Dec 2021

Temporal Transformer Networks with Self-Supervision for Action Recognition

Jun Li

243

14 Dec 2021

Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search

155

09 Dec 2021

DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition

Roger Zimmermann

181

09 Dec 2021

Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning

K. Navaneet

Soroush Abbasi Koohpayegani

Ajinkya Tejankar

Kossar Pourahmadi

Akshayvarun Subramanya

Hamed Pirsiavash

SSL

233

08 Dec 2021

MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification

295

08 Dec 2021

Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval

306

154

08 Dec 2021

Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning

Srijan Das

Michael S. Ryoo

SSL

281

07 Dec 2021

ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints

Srijan Das

Michael S. Ryoo

SSL

211

07 Dec 2021

Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning

Manlin Zhang

Jinpeng Wang

A. J. Ma

155

07 Dec 2021

Time-Equivariant Contrastive Video Representation Learning

Simon Jenni

Hailin Jin

SSL AI4TS

332

07 Dec 2021