1712.04851
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
13 December 2017
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
Papers citing
"Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification"
50 / 675 papers shown
Model-agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition
Kazuki Omi
Jun Kimata
Toru Tamaki
235
11
0
15 Apr 2022
Learning Pixel-Level Distinctions for Video Highlight Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Fanyue Wei
Biao Wang
Bo Xiao
Yuning Jiang
Wen Li
Lixin Duan
150
25
0
10 Apr 2022
Self-Supervised Video Representation Learning with Motion-Contrastive Perception
IEEE International Conference on Multimedia and Expo (ICME), 2022
Jin-Yuan Liu
Ying Cheng
Yuejie Zhang
Ruiwei Zhao
Rui Feng
SSL
198
1
0
10 Apr 2022
Probabilistic Representations for Video Contrastive Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Jungin Park
Jiyoung Lee
Ig-Jae Kim
Kwanghoon Sohn
SSL
312
53
0
08 Apr 2022
Frequency Selective Augmentation for Video Representation Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Jinhyung Kim
Taeoh Kim
Minho Shim
Dongyoon Han
Dongyoon Wee
Junmo Kim
AI4TS
208
5
0
08 Apr 2022
Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level Representations
IEEE Access (IEEE Access), 2022
Jie Jiang
Shaobo Min
Weijie Kong
Dihong Gong
Hongfa Wang
Zhifeng Li
Wei Liu
VLM
335
30
0
07 Apr 2022
Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
Computer Vision and Pattern Recognition (CVPR), 2022
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yi Tian Xu
Xiang Wang
Mingqian Tang
Changxin Gao
Rong Jin
Nong Sang
SSL
AI4TS
242
18
0
06 Apr 2022
An Empirical Study of End-to-End Temporal Action Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Xiaolong Liu
S. Bai
Xiang Bai
218
68
0
06 Apr 2022
Exploiting Temporal Relations on Radar Perception for Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2022
Peizhao Li
Puzuo Wang
K. Berntorp
Hongfu Liu
273
50
0
03 Apr 2022
Deformable Video Transformer
Computer Vision and Pattern Recognition (CVPR), 2022
Jue Wang
Lorenzo Torresani
ViT
198
31
0
31 Mar 2022
Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Computer Vision and Pattern Recognition (CVPR), 2022
Dohwan Ko
Joonmyung Choi
Juyeon Ko
Shinyeong Noh
Kyoung-Woon On
Eun-Sol Kim
Hyunwoo J. Kim
VGen
AI4TS
168
27
0
31 Mar 2022
Controllable Augmentations for Video Representation Learning
Rui Qian
Weiyao Lin
John See
Dian Li
SSL
AI4TS
216
16
0
30 Mar 2022
Interpretable Prediction of Pulmonary Hypertension in Newborns using Echocardiograms
German Conference on Pattern Recognition (GCPR), 2022
H. Ragnarsdóttir
Laura Manduchi
H. Michel
F. Laumer
S. Wellmann
Ece Ozkan
Julia-Franziska Vogt
183
3
0
24 Mar 2022
Facial Expression Analysis Using Decomposed Multiscale Spatiotemporal Networks
Expert systems with applications (ESWA), 2022
W. Melo
Mohammadhadi Shateri
Miguel Bordallo López
CVBM
167
32
0
21 Mar 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Thanh-Dat Truong
Quoc-Huy Bui
C. Duong
Han-Seok Seo
Son Lam Phung
Xin Li
Khoa Luu
ViT
212
69
0
19 Mar 2022
Group Contextualization for Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Y. Hao
Haotong Zhang
Chong-Wah Ngo
Xiangnan He
145
35
0
18 Mar 2022
Gate-Shift-Fuse for Video Action Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
263
33
0
16 Mar 2022
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
Yidan Sun
Qin Chao
Yangfeng Ji
Boyang Albert Li
VGen
443
11
0
11 Mar 2022
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
Computer Vision and Pattern Recognition (CVPR), 2022
Yutong Chen
Fangyun Wei
Xiao Sun
Zhirong Wu
Stephen Lin
SLR
222
134
0
08 Mar 2022
End-to-End Semi-Supervised Learning for Video Action Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Akash Kumar
Yogesh S Rawat
236
37
0
08 Mar 2022
Behavior Recognition Based on the Integration of Multigranular Motion Features
Lizong Zhang
Yiming Wang
Bei Hui
Xiu Zhang
Sijuan Liu
Shuxin Feng
91
0
0
07 Mar 2022
Motion-driven Visual Tempo Learning for Video-based Action Recognition
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yuanzhong Liu
Junsong Yuan
Zhigang Tu
211
78
0
24 Feb 2022
VLP: A Survey on Vision-Language Pre-training
Machine Intelligence Research (MIR), 2022
Feilong Chen
Duzhen Zhang
Minglun Han
Xiuyi Chen
Jing Shi
Shuang Xu
Bo Xu
VLM
393
287
0
18 Feb 2022
Shift-Memory Network for Temporal Scene Segmentation
Guo Cheng
J. Zheng
256
0
0
17 Feb 2022
Should I take a walk? Estimating Energy Expenditure from Video Data
Kunyu Peng
Alina Roitberg
Kailun Yang
Kailai Li
Rainer Stiefelhagen
176
6
0
01 Feb 2022
vCLIMB: A Novel Video Class Incremental Learning Benchmark
Computer Vision and Pattern Recognition (CVPR), 2022
Andrés Villa
Kumail Alhamoud
Juan Carlos León Alcázar
Fabian Caba Heilbron
Victor Escorcia
Guohao Li
CLL
416
43
0
23 Jan 2022
Self-supervised Video Representation Learning with Cascade Positive Retrieval
Cheng-En Wu
Farley Lai
Yujie Hu
Asim Kadav
SSL
AI4TS
334
5
0
20 Jan 2022
Action Keypoint Network for Efficient Video Recognition
IEEE Transactions on Image Processing (IEEE TIP), 2022
Xu Chen
Yahong Han
Xiaohan Wang
Yifang Sun
Yi Yang
3DPC
245
8
0
17 Jan 2022
Multiview Transformers for Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Shen Yan
Xuehan Xiong
Anurag Arnab
Zhichao Lu
Mi Zhang
Chen Sun
Cordelia Schmid
ViT
433
269
0
12 Jan 2022
Motion-Focused Contrastive Learning of Video Representations
IEEE International Conference on Computer Vision (ICCV), 2021
Rui Li
Yiheng Zhang
Zhaofan Qiu
Ting Yao
Dong Liu
Tao Mei
SSL
193
37
0
11 Jan 2022
Representing Videos as Discriminative Sub-graphs for Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Dong Li
Zhaofan Qiu
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
223
33
0
11 Jan 2022
Boosting Video Representation Learning with Multi-Faceted Integration
Computer Vision and Pattern Recognition (CVPR), 2021
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xiaoping Zhang
Dong Wu
Tao Mei
176
9
0
11 Jan 2022
Condensing a Sequence to One Informative Frame for Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Zhaofan Qiu
Ting Yao
Y. Shu
Chong-Wah Ngo
Tao Mei
227
11
0
11 Jan 2022
Optimization Planning for 3D ConvNets
International Conference on Machine Learning (ICML), 2022
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
3DPC
3DH
208
9
0
11 Jan 2022
Discrete and continuous representations and processing in deep learning: Looking forward
AI Open (AO), 2022
Ruben Cartuyvels
Graham Spinks
Marie-Francine Moens
OCL
300
28
0
04 Jan 2022
Fine-grained Multi-Modal Self-Supervised Learning
British Machine Vision Conference (BMVC), 2021
Duo Wang
S. Karout
SSL
116
7
0
22 Dec 2021
Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Sofia Broomé
Ernest Pokropek
Boyu Li
Hedvig Kjellström
209
8
0
22 Dec 2021
Max-Margin Contrastive Learning
AAAI Conference on Artificial Intelligence (AAAI), 2021
Anshul B. Shah
S. Sra
Ramalingam Chellappa
A. Cherian
SSL
149
53
0
21 Dec 2021
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Yinghao Xu
Fangyun Wei
Xiao Sun
Ceyuan Yang
Yujun Shen
Bo Dai
Bolei Zhou
Stephen Lin
VLM
177
62
0
17 Dec 2021
Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation
Yujia Zhang
L. Po
Xuyuan Xu
Mengyang Liu
Yexin Wang
Weifeng Ou
Yuzhi Zhao
Weikang Yu
SSL
AI4TS
249
18
0
16 Dec 2021
Temporal Transformer Networks with Self-Supervision for Action Recognition
Yongkang Zhang
Jun Li
Guoming Wu
Hanjie Zhang
Zhiping Shi
Zhaoxun Liu
Zizhang Wu
ViT
243
7
0
14 Dec 2021
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
Lezhi Li
Xinyu Gong
Junru Wu
Humphrey Shi
Zhicheng Yan
Zinan Lin
VGen
155
1
0
09 Dec 2021
DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
Yuxuan Liang
Pan Zhou
Roger Zimmermann
Shuicheng Yan
ViT
181
25
0
09 Dec 2021
Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning
K. Navaneet
Soroush Abbasi Koohpayegani
Ajinkya Tejankar
Kossar Pourahmadi
Akshayvarun Subramanya
Hamed Pirsiavash
SSL
233
8
0
08 Dec 2021
MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification
Rex Liu
Huan Zhang
Hamed Pirsiavash
Xin Liu
ViT
295
16
0
08 Dec 2021
Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
Nina Shvetsova
Brian Chen
Andrew Rouditchenko
Samuel Thomas
Brian Kingsbury
Rogerio Feris
David Harwath
James R. Glass
Hilde Kuehne
ViT
306
154
0
08 Dec 2021
Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning
Srijan Das
Michael S. Ryoo
SSL
281
1
0
07 Dec 2021
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
211
30
0
07 Dec 2021
Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning
Manlin Zhang
Jinpeng Wang
A. J. Ma
155
9
0
07 Dec 2021
Time-Equivariant Contrastive Video Representation Learning
Simon Jenni
Hailin Jin
SSL
AI4TS
332
61
0
07 Dec 2021