Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1712.04851
Cited By
v1
v2 (latest)
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
13 December 2017
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification"
50 / 675 papers shown
TrUMAn: Trope Understanding in Movies and Animations
International Conference on Information and Knowledge Management (CIKM), 2021
Hung-Ting Su
Po-Wei Shen
Bing-Chen Tsai
Wen-Feng Cheng
Ke-Jyun Wang
Winston H. Hsu
173
6
0
10 Aug 2021
Video Contrastive Learning with Global Context
Haofei Kuang
Yi Zhu
Zhi-Li Zhang
Xinyu Li
Joseph Tighe
Sören Schwertfeger
C. Stachniss
Mu Li
SSL
AI4TS
244
64
0
05 Aug 2021
Token Shift Transformer for Video Classification
ACM Multimedia (ACM MM), 2021
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
287
127
0
05 Aug 2021
Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization
IEEE International Conference on Computer Vision (ICCV), 2021
Rui Qian
Yuxi Li
Huabin Liu
John See
Shuangrui Ding
Xian Liu
Dian Li
Weiyao Lin
287
43
0
04 Aug 2021
Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning
IEEE International Conference on Computer Vision (ICCV), 2021
Siyuan Yang
Jun Liu
Shijian Lu
Meng Hwa Er
Alex C. Kot
3DH
3DPC
276
110
0
04 Aug 2021
Temporal Alignment Prediction for Few-Shot Video Classification
Fei Pan
Chunlei Xu
Jie Guo
Yanwen Guo
AI4TS
171
2
0
26 Jul 2021
Spatio-Temporal Representation Factorization for Video-based Person Re-Identification
IEEE International Conference on Computer Vision (ICCV), 2021
Abhishek Aich
Meng Zheng
Srikrishna Karanam
Terrence Chen
Amit K. Roy-Chowdhury
Ziyan Wu
308
84
0
25 Jul 2021
Transcript to Video: Efficient Clip Sequencing from Texts
ACM Multimedia (ACM MM), 2021
Yu Xiong
Fabian Caba Heilbron
Dahua Lin
CLIP
228
13
0
25 Jul 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
IEEE International Conference on Multimedia and Expo (ICME), 2021
Hanxi Lin
Xinxiao Wu
Jiebo Luo
179
2
0
25 Jul 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
International Journal of Computer Vision (IJCV), 2021
Yuan Tian
Manwen Liao
Guangtao Zhai
G. Guo
Zhiyong Gao
179
53
0
22 Jul 2021
Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games
Alina Roitberg
David Schneider
Aulia Djamal
C. Seibold
Simon Reiß
Rainer Stiefelhagen
227
36
0
12 Jul 2021
Delta Sampling R-BERT for limited data and low-light action recognition
Sanchit Hira
Ritwik Das
Abhinav Modi
D. Pakhomov
203
19
0
12 Jul 2021
Aligning Correlation Information for Domain Adaptation in Action Recognition
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Yuecong Xu
Jianfei Yang
Haozhi Cao
K. Mao
Jianxiong Yin
Simon See
215
49
0
11 Jul 2021
Modality specific U-Net variants for biomedical image segmentation: A survey
Artificial Intelligence Review (AIR), 2021
Narinder Singh Punn
Sonali Agarwal
SSeg
379
198
0
09 Jul 2021
Video 3D Sampling for Self-supervised Representation Learning
Wei Li
Dezhao Luo
Bo Fang
Can Ma
Weiping Wang
118
8
0
08 Jul 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Joey Tianyi Zhou
VLM
195
34
0
06 Jul 2021
Inter-intra Variant Dual Representations forSelf-supervised Video Recognition
Lin Zhang
Qi She
Zhengyang Shen
Changhu Wang
SSL
220
9
0
02 Jul 2021
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis
Computer Vision and Pattern Recognition (CVPR), 2021
Xin Liu
Henglin Shi
Zhaodong Sun
Zitong Yu
Amritpal Singh
Guoying Zhao
225
109
0
01 Jul 2021
Attention Bottlenecks for Multimodal Fusion
Neural Information Processing Systems (NeurIPS), 2021
Arsha Nagrani
Shan Yang
Anurag Arnab
A. Jansen
Cordelia Schmid
Chen Sun
588
704
0
30 Jun 2021
When Video Classification Meets Incremental Classes
ACM Multimedia (ACM MM), 2021
Hanbin Zhao
Xin Qin
Shihao Su
Yongjian Fu
Zibo Lin
Xi Li
CLL
200
32
0
30 Jun 2021
Long-Short Temporal Modeling for Efficient Action Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Liyu Wu
Yuexian Zou
Can Zhang
106
1
0
30 Jun 2021
Unsupervised Discovery of Actions in Instructional Videos
British Machine Vision Conference (BMVC), 2021
A. Piergiovanni
A. Angelova
Michael S. Ryoo
Irfan Essa
171
5
0
28 Jun 2021
Hyperbolic Busemann Learning with Ideal Prototypes
Neural Information Processing Systems (NeurIPS), 2021
Mina Ghadimi Atigh
Martin Keller-Ressel
Pascal Mettes
236
51
0
28 Jun 2021
Can An Image Classifier Suffice For Action Recognition?
International Conference on Learning Representations (ICLR), 2021
Quanfu Fan
Chun-Fu Chen
Chen
Yikang Shen
ViT
286
37
0
26 Jun 2021
Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering
Long Hoang Dang
T. Le
Vuong Le
T. Tran
233
68
0
25 Jun 2021
Video Swin Transformer
Ze Liu
Jia Ning
Yue Cao
Yixuan Wei
Zheng Zhang
Stephen Lin
Han Hu
ViT
488
1,884
0
24 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
646
155
0
21 Jun 2021
All You Can Embed: Natural Language based Vehicle Retrieval with Spatio-Temporal Transformers
Carmelo Scribano
D. Sapienza
Giorgia Franchini
M. Verucchi
Marko Bertogna
140
6
0
18 Jun 2021
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
292
13
0
18 Jun 2021
Multi-Granularity Network with Modal Attention for Dense Affective Understanding
Baoming Yan
Lin Wang
Ke Gao
Bo Gao
Xiao-Chang Liu
Chao Ban
Jiang Yang
Xiaobo Li
VGen
96
2
0
18 Jun 2021
MaCLR: Motion-aware Contrastive Learning of Representations for Videos
European Conference on Computer Vision (ECCV), 2021
Fanyi Xiao
Joseph Tighe
Davide Modolo
SSL
194
19
0
17 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
294
11
0
12 Jun 2021
Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers
Neural Information Processing Systems (NeurIPS), 2021
Mandela Patrick
Dylan Campbell
Yuki M. Asano
Ishan Misra
Ishan Misra Florian Metze
Christoph Feichtenhofer
Andrea Vedaldi
João F. Henriques
293
342
0
09 Jun 2021
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation
Linjie Li
Jie Lei
Zhe Gan
Licheng Yu
Yen-Chun Chen
...
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
Lijuan Wang
Zicheng Liu
VLM
279
117
0
08 Jun 2021
Transformed ROIs for Capturing Visual Transformations in Videos
Computer Vision and Image Understanding (CVIU), 2021
Abhinav Rai
Fadime Sener
Angela Yao
ViT
234
4
0
06 Jun 2021
ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency
IEEE International Conference on Computer Vision (ICCV), 2021
Deng Huang
Wenhao Wu
Weiwen Hu
Xu Liu
Dongliang He
Zhihua Wu
Xiangmiao Wu
Ming Tan
Errui Ding
SSL
163
55
0
04 Jun 2021
CT-Net: Channel Tensorization Network for Video Classification
International Conference on Learning Representations (ICLR), 2021
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
169
65
0
03 Jun 2021
TSI: Temporal Saliency Integration for Video Action Recognition
Haisheng Su
Kunchang Li
Jinyuan Feng
Dongliang Wang
Weihao Gan
Wei Wu
Yu Qiao
198
4
0
02 Jun 2021
Connecting Language and Vision for Natural Language-Based Vehicle Retrieval
Shuai Bai
Zhedong Zheng
Xiaohan Wang
Junyang Lin
Zhu Zhang
Chang Zhou
Yi Yang
Hongxia Yang
239
30
0
31 May 2021
Multi-Modal Semantic Inconsistency Detection in Social Media News Posts
Conference on Multimedia Modeling (MMM), 2021
S. McCrae
Kehan Wang
A. Zakhor
148
16
0
26 May 2021
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning
ACM Multimedia (ACM MM), 2021
Wenhao Wu
Yuxiang Zhao
Yanwu Xu
Xiao Tan
Dongliang He
...
Jinxing Ye
Yingying Li
Mingde Yao
Zichao Dong
Yifeng Shi
AI4TS
219
33
0
25 May 2021
Temporal Action Proposal Generation with Transformers
Lining Wang
Haosen Yang
Wenhao Wu
Huanjin Yao
Hujie Huang
ViT
149
31
0
25 May 2021
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
Findings (Findings), 2021
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Prahal Arora
Masoumeh Aminzadeh
Christoph Feichtenhofer
Florian Metze
Luke Zettlemoyer
343
146
0
20 May 2021
MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Taojiannan Yang
Sijie Zhu
Matías Mendieta
Pu Wang
Ravikumar Balakrishnan
Minwoo Lee
T. Han
M. Shah
Chong Chen
3DH
OOD
258
25
0
14 May 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Computer Vision and Pattern Recognition (CVPR), 2021
Tianrui Hui
Shaofei Huang
Si Liu
Zihan Ding
Guanbin Li
Wenguan Wang
Jizhong Han
Haiwei Yang
208
58
0
14 May 2021
REGINA - Reasoning Graph Convolutional Networks in Human Action Recognition
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2021
Bruno Degardin
Vasco Lopes
Hugo Proencca
3DH
GNN
163
12
0
14 May 2021
Designing Multimodal Datasets for NLP Challenges
James Pustejovsky
E. Holderness
Jingxuan Tu
Parker Glenn
Kyeongmin Rim
Kelley Lynch
R. Brutti
204
5
0
12 May 2021
Temporal-Spatial Feature Pyramid for Video Saliency Detection
Qinyao Chang
Shiping Zhu
218
34
0
10 May 2021
Adaptive Focus for Efficient Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Yulin Wang
Zhaoxi Chen
Haojun Jiang
Shiji Song
Yizeng Han
Gao Huang
298
110
0
07 May 2021
Motion-Augmented Self-Training for Video Recognition at Smaller Scale
IEEE International Conference on Computer Vision (ICCV), 2021
Kirill Gavrilyuk
Mihir Jain
I. Karmanov
Cees G. M. Snoek
155
24
0
04 May 2021
Previous
1
2
3
...
8
9
10
...
12
13
14
Next
Page 9 of 14
Page
of 14
Go