ResearchTrend.AI
A Closer Look at Spatiotemporal Convolutions for Action Recognition

30 November 2017
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri

Papers citing "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

50 / 1,270 papers shown
Human Action Co-occurrence in Lifestyle Vlogs using Graph Link Prediction
Oana Ignat
Santiago Castro
Weiji Li
Rada Mihalcea
23
0
0
12 Sep 2023
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
Yue Xu
Yong-Lu Li
Zhemin Huang
Michael Xu Liu
Cewu Lu
Yu-Wing Tai
Chi-Keung Tang
EgoV
28
9
0
05 Sep 2023
Towards Contrastive Learning in Music Video Domain
Karel Veldkamp
Mariya Hendriksen
Zoltán Szlávik
Alexander Keijser
SSL
34
2
0
01 Sep 2023
Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning
Minghao Zhu
Xiao Lin
Ronghao Dang
Chengju Liu
Qi Chen
VGen
35
8
0
01 Sep 2023
STint: Self-supervised Temporal Interpolation for Geospatial Data
Nidhin Harilal
B. Hodge
Aneesh Subramanian
C. Monteleoni
48
1
0
31 Aug 2023
Evaluation of Key Spatiotemporal Learners for Print Track Anomaly Classification Using Melt Pool Image Streams
Lynn Cherif
Mutahar Safdar
Guy Lamouche
P. Wanjara
P. Paul
G. Wood
Max Zimmermann
F. Hannesen
Yao Zhao
26
1
0
28 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
36
20
0
27 Aug 2023
Improving Video Violence Recognition with Human Interaction Learning on 3D Skeleton Point Clouds
Yukun Su
Guosheng Lin
Qingyao Wu
3DH
3DPC
29
3
0
26 Aug 2023
EventTransAct: A video transformer-based framework for Event-camera based action recognition
Tristan de Blegiers
I. Dave
Adeel Yousaf
M. Shah
ViT
38
9
0
25 Aug 2023
Prompting Visual-Language Models for Dynamic Facial Expression Recognition
Zengqun Zhao
Ioannis Patras
VLM
13
33
0
25 Aug 2023
Spherical Vision Transformer for 360-degree Video Saliency Prediction
Mert Cokelek
Nevrez Imamoglu
C. Ozcinar
Erkut Erdem
Aykut Erdem
MDE
21
3
0
24 Aug 2023
Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
Xi Li
Songhe Wang
Rui Huang
Mahanth K. Gowda
G. Kesidis
AAML
41
6
0
21 Aug 2023
UnLoc: A Unified Framework for Video Localization Tasks
Shengjia Yan
Xuehan Xiong
Arsha Nagrani
Anurag Arnab
Zhonghao Wang
Weina Ge
David A. Ross
Cordelia Schmid
33
53
0
21 Aug 2023
Joint learning of images and videos with a single Vision Transformer
Shuki Shimizu
Toru Tamaki
ViT
19
0
0
21 Aug 2023
Towards Real-World Visual Tracking with Temporal Contexts
Ziang Cao
Ziyuan Huang
Liang Pan
Shiwei Zhang
Ziwei Liu
Changhong Fu
39
42
0
20 Aug 2023
Breast Lesion Diagnosis Using Static Images and Dynamic Video
Yunwen Huang
Hongyu Hu
Ying Zhu
Yi Xu
11
2
0
19 Aug 2023
Spatial-Temporal Alignment Network for Action Recognition
Jinhui Ye
Junwei Liang
3DPC
29
1
0
19 Aug 2023
Long-range Multimodal Pretraining for Movie Understanding
Dawit Mureja Argaw
Joon-Young Lee
Markus Woodson
In So Kweon
Fabian Caba Heilbron
VLM
30
7
0
18 Aug 2023
Learnt Contrastive Concept Embeddings for Sign Recognition
Ryan Wong
Necati Cihan Camgöz
Richard Bowden
29
5
0
18 Aug 2023
Unlimited Knowledge Distillation for Action Recognition in the Dark
Ruibing Jin
Guosheng Lin
Min-man Wu
Jie Lin
Zhengguo Li
Xiaoli Li
Zhenghua Chen
16
1
0
18 Aug 2023
Audio-Visual Glance Network for Efficient Video Recognition
Muhammad Adi Nugroho
Sangmin Woo
Sumin Lee
Changick Kim
19
5
0
18 Aug 2023
OnUVS: Online Feature Decoupling Framework for High-Fidelity Ultrasound Video Synthesis
Hangyu Zhou
Dong Ni
Ao Chang
Xinrui Zhou
Rusi Chen
...
Yuhao Huang
Tong Han
Zhe-Yu Liu
Deng-Ping Fan
Xin Yang
20
1
0
16 Aug 2023
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition
Yan Zhu
Junbao Zhuo
B. Ma
Jiajia Geng
Xiaoming Wei
Xiaolin K. Wei
Shuhui Wang
VLM
33
5
0
14 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
43
9
0
10 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
24
7
0
09 Aug 2023
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning
Qianqian Wang
Junlong Du
Ke Yan
Shouhong Ding
VLM
38
17
0
09 Aug 2023
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction
Izzeddin Teeti
Rongali Sai Bhargav
Vivek Singh
Andrew Bradley
Biplab Banerjee
Fabio Cuzzolin
19
1
0
08 Aug 2023
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
Shuangrui Ding
Peisen Zhao
Xiaopeng Zhang
Rui Qian
H. Xiong
Qi Tian
ViT
29
16
0
08 Aug 2023
ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition
S. Chaudhuri
Saumik Bhattacharya
27
3
0
07 Aug 2023
Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment
Kun Yuan
Zishang Kong
Chuanchuan Zheng
Ming-Ting Sun
Xingsen Wen
ViT
32
14
0
31 Jul 2023
DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation
Yue Zhang
Hehe Fan
Yi Yang
Mohan S. Kankanhalli
3DPC
18
1
0
31 Jul 2023
Weakly Supervised AI for Efficient Analysis of 3D Pathology Samples
Andrew H. Song
Mane Williams
Drew F. K. Williamson
Guillaume Jaume
Andrew Zhang
...
R. Serafin
Jonathan T. C. Liu
Alexander S. Baras
Anil V. Parwani
Faisal Mahmood
17
4
0
27 Jul 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
45
7
0
27 Jul 2023
Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Xin Yuan
Linjie Li
Jianfeng Wang
Zhengyuan Yang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
DiffM
65
6
0
27 Jul 2023
ProtoASNet: Dynamic Prototypes for Inherently Interpretable and Uncertainty-Aware Aortic Stenosis Classification in Echocardiography
H. Vaseli
A. Gu
Ahmadi Amiri
Michael Y. Tsang
A. Fung
Nima Kondori
Armin Saadat
Purang Abolmaesumi
T. Tsang
36
12
0
26 Jul 2023
Multi-Modal Machine Learning for Assessing Gaming Skills in Online Streaming: A Case Study with CS:GO
Longxiang Zhang
Wenping Wang
51
1
0
23 Jul 2023
In Defense of Clip-based Video Relation Detection
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
44
5
0
18 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
40
8
0
18 Jul 2023
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
Hongfei Yan
Yang Liu
Yushen Wei
Zerui Li
Guanbin Li
Liang Lin
34
40
0
17 Jul 2023
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
Fahad Shahbaz Khan
ViT
54
19
0
13 Jul 2023
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition
Guoying Zhao
Zheng Lian
B. Liu
Jianhua Tao
37
17
0
05 Jul 2023
Multimodal Imbalance-Aware Gradient Modulation for Weakly-supervised Audio-Visual Video Parsing
Jie Fu
Junyu Gao
Changsheng Xu
34
6
0
05 Jul 2023
Streaming egocentric action anticipation: An evaluation scheme and approach
Antonino Furnari
G. Farinella
EgoV
24
3
0
29 Jun 2023
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
26
0
0
23 Jun 2023
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition
Yiting Dong
Yang Li
Dongcheng Zhao
Guobin Shen
Yi Zeng
36
12
0
20 Jun 2023
Learning Space-Time Semantic Correspondences
Du Tran
Jitendra Malik
27
0
0
16 Jun 2023
Towards Balanced Active Learning for Multimodal Classification
Meng Shen
Yizheng Huang
Jianxiong Yin
Heqing Zou
D. Rajan
Simon See
27
5
0
14 Jun 2023
Enhanced Multimodal Representation Learning with Cross-modal KD
Mengxi Chen
Linyu Xing
Yu Wang
Ya Zhang
34
11
0
13 Jun 2023
Boosting Breast Ultrasound Video Classification by the Guidance of Keyframe Feature Centers
AnLan Sun
Zhao Zhang
Meng Lei
Yuting Dai
Dong Wang
Liwei Wang
34
5
0
12 Jun 2023
A Large-Scale Analysis on Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
28
3
0
09 Jun 2023