ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.11248
  4. Cited By
A Closer Look at Spatiotemporal Convolutions for Action Recognition

A Closer Look at Spatiotemporal Convolutions for Action Recognition

30 November 2017
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
ArXivPDFHTML

Papers citing "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

50 / 1,270 papers shown
Title
Rethinking Resolution in the Context of Efficient Video Recognition
Rethinking Resolution in the Context of Efficient Video Recognition
Chuofan Ma
Qiushan Guo
Yi-Xin Jiang
Zehuan Yuan
Ping Luo
Xiaojuan Qi
68
12
0
26 Sep 2022
Multi-dataset Training of Transformers for Robust Action Recognition
Multi-dataset Training of Transformers for Robust Action Recognition
Junwei Liang
Enwei Zhang
Jun Zhang
Chunhua Shen
ViT
45
11
0
26 Sep 2022
Leveraging Self-Supervised Training for Unintentional Action Recognition
Leveraging Self-Supervised Training for Unintentional Action Recognition
Enea Duka
Anna Kukleva
Bernt Schiele
38
1
0
23 Sep 2022
FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial
  Video Classification
FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification
P. Jin
Lichao Mou
Yuansheng Hua
Gui-Song Xia
Xiao Xiang Zhu
AI4TS
26
8
0
22 Sep 2022
Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure
  based FoG Detection
Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure based FoG Detection
Kun Hu
Shaohui Mei
Wei Wang
K. E. Martens
Liang Wang
S. Lewis
D. Feng
Zhiyong Wang
37
5
0
22 Sep 2022
Adaptive Local-Component-aware Graph Convolutional Network for One-shot
  Skeleton-based Action Recognition
Adaptive Local-Component-aware Graph Convolutional Network for One-shot Skeleton-based Action Recognition
Anqi Zhu
Qiuhong Ke
Mingming Gong
James Bailey
49
21
0
21 Sep 2022
Mitigating Representation Bias in Action Recognition: Algorithms and
  Benchmarks
Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks
Haodong Duan
Yue Zhao
Kai-xiang Chen
Yu Xiong
Dahua Lin
13
7
0
20 Sep 2022
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space
  Using Joint Cross-Attention
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
56
31
0
19 Sep 2022
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior
  Understanding in the Industrial-like Domain
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain
Francesco Ragusa
Antonino Furnari
G. Farinella
EgoV
46
24
0
19 Sep 2022
SDFE-LV: A Large-Scale, Multi-Source, and Unconstrained Database for
  Spotting Dynamic Facial Expressions in Long Videos
SDFE-LV: A Large-Scale, Multi-Source, and Unconstrained Database for Spotting Dynamic Facial Expressions in Long Videos
Xiaolin Xu
Yuan Zong
Wenming Zheng
Yang Li
Chuangao Tang
Xingxun Jiang
Haolin Jiang
CVBM
43
1
0
18 Sep 2022
On the Surprising Effectiveness of Transformers in Low-Labeled Video
  Recognition
On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition
Farrukh Rahman
Ömer Mubarek
Z. Kira
ViT
18
2
0
15 Sep 2022
Moving from 2D to 3D: volumetric medical image classification for rectal
  cancer staging
Moving from 2D to 3D: volumetric medical image classification for rectal cancer staging
Joohyun Lee
J. Oh
Inkyu Shin
You-sung Kim
D. Sohn
Tae-Sung Kim
In So Kweon
MedIm
29
4
0
13 Sep 2022
Action-based Early Autism Diagnosis Using Contrastive Feature Learning
Action-based Early Autism Diagnosis Using Contrastive Feature Learning
Asha Rani
Pankaj Yadav
Yashaswi Verma
24
3
0
12 Sep 2022
EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from
  Spatiotemporal Echocardiography
EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography
Rand Muhtaseb
Mohammad Yaqub
ViT
27
24
0
09 Sep 2022
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action
  Recognition
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition
Tianjiao Li
Lin Geng Foo
Qiuhong Ke
Hossein Rahmani
Anran Wang
Jinghua Wang
Jun Liu
19
21
0
03 Sep 2022
A Novel Self-Knowledge Distillation Approach with Siamese Representation
  Learning for Action Recognition
A Novel Self-Knowledge Distillation Approach with Siamese Representation Learning for Action Recognition
Duc-Quang Vu
T. Phung
Jia-Ching Wang
27
9
0
03 Sep 2022
Temporal Contrastive Learning with Curriculum
Temporal Contrastive Learning with Curriculum
Shuvendu Roy
Ali Etemad
43
3
0
02 Sep 2022
EchoGNN: Explainable Ejection Fraction Estimation with Graph Neural
  Networks
EchoGNN: Explainable Ejection Fraction Estimation with Graph Neural Networks
Masoud Mokhtari
Teresa S. M. Tsang
Purang Abolmaesumi
Renjie Liao
13
18
0
30 Aug 2022
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Jou-An Chen
Wei Niu
Bin Ren
Yanzhi Wang
Xipeng Shen
23
24
0
29 Aug 2022
Video Mobile-Former: Video Recognition with Efficient Global
  Spatial-temporal Modeling
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Rui Wang
Zuxuan Wu
Dongdong Chen
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Luowei Zhou
Lu Yuan
Yu-Gang Jiang
ViT
43
4
0
25 Aug 2022
Enabling Weakly-Supervised Temporal Action Localization from On-Device
  Learning of the Video Stream
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream
Yue Tang
Yawen Wu
Peipei Zhou
Jingtong Hu
16
2
0
25 Aug 2022
Adaptive Perception Transformer for Temporal Action Localization
Adaptive Perception Transformer for Temporal Action Localization
Yizheng Ouyang
Tianjin Zhang
Weibo Gu
Hongfa Wang
21
3
0
25 Aug 2022
Lane Change Classification and Prediction with Action Recognition
  Networks
Lane Change Classification and Prediction with Action Recognition Networks
Kai-Bin Liang
Jun Wang
A. Bhalerao
16
2
0
24 Aug 2022
Modality Mixer for Multi-modal Action Recognition
Modality Mixer for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Yeonju Park
Muhammad Adi Nugroho
Changick Kim
26
10
0
24 Aug 2022
Hierarchical Compositional Representations for Few-shot Action
  Recognition
Hierarchical Compositional Representations for Few-shot Action Recognition
Chang-bo Li
Jie Zhang
Shuzhe Wu
Xin Jin
Shiguang Shan
30
20
0
19 Aug 2022
Intensity-Aware Loss for Dynamic Facial Expression Recognition in the
  Wild
Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild
Hanting Li
Hongjing Niu
Zhaoqing Zhu
Feng Zhao
CVBM
36
55
0
19 Aug 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate
  Semantic Attention Refinement
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
16
34
0
18 Aug 2022
Self-Contained Entity Discovery from Captioned Videos
Self-Contained Entity Discovery from Captioned Videos
M. Ayoughi
P. Mettes
Paul T. Groth
28
2
0
13 Aug 2022
Motion Sensitive Contrastive Learning for Self-supervised Video
  Representation
Motion Sensitive Contrastive Learning for Self-supervised Video Representation
Jingcheng Ni
Nana Zhou
Jie Qin
Qianrun Wu
Junqi Liu
Boxun Li
Di Huang
SSL
42
16
0
12 Aug 2022
Seeing your sleep stage: cross-modal distillation from EEG to infrared
  video
Seeing your sleep stage: cross-modal distillation from EEG to infrared video
Jianan Han
Shenmin Zhang
Aidong Men
Yang Liu
Z. Yao
Yan-Tao Yan
Qingchao Chen
33
4
0
11 Aug 2022
Frozen CLIP Models are Efficient Video Learners
Frozen CLIP Models are Efficient Video Learners
Ziyi Lin
Shijie Geng
Renrui Zhang
Peng Gao
Gerard de Melo
Xiaogang Wang
Jifeng Dai
Yu Qiao
Hongsheng Li
CLIP
VLM
16
200
0
06 Aug 2022
Blockwise Temporal-Spatial Pathway Network
Blockwise Temporal-Spatial Pathway Network
SeulGi Hong
Min-Kook Choi
26
1
0
05 Aug 2022
Expanding Language-Image Pretrained Models for General Video Recognition
Expanding Language-Image Pretrained Models for General Video Recognition
Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
VLM
CLIP
ViT
40
313
0
04 Aug 2022
Surgical Skill Assessment via Video Semantic Aggregation
Surgical Skill Assessment via Video Semantic Aggregation
Zhenqiang Li
Lin Gu
Weimin Wang
Ryosuke Nakamura
Yoichi Sato
28
13
0
04 Aug 2022
Multimodal Generation of Novel Action Appearances for Synthetic-to-Real
  Recognition of Activities of Daily Living
Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living
Zdravko Marinov
David Schneider
Alina Roitberg
Rainer Stiefelhagen
VGen
32
2
0
03 Aug 2022
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action
  Recognition
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition
M. C. Leong
Haosong Zhang
Huibin Tan
Liyuan Li
J. Lim
ViT
39
8
0
03 Aug 2022
Two-Stream Transformer Architecture for Long Video Understanding
Two-Stream Transformer Architecture for Long Video Understanding
Edward Fish
Jon Weinbren
Andrew Gilbert
ViT
33
6
0
02 Aug 2022
Video Question Answering with Iterative Video-Text Co-Tokenization
Video Question Answering with Iterative Video-Text Co-Tokenization
A. Piergiovanni
K. Morton
Weicheng Kuo
Michael S. Ryoo
A. Angelova
34
18
0
01 Aug 2022
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training
  Framework for Temporal Grounding
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding
Jiachang Hao
Haifeng Sun
Pengfei Ren
Jingyu Wang
Q. Qi
J. Liao
31
26
0
29 Jul 2022
Adaptive occlusion sensitivity analysis for visually explaining video
  recognition networks
Adaptive occlusion sensitivity analysis for visually explaining video recognition networks
Tomoki Uchiyama
Naoya Sogi
S. Iizuka
Koichiro Niinuma
Kazuhiro Fukui
24
2
0
26 Jul 2022
Compositional Human-Scene Interaction Synthesis with Semantic Control
Compositional Human-Scene Interaction Synthesis with Semantic Control
Kaifeng Zhao
Shaofei Wang
Yan Zhang
Thabo Beeler
Siyu Tang
30
65
0
26 Jul 2022
Bodily Behaviors in Social Interaction: Novel Annotations and
  State-of-the-Art Evaluation
Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation
Michal Balazia
Philippe Muller
Ákos Levente Tánczos
A. V. Liechtenstein
Franccois Brémond
17
22
0
26 Jul 2022
Static and Dynamic Concepts for Self-supervised Video Representation
  Learning
Static and Dynamic Concepts for Self-supervised Video Representation Learning
Rui Qian
Shuangrui Ding
Xian Liu
Dahua Lin
SSL
36
22
0
26 Jul 2022
Graph Neural Network and Spatiotemporal Transformer Attention for 3D
  Video Object Detection from Point Clouds
Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds
Junbo Yin
Jianbing Shen
Xin Gao
David J. Crandall
Ruigang Yang
3DPC
ViT
38
59
0
26 Jul 2022
CODiT: Conformal Out-of-Distribution Detection in Time-Series Data
CODiT: Conformal Out-of-Distribution Detection in Time-Series Data
R. Kaur
Kaustubh Sridhar
Sangdon Park
Susmit Jha
Anirban Roy
O. Sokolsky
Insup Lee
OODD
AI4TS
144
1
0
24 Jul 2022
MAR: Masked Autoencoders for Efficient Action Recognition
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
32
42
0
24 Jul 2022
Self-supervised contrastive learning of echocardiogram videos enables
  label-efficient cardiac disease diagnosis
Self-supervised contrastive learning of echocardiogram videos enables label-efficient cardiac disease diagnosis
G. Holste
Evangelos K. Oikonomou
Bobak J. Mortazavi
Zhangyang Wang
Rohan Khera
27
9
0
23 Jul 2022
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised
  Contrastive Loss
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss
Riccardo Franceschini
Enrico Fini
Cigdem Beyan
Alessandro Conti
F. Arrigoni
Elisa Ricci
SSL
OffRL
34
16
0
23 Jul 2022
Inductive and Transductive Few-Shot Video Classification via Appearance
  and Temporal Alignments
Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments
Khoi Duc Minh Nguyen
Quoc-Huy Tran
Khoi Nguyen
Binh-Son Hua
Rang Nguyen
28
29
0
21 Jul 2022
Sequence Models for Drone vs Bird Classification
Sequence Models for Drone vs Bird Classification
Fatih Çagatay Akyön
Erdem Akagündüz
S. Altinuc
A. Temi̇zel
21
1
0
21 Jul 2022
Previous
123...91011...242526
Next