v1v2 (latest)

Two-Stream Convolutional Networks for Action Recognition in Videos

Neural Information Processing Systems (NeurIPS), 2014

9 June 2014

Karen Simonyan

Andrew Zisserman

ArXiv (abs)PDF HTML

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,340 papers shown

Beyond still images: Temporal features and input variance resilienceScientific Reports (Sci Rep), 2023

AmirHosein Fadaei

M. Dehaqani

270

01 Nov 2023

1DFormer: a Transformer Architecture Learning 1D Landmark Representations for Facial Landmark TrackingInternational Joint Conference on Artificial Intelligence (IJCAI), 2023

Bing Yin

151

01 Nov 2023

CHAMMI: A benchmark for channel-adaptive models in microscopy imagingNeural Information Processing Systems (NeurIPS), 2023

184

30 Oct 2023

On the Relevance of Temporal Features for Medical Ultrasound Video RecognitionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023

D. H. Smith

J. P. Lineberger

G. H. Baker

169

16 Oct 2023

Watt For What: Rethinking Deep Learning's Energy-Performance Relationship

Shashank Narayana Gowda

HAI

179

10 Oct 2023

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic AlignmentInternational Conference on Learning Representations (ICLR), 2023

Bin Lin

...

Wei Liu

740

336

03 Oct 2023

Telling Stories for Common Sense Zero-Shot Action RecognitionAsian Conference on Computer Vision (ACCV), 2023

Shreyank N. Gowda

Carolina Scarton

LM&Ro

195

29 Sep 2023

A Survey on Deep Learning Techniques for Action Anticipation

300

29 Sep 2023

Training a Large Video Model on a Single Machine in a Day

Yue Zhao

Philipp Krahenbuhl

VLM

273

28 Sep 2023

Local Compressed Video Stream Learning for Generic Event Boundary DetectionInternational Journal of Computer Vision (IJCV), 2023

228

27 Sep 2023

CPR-Coach: Recognizing Composite Error Actions based on Single-class TrainingComputer Vision and Pattern Recognition (CVPR), 2023

Dingkang Yang

Xiao Zhao

Peng Zhai

Lihua Zhang

345

21 Sep 2023

Exploring Self-supervised Skeleton-based Action Recognition in Occluded Environments

Kailun Yang

318

21 Sep 2023

SkeleTR: Towrads Skeleton-based Action Recognition in the Wild

245

20 Sep 2023

Selective Volume Mixup for Video Action Recognition

Tao Mei

212

18 Sep 2023

Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool

Josiah W. Smith

Murat Torlak

187

16 Sep 2023

Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer LearningIEEE International Conference on Computer Vision (ICCV), 2023

216

14 Sep 2023

Judging a video by its bitstream coverData Compression Conference (DCC), 2023

117

14 Sep 2023

TransNet: A Transfer Learning-Based Network for Human Action RecognitionInternational Conference on Machine Learning and Applications (ICMLA), 2023

Khaled Alomar

Xiaohao Cai

236

13 Sep 2023

Enhancing multimodal cooperation via sample-level modality valuationComputer Vision and Pattern Recognition (CVPR), 2023

468

12 Sep 2023

EgoPCA: A New Framework for Egocentric Hand-Object Interaction UnderstandingIEEE International Conference on Computer Vision (ICCV), 2023

175

05 Sep 2023

SOAR: Scene-debiasing Open-set Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2023

Gang Hua

316

03 Sep 2023

Towards Contrastive Learning in Music Video Domain

210

01 Sep 2023

Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph ReasoningACM Multimedia (ACM MM), 2023

Zheng Wang

203

29 Aug 2023

Evaluation of Key Spatiotemporal Learners for Print Track Anomaly Classification Using Melt Pool Image StreamsIFAC-PapersOnLine (IFAC-PapersOnLine), 2023

103

28 Aug 2023

LAC: Latent Action Composition for Skeleton-based Action SegmentationIEEE International Conference on Computer Vision (ICCV), 2023

538

28 Aug 2023

Improving Video Violence Recognition with Human Interaction Learning on 3D Skeleton Point Clouds

Qingxin Xiao

Guosheng Lin

Qingyao Wu

3DH 3DPC

196

26 Aug 2023

TriGait: Aligning and Fusing Skeleton and Silhouette Gait Data via a Tri-Branch Network

271

25 Aug 2023

AccFlow: Backward Accumulation for Long-Range Optical FlowIEEE International Conference on Computer Vision (ICCV), 2023

Xiaohong Liu

Guangtao Zhai

155

25 Aug 2023

MOFO: MOtion FOcused Self-Supervision for Video Understanding

Mona Ahmadian

Frank Guerin

Andrew Gilbert

307

23 Aug 2023

Sign Language Translation with Iterative PrototypeIEEE International Conference on Computer Vision (ICCV), 2023

Hao Feng

23 Aug 2023

Temporal-Distributed Backdoor Attack Against Video Based Action RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2023

380

21 Aug 2023

MGMAE: Motion Guided Masking for Video Masked AutoencodingIEEE International Conference on Computer Vision (ICCV), 2023

Yu Qiao

151

21 Aug 2023

Visual Crowd Analysis: Open Research ProblemsThe AI Magazine (AI Mag.), 2023

287

21 Aug 2023

Joint learning of images and videos with a single Vision Transformer

Shuki Shimizu

Toru Tamaki

ViT

181

21 Aug 2023

Spatial-Temporal Alignment Network for Action Recognition

Jinhui Ye

Junwei Liang

3DPC

161

19 Aug 2023

Unlimited Knowledge Distillation for Action Recognition in the Dark

Guosheng Lin

155

18 Aug 2023

Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud VideosIEEE International Conference on Computer Vision (ICCV), 2023

Hehe Fan

149

18 Aug 2023

Event-Guided Procedure Planning from Instructional Videos with Text SupervisionIEEE International Conference on Computer Vision (ICCV), 2023

Jingke Meng

144

17 Aug 2023

FOLT: Fast Multiple Object Tracking from UAV-captured Videos Based on Optical FlowACM Multimedia (ACM MM), 2023

Jinlong Peng

144

14 Aug 2023

Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention

Liang Shang

297

10 Aug 2023

Temporally-Adaptive Models for Efficient Video Understanding

Ziwei Liu

205

10 Aug 2023

JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition

257

09 Aug 2023

View while Moving: Efficient Video Recognition in Long-untrimmed VideosACM Multimedia (ACM MM), 2023

Lanshan Zhang

Yang Liu

261

09 Aug 2023

ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action RecognitionIndian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2023

S. Chaudhuri

Saumik Bhattacharya

175

07 Aug 2023

Incorporating Pre-training Data Matters in Unsupervised Domain AdaptationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

313

06 Aug 2023

SkateboardAI: The Coolest Video Action Recognition for SkateboardingAAAI Conference on Artificial Intelligence (AAAI), 2023

Hanxiao Chen

ViT

118

02 Aug 2023

Sample Less, Learn More: Efficient Action Recognition via Frame Feature RestorationACM Multimedia (ACM MM), 2023

220

27 Jul 2023

Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding EmotionProceedings of the IEEE (Proc. IEEE), 2023

335

25 Jul 2023

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question AnsweringIEEE transactions on multimedia (IEEE TMM), 2023

Hehe Fan

211

25 Jul 2023

On the Connection between Pre-training Data Diversity and Fine-tuning RobustnessNeural Information Processing Systems (NeurIPS), 2023

Thao Nguyen

190

24 Jul 2023