ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2199
  4. Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos

Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman
ArXivPDFHTML

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,275 papers shown
Title
Video 3D Sampling for Self-supervised Representation Learning
Video 3D Sampling for Self-supervised Representation Learning
Wei Li
Dezhao Luo
Bo Fang
Yu Zhou
Weiping Wang
28
6
0
08 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
59
95
0
01 Jul 2021
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding
  and Emotion Analysis
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis
Xin Liu
Henglin Shi
Haoyu Chen
Zitong Yu
Xiaobai Li
Guoying Zhao
21
80
0
01 Jul 2021
CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation
CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation
Ankit Singh
36
108
0
30 Jun 2021
When Video Classification Meets Incremental Classes
When Video Classification Meets Incremental Classes
Hanbin Zhao
Xin Qin
Shihao Su
Yongjian Fu
Zibo Lin
Xi Li
CLL
27
28
0
30 Jun 2021
Long-Short Temporal Modeling for Efficient Action Recognition
Long-Short Temporal Modeling for Efficient Action Recognition
Liyu Wu
Yuexian Zou
Can Zhang
21
1
0
30 Jun 2021
Interflow: Aggregating Multi-layer Feature Mappings with Attention
  Mechanism
Interflow: Aggregating Multi-layer Feature Mappings with Attention Mechanism
Zhicheng Cai
16
1
0
26 Jun 2021
Exploring Temporal Context and Human Movement Dynamics for Online Action
  Detection in Videos
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos
V. Vasileiou
N. Kardaris
Petros Maragos
20
2
0
26 Jun 2021
Transfer Learning of Deep Spatiotemporal Networks to Model Arbitrarily
  Long Videos of Seizures
Transfer Learning of Deep Spatiotemporal Networks to Model Arbitrarily Long Videos of Seizures
Fernando Pérez-García
C. Scott
Rachel Sparks
B. Diehl
Sébastien Ourselin
SLR
14
17
0
22 Jun 2021
Towards Long-Form Video Understanding
Towards Long-Form Video Understanding
Chaoxia Wu
Philipp Krahenbuhl
VLM
ViT
59
166
0
21 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
37
127
0
21 Jun 2021
Does Optimal Source Task Performance Imply Optimal Pre-training for a
  Target Task?
Does Optimal Source Task Performance Imply Optimal Pre-training for a Target Task?
Steven Gutstein
Brent Lance
Sanjay Shakkottai
27
1
0
21 Jun 2021
OadTR: Online Action Detection with Transformers
OadTR: Online Action Detection with Transformers
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Zhe Zuo
Changxin Gao
Nong Sang
OffRL
ViT
34
110
0
21 Jun 2021
Improving Ultrasound Tongue Image Reconstruction from Lip Images Using
  Self-supervised Learning and Attention Mechanism
Improving Ultrasound Tongue Image Reconstruction from Lip Images Using Self-supervised Learning and Attention Mechanism
Haiyang Liu
Jihang Zhang
21
4
0
20 Jun 2021
Self-supervised Video Representation Learning with Cross-Stream
  Prototypical Contrasting
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
40
11
0
18 Jun 2021
Long-Short Temporal Contrastive Learning of Video Transformers
Long-Short Temporal Contrastive Learning of Video Transformers
Jue Wang
Gedas Bertasius
Du Tran
Lorenzo Torresani
VLM
ViT
35
50
0
17 Jun 2021
mPyPl: Python Monadic Pipeline Library for Complex Functional Data
  Processing
mPyPl: Python Monadic Pipeline Library for Complex Functional Data Processing
Dmitry Soshnikov
Yana Valieva
AI4CE
6
0
0
16 Jun 2021
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group
  and Activity Detection
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection
Mahsa Ehsanpour
F. Saleh
Silvio Savarese
Ian Reid
Hamid Rezatofighi
30
42
0
16 Jun 2021
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling
Mateusz Malinowski
Dimitrios Vytiniotis
G. Swirszcz
Viorica Patraucean
João Carreira
30
8
0
15 Jun 2021
Influential Rank: A New Perspective of Post-training for Robust Model
  against Noisy Labels
Influential Rank: A New Perspective of Post-training for Robust Model against Noisy Labels
Seulki Park
Hwanjun Song
Daeho Um
D. Jo
Sangdoo Yun
J. Choi
NoLa
37
0
0
14 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
27
11
0
12 Jun 2021
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding
  Evaluation
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation
Linjie Li
Jie Lei
Zhe Gan
Licheng Yu
Yen-Chun Chen
...
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
Lijuan Wang
Zicheng Liu
VLM
32
100
0
08 Jun 2021
Learning by Distillation: A Self-Supervised Learning Framework for
  Optical Flow Estimation
Learning by Distillation: A Self-Supervised Learning Framework for Optical Flow Estimation
Pengpeng Liu
M. Lyu
Irwin King
Jia Xu
21
7
0
08 Jun 2021
White Paper Assistance: A Step Forward Beyond the Shortcut Learning
White Paper Assistance: A Step Forward Beyond the Shortcut Learning
Xuan Cheng
Tianshu Xie
Xiaomin Wang
Jiali Deng
Minghui Liu
Meilin Liu
AAML
21
0
0
08 Jun 2021
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker
  Detection in the Wild
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild
Okan Kopuklu
Maja Taseska
Gerhard Rigoll
3DV
34
45
0
07 Jun 2021
Video Imprint
Video Imprint
Zhanning Gao
Le Wang
Nebojsa Jojic
Zhenxing Niu
N. Zheng
G. Hua
32
5
0
07 Jun 2021
Anticipative Video Transformer
Anticipative Video Transformer
Rohit Girdhar
Kristen Grauman
ViT
27
209
0
03 Jun 2021
Cross-Domain First Person Audio-Visual Action Recognition through
  Relative Norm Alignment
Cross-Domain First Person Audio-Visual Action Recognition through Relative Norm Alignment
M. Planamente
Chiara Plizzari
Emanuele Alberti
Barbara Caputo
EgoV
22
12
0
03 Jun 2021
CT-Net: Channel Tensorization Network for Video Classification
CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
30
55
0
03 Jun 2021
TSI: Temporal Saliency Integration for Video Action Recognition
TSI: Temporal Saliency Integration for Video Action Recognition
Haisheng Su
Kunchang Li
Jinyuan Feng
Dongliang Wang
Weihao Gan
Wei Wu
Yu Qiao
29
4
0
02 Jun 2021
Connecting Language and Vision for Natural Language-Based Vehicle
  Retrieval
Connecting Language and Vision for Natural Language-Based Vehicle Retrieval
Shuai Bai
Zhedong Zheng
Xiaohan Wang
Junyang Lin
Zhu Zhang
Chang Zhou
Yi Yang
Hongxia Yang
26
27
0
31 May 2021
A Study On the Effects of Pre-processing On Spatio-temporal Action
  Recognition Using Spiking Neural Networks Trained with STDP
A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP
Mireille el Assal
Pierre Tirilly
Ioan Marius Bilasco
6
5
0
31 May 2021
Transferable Sparse Adversarial Attack
Transferable Sparse Adversarial Attack
Ziwen He
Wei Wang
Jing Dong
Tieniu Tan
AAML
19
20
0
31 May 2021
Unsupervised detection of mouse behavioural anomalies using two-stream
  convolutional autoencoders
Unsupervised detection of mouse behavioural anomalies using two-stream convolutional autoencoders
Ezechukwu I. Nwokedi
R. Bains
L. Bidaut
S. Wells
Xujiong Ye
James M. Brown
24
2
0
28 May 2021
Tracking Without Re-recognition in Humans and Machines
Tracking Without Re-recognition in Humans and Machines
Drew Linsley
Girik Malik
Junkyung Kim
L. Govindarajan
E. Mingolla
Thomas Serre
31
18
0
27 May 2021
Improving Sign Language Translation with Monolingual Data by Sign
  Back-Translation
Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Hao Zhou
Wen-gang Zhou
Weizhen Qi
Junfu Pu
Houqiang Li
SLR
35
182
0
26 May 2021
DSANet: Dynamic Segment Aggregation Network for Video-Level
  Representation Learning
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning
Wenhao Wu
Yuxiang Zhao
Yanwu Xu
Xiao Tan
Dongliang He
...
Jinxing Ye
Yingying Li
Mingde Yao
Zichao Dong
Yifeng Shi
AI4TS
30
27
0
25 May 2021
Temporal Action Proposal Generation with Transformers
Temporal Action Proposal Generation with Transformers
Lining Wang
Haosen Yang
Wenhao Wu
Huanjin Yao
Hujie Huang
ViT
38
27
0
25 May 2021
FineAction: A Fine-Grained Video Dataset for Temporal Action
  Localization
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
Yi Liu
Limin Wang
Yali Wang
Xiao Ma
Yu Qiao
24
56
0
24 May 2021
Coarse to Fine Multi-Resolution Temporal Convolutional Network
Coarse to Fine Multi-Resolution Temporal Convolutional Network
Dipika Singhania
R. Rahaman
Angela Yao
AI4TS
27
55
0
23 May 2021
An Attractor-Guided Neural Networks for Skeleton-Based Human Motion
  Prediction
An Attractor-Guided Neural Networks for Skeleton-Based Human Motion Prediction
Pengxiang Ding
Junying Wang
Jianqin Yin
3DH
39
0
0
20 May 2021
Anabranch Network for Camouflaged Object Segmentation
Anabranch Network for Camouflaged Object Segmentation
Trung-Nghia Le
Tam V. Nguyen
Zhongliang Nie
M. Tran
Akihiro Sugimoto
27
478
0
20 May 2021
NExT-QA:Next Phase of Question-Answering to Explaining Temporal Actions
NExT-QA:Next Phase of Question-Answering to Explaining Temporal Actions
Junbin Xiao
Xindi Shang
Angela Yao
Tat-Seng Chua
45
448
0
18 May 2021
VPN++: Rethinking Video-Pose embeddings for understanding Activities of
  Daily Living
VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living
Srijan Das
Rui Dai
Di Yang
Francois Bremond
ViT
48
67
0
17 May 2021
Leveraging Semantic Scene Characteristics and Multi-Stream Convolutional
  Architectures in a Contextual Approach for Video-Based Visual Emotion
  Recognition in the Wild
Leveraging Semantic Scene Characteristics and Multi-Stream Convolutional Architectures in a Contextual Approach for Video-Based Visual Emotion Recognition in the Wild
Ioannis Pikoulis
P. Filntisis
Petros Maragos
34
14
0
16 May 2021
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized
  Sports Actions
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
Yixuan Li
Lei Chen
Runyu He
Zhenzhi Wang
Gangshan Wu
Limin Wang
27
97
0
16 May 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor
  Segmentation
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Tianrui Hui
Shaofei Huang
Si Liu
Zihan Ding
Guanbin Li
Wenguan Wang
Jizhong Han
Fei-Yue Wang
20
46
0
14 May 2021
REGINA - Reasoning Graph Convolutional Networks in Human Action
  Recognition
REGINA - Reasoning Graph Convolutional Networks in Human Action Recognition
Bruno Degardin
Vasco Lopes
Hugo Proencca
3DH
GNN
40
10
0
14 May 2021
Contrastive Learning of Image Representations with Cross-Video
  Cycle-Consistency
Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency
Haiping Wu
Xiaolong Wang
SSL
30
31
0
13 May 2021
Home Action Genome: Cooperative Compositional Action Understanding
Home Action Genome: Cooperative Compositional Action Understanding
Nishant Rai
Haofeng Chen
Jingwei Ji
Rishi Desai
Kazuki Kozuka
Shun Ishizaka
Ehsan Adeli
Juan Carlos Niebles
29
73
0
11 May 2021
Previous
123...141516...444546
Next