ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.02953
  4. Cited By
Temporal Segment Networks for Action Recognition in Videos

Temporal Segment Networks for Action Recognition in Videos

8 May 2017
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
    ViT
ArXivPDFHTML

Papers citing "Temporal Segment Networks for Action Recognition in Videos"

50 / 298 papers shown
Title
Taylor Videos for Action Recognition
Taylor Videos for Action Recognition
Lei Wang
Xiuyuan Yuan
Tom Gedeon
Liang Zheng
26
6
0
05 Feb 2024
Exploring the Synergies of Hybrid CNNs and ViTs Architectures for
  Computer Vision: A survey
Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey
Haruna Yunusa
Shiyin Qin
Abdulrahman Hamman Adama Chukkol
Abdulganiyu Abdu Yusuf
Isah Bello
A. Lawan
ViT
43
13
0
05 Feb 2024
MIFI: MultI-camera Feature Integration for Roust 3D Distracted Driver
  Activity Recognition
MIFI: MultI-camera Feature Integration for Roust 3D Distracted Driver Activity Recognition
Jian Kuang
Wenjing Li
Fang Li
Jun Zhang
Zhongcheng Wu
29
1
0
25 Jan 2024
Adversarial Augmentation Training Makes Action Recognition Models More
  Robust to Realistic Video Distribution Shifts
Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts
Kiyoon Kim
Shreyank N. Gowda
Panagiotis Eustratiadis
Antreas Antoniou
Robert B Fisher
45
2
0
21 Jan 2024
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision,
  Language, Audio, and Action
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLM
MLLM
37
144
0
28 Dec 2023
Deep Learning Approaches for Seizure Video Analysis: A Review
Deep Learning Approaches for Seizure Video Analysis: A Review
David Ahmedt-Aristizabal
M. Armin
Zeeshan Hayder
Norberto Garcia-Cairasco
Lars Petersson
Clinton Fookes
Simon Denman
A. McGonigal
32
21
0
18 Dec 2023
Counterfactual World Modeling for Physical Dynamics Understanding
Counterfactual World Modeling for Physical Dynamics Understanding
Rahul Venkatesh
Honglin Chen
Kevin T. Feigelis
Daniel M. Bear
Khaled Jedoui
...
Wanhee Lee
Sherry Liu
Kevin A. Smith
Judith E. Fan
Daniel L. K. Yamins
VGen
40
1
0
11 Dec 2023
DVANet: Disentangling View and Action Features for Multi-View Action
  Recognition
DVANet: Disentangling View and Action Features for Multi-View Action Recognition
Nyle Siddiqui
Praveen Tirupattur
Mubarak Shah
ViT
29
18
0
10 Dec 2023
A Review of Machine Learning Methods Applied to Video Analysis Systems
A Review of Machine Learning Methods Applied to Video Analysis Systems
Marios S. Pattichis
Venkatesh Jatla
Alvaro E. Ullao Cerna
19
3
0
08 Dec 2023
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Arun V. Reddy
William Paul
Corban Rivera
Ketul Shah
Celso M. de Melo
Rama Chellappa
37
4
0
05 Dec 2023
CAST: Cross-Attention in Space and Time for Video Action Recognition
CAST: Cross-Attention in Space and Time for Video Action Recognition
Dongho Lee
Jongseo Lee
Jinwoo Choi
EgoV
35
12
0
30 Nov 2023
Object-based (yet Class-agnostic) Video Domain Adaptation
Object-based (yet Class-agnostic) Video Domain Adaptation
Dantong Niu
Amir Bar
Roei Herzig
Trevor Darrell
Anna Rohrbach
22
1
0
29 Nov 2023
F4D: Factorized 4D Convolutional Neural Network for Efficient
  Video-level Representation Learning
F4D: Factorized 4D Convolutional Neural Network for Efficient Video-level Representation Learning
Mohammad Al-Saad
Lakshmish Ramaswamy
S. Bhandarkar
AI4TS
24
0
0
28 Nov 2023
Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities
  Using Web Instructional Videos
Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
Takehiko Ohkawa
Takuma Yagi
Taichi Nishimura
Ryosuke Furuta
Atsushi Hashimoto
Yoshitaka Ushiku
Yoichi Sato
EgoV
49
8
0
28 Nov 2023
BatchNorm-based Weakly Supervised Video Anomaly Detection
BatchNorm-based Weakly Supervised Video Anomaly Detection
Yixuan Zhou
Yi Qu
Xing Xu
Fumin Shen
Jingkuan Song
Hengtao Shen
28
18
0
26 Nov 2023
Learning Human Action Recognition Representations Without Real Humans
Learning Human Action Recognition Representations Without Real Humans
Howard Zhong
Samarth Mishra
Donghyun Kim
SouYoung Jin
Rameswar Panda
Hildegard Kuehne
Leonid Karlinsky
Venkatesh Saligrama
Aude Oliva
Rogerio Feris
24
3
0
10 Nov 2023
Asymmetric Masked Distillation for Pre-Training Small Foundation Models
Asymmetric Masked Distillation for Pre-Training Small Foundation Models
Zhiyu Zhao
Bingkun Huang
Sen Xing
Gangshan Wu
Yu Qiao
Limin Wang
39
5
0
06 Nov 2023
Large Models for Time Series and Spatio-Temporal Data: A Survey and
  Outlook
Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook
Ming Jin
Qingsong Wen
Yuxuan Liang
Chaoli Zhang
Siqiao Xue
...
Shirui Pan
Vincent S. Tseng
Yu Zheng
Lei Chen
Hui Xiong
AI4TS
SyDa
35
117
0
16 Oct 2023
Weakly-Supervised Video Anomaly Detection with Snippet Anomalous
  Attention
Weakly-Supervised Video Anomaly Detection with Snippet Anomalous Attention
Yidan Fan
Yongxin Yu
Wenhuan Lu
Yahong Han
25
20
0
28 Sep 2023
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object
  Interactions in Industrial Scenarios
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios
Francesco Ragusa
Rosario Leonardi
Michele Mazzamuto
Claudia Bonanno
Rosario Scavo
Antonino Furnari
G. Farinella
30
7
0
26 Sep 2023
Towards Lexical Analysis of Dog Vocalizations via Online Videos
Towards Lexical Analysis of Dog Vocalizations via Online Videos
Yufei Wang
Chunhao Zhang
Jieyi Huang
Mengyue Wu
Ke Zhu
14
1
0
21 Sep 2023
Collaborative Three-Stream Transformers for Video Captioning
Collaborative Three-Stream Transformers for Video Captioning
Hao Wang
Libo Zhang
Hengrui Fan
Tiejian Luo
36
6
0
18 Sep 2023
Privacy-preserving Early Detection of Epileptic Seizures in Videos
Privacy-preserving Early Detection of Epileptic Seizures in Videos
Deval Mehta
Shobi Sivathamboo
Hugh Simpson
Patrick Kwan
Terence OBrien
Zongyuan Ge
18
5
0
15 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video
  Transfer Learning
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
24
18
0
14 Sep 2023
Temporal Action Localization with Enhanced Instant Discriminability
Temporal Action Localization with Enhanced Instant Discriminability
Ding Shi
Qiong Cao
Yujie Zhong
Shan An
Jian Cheng
Haogang Zhu
Dacheng Tao
39
9
0
11 Sep 2023
EgoPCA: A New Framework for Egocentric Hand-Object Interaction
  Understanding
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
Yue Xu
Yong-Lu Li
Zhemin Huang
Michael Xu Liu
Cewu Lu
Yu-Wing Tai
Chi-Keung Tang
EgoV
25
9
0
05 Sep 2023
SOAR: Scene-debiasing Open-set Action Recognition
SOAR: Scene-debiasing Open-set Action Recognition
Yuanhao Zhai
Ziyi Liu
Zhenyu Wu
Yi Wu
Chunluan Zhou
David Doermann
Junsong Yuan
Gang Hua
21
11
0
03 Sep 2023
Motion-Guided Masking for Spatiotemporal Representation Learning
Motion-Guided Masking for Spatiotemporal Representation Learning
D. Fan
Jue Wang
Shuai Liao
Yi Zhu
Vimal Bhat
H. Santos-Villalobos
M. Rohith
Xinyu Li
VGen
37
19
0
24 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
MGMAE: Motion Guided Masking for Video Masked Autoencoding
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
39
30
0
21 Aug 2023
Audio-Visual Glance Network for Efficient Video Recognition
Audio-Visual Glance Network for Efficient Video Recognition
Muhammad Adi Nugroho
Sangmin Woo
Sumin Lee
Changick Kim
16
5
0
18 Aug 2023
GaitFormer: Revisiting Intrinsic Periodicity for Gait Recognition
GaitFormer: Revisiting Intrinsic Periodicity for Gait Recognition
Qianyu Wu
Rui Xiao
Kaixin Xu
Jingcheng Ni
Boxun Li
Ziyao Xu
CVBM
40
2
0
25 Jul 2023
3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in
  Autonomous Driving
3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving
Qipeng Li
Yuan Zhuang
Yiwen Chen
Jianzhu Huai
Miaopeng Li
Tianbing Ma
Yufei Tang
Xinlian Liang
3DPC
38
3
0
18 Jul 2023
Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition
Jianyuan Ni
Hao Tang
A. Ngu
Gaowen Liu
Yan Yan
32
3
0
07 Jul 2023
VideoGLUE: Video General Understanding Evaluation of Foundation Models
VideoGLUE: Video General Understanding Evaluation of Foundation Models
Liangzhe Yuan
N. B. Gundavarapu
Long Zhao
Hao Zhou
Huayu Chen
...
Florian Schroff
Hartwig Adam
Ming Yang
Ting Liu
Boqing Gong
ELM
37
9
0
06 Jul 2023
Fine-grained Action Analysis: A Multi-modality and Multi-task Dataset of
  Figure Skating
Fine-grained Action Analysis: A Multi-modality and Multi-task Dataset of Figure Skating
Shengyuan Liu
Yuanyuan Ding
Guihong Lao
Sihan Zhang
Ning Zhou
Wen-Yue Chen
Hao Liu
21
2
0
06 Jul 2023
CLANet: A Comprehensive Framework for Cross-Batch Cell Line
  Identification Using Brightfield Images
CLANet: A Comprehensive Framework for Cross-Batch Cell Line Identification Using Brightfield Images
Lei Tong
A. Corrigan
Navin Rathna Kumar
Kerry Hallbrook
Jonathan Orme
Yinhai Wang
Huiyu Zhou
11
0
0
28 Jun 2023
Efficient Online Processing with Deep Neural Networks
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
23
0
0
23 Jun 2023
How can objects help action recognition?
How can objects help action recognition?
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
35
14
0
20 Jun 2023
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation
  of Videos
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Jielin Qiu
Jiacheng Zhu
William Jongwon Han
Aditesh Kumar
Karthik Mittal
...
Linjie Li
Jianfeng Wang
Ding Zhao
Bo Li
Lijuan Wang
VGen
16
5
0
07 Jun 2023
Teacher Agent: A Knowledge Distillation-Free Framework for
  Rehearsal-based Video Incremental Learning
Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning
Shengqin Jiang
Yao-Huei Fang
Haokui Zhang
Qingshan Liu
Yuankai Qi
Yang Yang
Peifeng Wang
CLL
25
0
0
01 Jun 2023
Just a Glimpse: Rethinking Temporal Information for Video Continual
  Learning
Just a Glimpse: Rethinking Temporal Information for Video Continual Learning
Lama Alssum
Juan Carlos León Alcázar
Merey Ramazanova
Chen Zhao
Guohao Li
CLL
24
6
0
28 May 2023
Motion-Based Sign Language Video Summarization using Curvature and
  Torsion
Motion-Based Sign Language Video Summarization using Curvature and Torsion
Evangelos Sartinas
E. Psarakis
D. Kosmopoulos
SLR
25
1
0
26 May 2023
Learning Emotion Representations from Verbal and Nonverbal Communication
Learning Emotion Representations from Verbal and Nonverbal Communication
Sitao Zhang
Yimu Pan
J. Z. Wang
VLM
69
21
0
22 May 2023
Self-Supervised 3D Action Representation Learning with Skeleton Cloud
  Colorization
Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization
Siyuan Yang
Jun Liu
Shijian Lu
Er Meng Hwa
Yongjian Hu
Alex C. Kot
3DPC
3DH
30
16
0
18 Apr 2023
Multimodal Representation Learning of Cardiovascular Magnetic Resonance
  Imaging
Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging
Jielin Qiu
Peide Huang
Makiya Nakashima
Jae-Hyeok Lee
Jiacheng Zhu
...
Byung-Hak Kim
Debbie Kwon
Douglas Weber
Ding Zhao
David Chen
SSL
21
5
0
16 Apr 2023
VARS: Video Assistant Referee System for Automated Soccer Decision
  Making from Multiple Views
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
Jan Held
A. Cioppa
Silvio Giancola
Abdullah Hamdi
Guohao Li
Marc Van Droogenbroeck
27
29
0
10 Apr 2023
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
Ziteng Gao
Zhan Tong
Limin Wang
Mike Zheng Shou
33
9
0
07 Apr 2023
Bodily expressed emotion understanding through integrating Laban
  movement analysis
Bodily expressed emotion understanding through integrating Laban movement analysis
Chenyan Wu
Dolzodmaa Davaasuren
T. Shafir
Rachelle Tsachor
James Z. Wang
32
6
0
05 Apr 2023
Weakly Supervised Video Representation Learning with Unaligned Text for
  Sequential Videos
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
Sixun Dong
Huazhang Hu
Dongze Lian
Weixin Luo
Yichen Qian
Shenghua Gao
ViT
AI4TS
23
11
0
22 Mar 2023
Augmenting and Aligning Snippets for Few-Shot Video Domain Adaptation
Augmenting and Aligning Snippets for Few-Shot Video Domain Adaptation
Yuecong Xu
Jianfei Yang
Yunjiao Zhou
Zhenghua Chen
Min-man Wu
Xiaoli Li
32
5
0
18 Mar 2023
Previous
123456
Next