ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.02953
  4. Cited By
Temporal Segment Networks for Action Recognition in Videos

Temporal Segment Networks for Action Recognition in Videos

8 May 2017
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
    ViT
ArXivPDFHTML

Papers citing "Temporal Segment Networks for Action Recognition in Videos"

50 / 298 papers shown
Title
Learning Grounded Vision-Language Representation for Versatile
  Understanding in Untrimmed Videos
Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
Teng Wang
Jinrui Zhang
Feng Zheng
Wenhao Jiang
Ran Cheng
Ping Luo
VLM
33
11
0
11 Mar 2023
Event Voxel Set Transformer for Spatiotemporal Representation Learning
  on Event Streams
Event Voxel Set Transformer for Spatiotemporal Representation Learning on Event Streams
Bochen Xie
Yongjian Deng
Z. Shao
Hai Liu
Qingsong Xu
Youfu Li
42
4
0
07 Mar 2023
Dynamic Storyboard Generation in an Engine-based Virtual Environment for
  Video Production
Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production
Anyi Rao
Xuekun Jiang
Yuwei Guo
Linning Xu
Lei Yang
Libiao Jin
Dahua Lin
Bo Dai
VGen
28
15
0
30 Jan 2023
Revisiting the Spatial and Temporal Modeling for Few-shot Action
  Recognition
Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition
Jiazheng Xing
Mengmeng Wang
Yong-Jin Liu
B. Mu
ViT
21
33
0
19 Jan 2023
HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot
  Action Recognition
HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Zhe Zuo
Changxin Gao
Rong Jin
Nong Sang
25
23
0
09 Jan 2023
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition
Ming Li
Xiangyu Xu
Hehe Fan
Pan Zhou
Jun Liu
Jia-Wei Liu
Jiahe Li
Jussi Keppo
Mike Zheng Shou
Shuicheng Yan
ViT
PICV
45
13
0
08 Jan 2023
Ego-Only: Egocentric Action Detection without Exocentric Transferring
Ego-Only: Egocentric Action Detection without Exocentric Transferring
Huiyu Wang
Mitesh Singh
Lorenzo Torresani
EgoV
72
23
0
03 Jan 2023
STEPs: Self-Supervised Key Step Extraction and Localization from
  Unlabeled Procedural Videos
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
Anshul B. Shah
Benjamin Lundell
H. Sawhney
Ramalingam Chellappa
SSL
16
8
0
02 Jan 2023
Policy Adaptation from Foundation Model Feedback
Policy Adaptation from Foundation Model Feedback
Yuying Ge
Annabella Macaluso
Erran L. Li
Ping Luo
Xiaolong Wang
LM&Ro
27
12
0
14 Dec 2022
PIVOT: Prompting for Video Continual Learning
PIVOT: Prompting for Video Continual Learning
Andrés Villa
Juan Carlos León Alcázar
Motasem Alfarra
Kumail Alhamoud
J. Hurtado
Fabian Caba Heilbron
Alvaro Soto
Guohao Li
VLM
CLL
40
45
0
09 Dec 2022
Masked Video Distillation: Rethinking Masked Feature Modeling for
  Self-supervised Video Representation Learning
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Lu Yuan
Yu-Gang Jiang
VGen
32
87
0
08 Dec 2022
Multimodal Vision Transformers with Forced Attention for Behavior
  Analysis
Multimodal Vision Transformers with Forced Attention for Behavior Analysis
Tanay Agrawal
Michal Balazia
Philippe Muller
Franccois Brémond
ViT
23
9
0
07 Dec 2022
Self-supervised and Weakly Supervised Contrastive Learning for Frame-wise Action Representations
Minghao Chen
Renbo Tu
Chenxi Huang
Yuqi Lin
Boxi Wu
Deng Cai
SSL
AI4TS
26
1
0
06 Dec 2022
Spatio-Temporal Crop Aggregation for Video Representation Learning
Spatio-Temporal Crop Aggregation for Video Representation Learning
Sepehr Sameni
Simon Jenni
Paolo Favaro
24
3
0
30 Nov 2022
Mitigating and Evaluating Static Bias of Action Representations in the
  Background and the Foreground
Mitigating and Evaluating Static Bias of Action Representations in the Background and the Foreground
Haoxin Li
Yuan Liu
Hanwang Zhang
Boyang Li
30
15
0
23 Nov 2022
Knowledge Prompting for Few-shot Action Recognition
Knowledge Prompting for Few-shot Action Recognition
Yuheng Shi
Xinxiao Wu
Hanxi Lin
VLM
19
4
0
22 Nov 2022
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant
  Spatiotemporal Tokens
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens
Sun-Kyoo Hwang
Jaehong Yoon
Youngwan Lee
Sung Ju Hwang
31
6
0
19 Nov 2022
Exploring Video Quality Assessment on User Generated Contents from
  Aesthetic and Technical Perspectives
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives
Haoning Wu
Erli Zhang
Liang Liao
Chaofeng Chen
Jingwen Hou
Annan Wang
Wenxiu Sun
Qiong Yan
Weisi Lin
23
144
0
09 Nov 2022
Bringing Online Egocentric Action Recognition into the wild
Bringing Online Egocentric Action Recognition into the wild
Gabriele Goletto
M. Planamente
Barbara Caputo
Giuseppe Averta
EgoV
19
3
0
06 Nov 2022
DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection
  Network
DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network
T. K. Vijay
Yash Raghuwanshi
D. P. Dogra
Heeseung Choi
Ig-Jae Kim
11
17
0
02 Nov 2022
Temporal-Viewpoint Transportation Plan for Skeletal Few-shot Action
  Recognition
Temporal-Viewpoint Transportation Plan for Skeletal Few-shot Action Recognition
Lei Wang
Piotr Koniusz
42
27
0
30 Oct 2022
Baby Physical Safety Monitoring in Smart Home Using Action Recognition
  System
Baby Physical Safety Monitoring in Smart Home Using Action Recognition System
Victor A. Adewopo
Nelly Elsayed
Kelly Anderson
26
6
0
22 Oct 2022
Linear Video Transformer with Feature Fixation
Linear Video Transformer with Feature Fixation
Kaiyue Lu
Zexia Liu
Jianyuan Wang
Weixuan Sun
Zhen Qin
...
Xuyang Shen
Huizhong Deng
Xiaodong Han
Yuchao Dai
Yiran Zhong
30
4
0
15 Oct 2022
Masked Motion Encoding for Self-Supervised Video Representation Learning
Masked Motion Encoding for Self-Supervised Video Representation Learning
Xinyu Sun
Peihao Chen
Liang-Chieh Chen
Chan Li
Thomas H. Li
Mingkui Tan
Chuang Gan
27
29
0
12 Oct 2022
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long
  Livestream Videos
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
Jielin Qiu
Franck Dernoncourt
Trung Bui
Zhaowen Wang
Ding Zhao
Hailin Jin
AI4TS
17
5
0
12 Oct 2022
Neighbourhood Representative Sampling for Efficient End-to-end Video
  Quality Assessment
Neighbourhood Representative Sampling for Efficient End-to-end Video Quality Assessment
Haoning Wu
Chaofeng Chen
Liang Liao
Jingwen Hou
Wenxiu Sun
Qiong Yan
Liang Feng
Weisi Lin
51
44
0
11 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised
  Video Transformer Pre-training
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Yuxin Song
Min Yang
Wenhao Wu
Dongliang He
Fu Li
Jingdong Wang
ViT
95
8
0
11 Oct 2022
An Action Is Worth Multiple Words: Handling Ambiguity in Action
  Recognition
An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition
Kiyoon Kim
Davide Moltisanti
Oisin Mac Aodha
Laura Sevilla-Lara
16
0
0
10 Oct 2022
A Closer Look at Temporal Ordering in the Segmentation of Instructional
  Videos
A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos
Anil Batra
Shreyank N. Gowda
Frank Keller
Laura Sevilla-Lara
34
5
0
30 Sep 2022
FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial
  Video Classification
FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification
P. Jin
Lichao Mou
Yuansheng Hua
Gui-Song Xia
Xiao Xiang Zhu
AI4TS
24
8
0
22 Sep 2022
Recipe Generation from Unsegmented Cooking Videos
Recipe Generation from Unsegmented Cooking Videos
Taichi Nishimura
Atsushi Hashimoto
Yoshitaka Ushiku
Hirotaka Kameko
Shinsuke Mori
25
3
0
21 Sep 2022
Real-time Online Video Detection with Temporal Smoothing Transformers
Real-time Online Video Detection with Temporal Smoothing Transformers
Yue Zhao
Philipp Krahenbuhl
ViT
69
57
0
19 Sep 2022
SDFE-LV: A Large-Scale, Multi-Source, and Unconstrained Database for
  Spotting Dynamic Facial Expressions in Long Videos
SDFE-LV: A Large-Scale, Multi-Source, and Unconstrained Database for Spotting Dynamic Facial Expressions in Long Videos
Xiaolin Xu
Yuan Zong
Wenming Zheng
Yang Li
Chuangao Tang
Xingxun Jiang
Haolin Jiang
CVBM
43
1
0
18 Sep 2022
Moving from 2D to 3D: volumetric medical image classification for rectal
  cancer staging
Moving from 2D to 3D: volumetric medical image classification for rectal cancer staging
Joohyun Lee
J. Oh
Inkyu Shin
You-sung Kim
D. Sohn
Tae-Sung Kim
In So Kweon
MedIm
24
4
0
13 Sep 2022
Video Mobile-Former: Video Recognition with Efficient Global
  Spatial-temporal Modeling
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Rui Wang
Zuxuan Wu
Dongdong Chen
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Luowei Zhou
Lu Yuan
Yu-Gang Jiang
ViT
40
4
0
25 Aug 2022
Efficient Attention-free Video Shift Transformers
Efficient Attention-free Video Shift Transformers
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
ViT
29
1
0
23 Aug 2022
Progressive Cross-modal Knowledge Distillation for Human Action
  Recognition
Progressive Cross-modal Knowledge Distillation for Human Action Recognition
Jianyuan Ni
A. Ngu
Yan Yan
HAI
20
20
0
17 Aug 2022
ViT-ReT: Vision and Recurrent Transformer Neural Networks for Human
  Activity Recognition in Videos
ViT-ReT: Vision and Recurrent Transformer Neural Networks for Human Activity Recognition in Videos
James Wensel
Hayat Ullah
Arslan Munir
ViT
18
42
0
16 Aug 2022
Leveraging Endo- and Exo-Temporal Regularization for Black-box Video
  Domain Adaptation
Leveraging Endo- and Exo-Temporal Regularization for Black-box Video Domain Adaptation
Yuecong Xu
Jianfei Yang
Haozhi Cao
Min-man Wu
Xiaoli Li
Lihua Xie
Zhenghua Chen
36
4
0
10 Aug 2022
Human Activity Recognition Using Cascaded Dual Attention CNN and
  Bi-Directional GRU Framework
Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework
Hayat Ullah
Arslan Munir
HAI
21
27
0
09 Aug 2022
Blockwise Temporal-Spatial Pathway Network
Blockwise Temporal-Spatial Pathway Network
SeulGi Hong
Min-Kook Choi
19
1
0
05 Aug 2022
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action
  Recognition
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition
M. C. Leong
Haosong Zhang
Huibin Tan
Liyuan Li
J. Lim
ViT
39
8
0
03 Aug 2022
MAR: Masked Autoencoders for Efficient Action Recognition
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
32
42
0
24 Jul 2022
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
Grant Van Horn
Rui Qian
Kimberly Wilber
Hartwig Adam
Oisin Mac Aodha
Serge J. Belongie
27
10
0
21 Jul 2022
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
Yuetian Weng
Zizheng Pan
Mingfei Han
Xiaojun Chang
Bohan Zhuang
ViT
19
25
0
21 Jul 2022
Unifying Event Detection and Captioning as Sequence Generation via
  Pre-Training
Unifying Event Detection and Captioning as Sequence Generation via Pre-Training
Qi Zhang
Yuqing Song
Qin Jin
30
24
0
18 Jul 2022
ReAct: Temporal Action Detection with Relational Queries
ReAct: Temporal Action Detection with Relational Queries
Ding Shi
Yujie Zhong
Qiong Cao
Jing Zhang
Lin Ma
Jia Li
Dacheng Tao
ViT
28
68
0
14 Jul 2022
OS-MSL: One Stage Multimodal Sequential Link Framework for Scene
  Segmentation and Classification
OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Ye Liu
Lingfeng Qiao
Di Yin
Zhuoxuan Jiang
Xinghua Jiang
Deqiang Jiang
Bo Ren
21
7
0
04 Jul 2022
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
Junting Pan
Ziyi Lin
Xiatian Zhu
Jing Shao
Hongsheng Li
27
190
0
27 Jun 2022
Bi-Calibration Networks for Weakly-Supervised Video Representation
  Learning
Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
35
6
0
21 Jun 2022
Previous
123456
Next