TEA: Temporal Excitation and Aggregation for Action Recognition

Computer Vision and Pattern Recognition (CVPR), 2020
3 April 2020
Yan Li
Bin Ji
Xintian Shi
Jianguo Zhang
Bin Kang
Limin Wang
    ViT

Papers citing "TEA: Temporal Excitation and Aggregation for Action Recognition"

50 / 162 papers shown
Unleashing Temporal Capacity of Spiking Neural Networks through Spatiotemporal Separation
Yiting Dong
Zhaofei Yu
Jianhao Ding
Zijie Xu
T. Huang
81
0
0
05 Dec 2025
GA2-CLIP: Generic Attribute Anchor for Efficient Prompt Tuning in Video-Language Models
Bin Wang
Ruotong Hu
Wenqian Wang
W. Li
Mingliang Gao
Runmin Cong
Wei Zhang
VLM
166
0
0
27 Nov 2025
Towards an Effective Action-Region Tracking Framework for Fine-grained Video Action Recognition
Baoli Sun
Y. X. R. Wang
Xinzhu Ma
Zhihui Wang
Kun Lu
Zhiyong Wang
264
0
0
26 Nov 2025
Smooth regularization for efficient video recognition
Gil Goldman
Raja Giryes
Mahadev Satyanarayanan
AI4TS
296
0
0
25 Nov 2025
A Renaissance of Explicit Motion Information Mining from Transformers for Action Recognition
Peiqin Zhuang
Wenlong Zhang
Yichao Wu
Ding Liang
Luping Zhou
Yali Wang
Wanli Ouyang
261
1
0
21 Oct 2025
Watch Where You Move: Region-aware Dynamic Aggregation and Excitation for Gait Recognition
IEEE Transactions on Multimedia (TMM), 2025
Binyuan Huang
Yongdong Luo
Xianda Guo
Xiawu Zheng
Zheng Hua Zhu
Jiahui Pan
Chengju Zhou
185
1
0
18 Oct 2025
EDCFlow: Exploring Temporally Dense Difference Maps for Event-based Optical Flow Estimation
Computer Vision and Pattern Recognition (CVPR), 2025
Daikun Liu
Lei Cheng
Teng Wang
Changyin Sun
264
4
0
04 Jun 2025
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?
Tianyuan Qu
Longxiang Tang
Bohao Peng
Senqiao Yang
Bei Yu
Jiaya Jia
VLM
1.1K
15
0
16 Mar 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
405
0
0
11 Feb 2025
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yulin Wang
Haoji Zhang
Yang Yue
Shiji Song
Chao Deng
Junlan Feng
Gao Huang
343
16
0
15 Dec 2024
Making Every Frame Matter: Continuous Activity Recognition in Streaming Video via Adaptive Video Context Modeling
Hao Wu
Donglin Bai
Shiqi Jiang
Qianxi Zhang
Yue Yang
Ting Cao
Fengyuan Xu
Yunxin Liu
667
0
0
19 Oct 2024
TDS-CLIP: Temporal Difference Side Network for Efficient Video Action Recognition
Bin Wang
W. Li
Wenqian Wang
Mingliang Gao
Runmin Cong
Wei Emma Zhang
VLM
218
1
0
20 Aug 2024
Dynamic and Compressive Adaptation of Transformers From Images to Videos
Guozhen Zhang
Jingyu Liu
Shengming Cao
Xiaotong Zhao
Kevin Zhao
Kai Ma
Limin Wang
ViT
520
2
0
13 Aug 2024
Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Habib Hajimolahoseini
Walid Ahmed
Austin Wen
Yang Liu
309
0
0
23 Jul 2024
C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition
Rongchang Li
Zhenhua Feng
Tianyang Xu
Linze Li
Xiao-Jun Wu
Muhammad Awais
Sara Atito
Josef Kittler
CoGe
474
14
0
08 Jul 2024
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition
Y. Hao
Diansong Zhou
Zhicai Wang
Chong-Wah Ngo
Meng Wang
ViT
304
14
0
03 Jul 2024
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding
Yingjie Zhai
Wenshuo Li
Yehui Tang
Xinghao Chen
Yunhe Wang
ViT
287
2
0
14 May 2024
Learning Correlation Structures for Vision Transformers
Manjin Kim
Paul Hongsuck Seo
Cordelia Schmid
Minsu Cho
ViT
368
30
0
05 Apr 2024
Don't Judge by the Look: Towards Motion Coherent Video Representation
International Conference on Learning Representations (ICLR), 2024
Yitian Zhang
Yue Bai
Huan Wang
Yizhou Wang
Yun Fu
320
3
0
14 Mar 2024
M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2024
Mengmeng Wang
Jiazheng Xing
Boyuan Jiang
Jun Chen
Jianbiao Mei
Xingxing Zuo
Guang Dai
Jingdong Wang
Yong-Jin Liu
VLM
249
11
0
22 Jan 2024
F4D: Factorized 4D Convolutional Neural Network for Efficient Video-level Representation Learning
International Conference on Agents and Artificial Intelligence (ICAART), 2023
Mohammad Al-Saad
Lakshmish Ramaswamy
S. Bhandarkar
AI4TS
193
4
0
28 Nov 2023
Semantic-aware Temporal Channel-wise Attention for Cardiac Function Assessment
IEEE International Symposium on Biomedical Imaging (ISBI), 2022
Guanqi Chen
Guanbin Li
110
0
0
09 Oct 2023
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
European Conference on Computer Vision (ECCV), 2023
Xinhao Li
Yuhan Zhu
Limin Wang
VLM
356
20
0
02 Oct 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
266
35
0
14 Sep 2023
TransNet: A Transfer Learning-Based Network for Human Action Recognition
International Conference on Machine Learning and Applications (ICMLA), 2023
Khaled Alomar
Xiaohao Cai
324
1
0
13 Sep 2023
IndGIC: Supervised Action Recognition under Low Illumination
Jing-Teng Zeng
217
4
0
29 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
359
38
0
27 Aug 2023
Improving Video Violence Recognition with Human Interaction Learning on 3D Skeleton Point Clouds
Qingxin Xiao
Guosheng Lin
Qingyao Wu
3DH
3DPC
251
6
0
26 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
IEEE International Conference on Computer Vision (ICCV), 2023
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
265
56
0
21 Aug 2023
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition
ACM Multimedia (ACM MM), 2023
Yan Zhu
Junbao Zhuo
B. Ma
Jiajia Geng
Xiaoming Wei
Xiaolin K. Wei
Shuhui Wang
VLM
215
8
0
14 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
238
18
0
10 Aug 2023
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning
ACM Multimedia (ACM MM), 2023
Qianqian Wang
Junlong Du
Ke Yan
Shouhong Ding
VLM
231
33
0
09 Aug 2023
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
IEEE International Conference on Computer Vision (ICCV), 2023
Shuangrui Ding
Peisen Zhao
Xiaopeng Zhang
Rui Qian
H. Xiong
Qi Tian
ViT
246
28
0
08 Aug 2023
ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition
Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2023
S. Chaudhuri
Saumik Bhattacharya
212
8
0
07 Aug 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
ACM Multimedia (ACM MM), 2023
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
274
9
0
27 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
259
17
0
18 Jul 2023
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
Fahad Shahbaz Khan
ViT
339
32
0
13 Jul 2023
Deep Neural Networks in Video Human Action Recognition: A Review
Zihan Wang
Yang Yang
Zhi Liu
Y. Zheng
322
9
0
25 May 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Computer Vision and Pattern Recognition (CVPR), 2023
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
496
623
0
29 Mar 2023
Frame Flexible Network
Computer Vision and Pattern Recognition (CVPR), 2023
Yitian Zhang
Yue Bai
Chang Liu
Huan Wang
Sheng Li
Yun Fu
248
5
0
26 Mar 2023
Multi-view knowledge distillation transformer for human action recognition
Yi Lin
Vincent S. Tseng
ViT
281
2
0
25 Mar 2023
Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video
Computer Vision and Pattern Recognition (CVPR), 2023
Runyang Feng
Yixing Gao
Xueqi Ma
Tze Ho Elden Tse
H. Chang
3DH
449
33
0
15 Mar 2023
Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition
International Conference on Learning Representations (ICLR), 2023
Junyan Wang
Zhenhong Sun
Yichen Qian
Dong Gong
Xiuyu Sun
Ming Lin
Maurice Pagnucco
Yang Song
3DPC
236
14
0
05 Mar 2023
Improving Zero-Shot Action Recognition using Human Instruction with Text Description
Na Wu
Hiroshi Kera
K. Kawamoto
267
12
0
21 Jan 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2022
Wenhao Wu
Xiaohan Wang
Haipeng Luo
Jingdong Wang
Yi Yang
Wanli Ouyang
438
83
0
31 Dec 2022
An end-to-end multi-scale network for action prediction in videos
Xiaofan Liu
Jianqin Yin
Yuanxi Sun
Zhicheng Zhang
Jin Tang
226
1
0
31 Dec 2022
StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition
Xi Shen
Zhedong Zheng
Yi Yang
SLR
422
28
0
25 Dec 2022
DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition
Neural Networks (NN), 2022
Santosh Kumar Yadav
Achleshwar Luthra
Esha Pahwa
K. Tiwari
Heena Rathore
Hari Mohan Pandey
Peter Corcoran
249
21
0
07 Dec 2022
VLG: General Video Recognition with Web Textual Knowledge
International Journal of Computer Vision (IJCV), 2022
Jintao Lin
Zhaoyang Liu
Wenhai Wang
Wayne Wu
Limin Wang
380
4
0
03 Dec 2022
Video Test-Time Adaptation for Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Wei Lin
M. Jehanzeb Mirza
Mateusz Koziński
Horst Possegger
Hilde Kuehne
Horst Bischof
TTA
326
50
0
24 Nov 2022