ResearchTrend.AI
Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition
arXiv: 2312.11128

18 December 2023
Xiao Wang
Yao Rong
Shiao Wang
Yuan Chen
Zhe Wu
Bowei Jiang
Yonghong Tian
Jin Tang
ViT

Papers citing "Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition"

10 / 10 papers shown
Human Activity Recognition using RGB-Event based Sensors: A Multi-modal Heat Conduction Model and A Benchmark Dataset
Shiao Wang
X. Wang
Bo Jiang
Lin Zhu
G. Li
Y. Wang
Yonghong Tian
Jin Tang
40
0
0
08 Apr 2025
Uncertainty-aware Bridge based Mobile-Former Network for Event-based Pattern Recognition
Haoxiang Yang
Chengguo Yuan
Yabin Zhu
Langlang Chen
Xiao Wang
Jin Tang
45
0
0
20 Jan 2024
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
Xiao Wang
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
64
14
0
08 Aug 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
1,899
0
30 Jan 2023
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
255
5,353
0
11 Nov 2021
The Right to Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong
C. Duong
T. D. Vu
H. Pham
Bhiksha Raj
Ngan Le
Khoa Luu
35
36
0
06 Aug 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
270
1,939
0
09 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
188
375
0
01 Feb 2021
Trear: Transformer-based RGB-D Egocentric Action Recognition
Xiangyu Li
Yonghong Hou
Pichao Wang
Zhimin Gao
Mingliang Xu
Wanqing Li
ViT
166
88
0
05 Jan 2021
Temporal Binary Representation for Event-Based Action Recognition
Simone Undri Innocenti
Federico Becattini
F. Pernici
A. Bimbo
34
54
0
18 Oct 2020