ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.04730
  4. Cited By
X3D: Expanding Architectures for Efficient Video Recognition

X3D: Expanding Architectures for Efficient Video Recognition

9 April 2020
Christoph Feichtenhofer
ArXivPDFHTML

Papers citing "X3D: Expanding Architectures for Efficient Video Recognition"

50 / 526 papers shown
Title
Deep Learning-Based Real-Time Rate Control for Live Streaming on
  Wireless Networks
Deep Learning-Based Real-Time Rate Control for Live Streaming on Wireless Networks
Matin Mortaheb
M. A. Khojastepour
S. Chakradhar
S. Ulukus
8
0
0
27 Sep 2023
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object
  Interactions in Industrial Scenarios
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios
Francesco Ragusa
Rosario Leonardi
Michele Mazzamuto
Claudia Bonanno
Rosario Scavo
Antonino Furnari
G. Farinella
14
7
0
26 Sep 2023
Egocentric RGB+Depth Action Recognition in Industry-Like Settings
Egocentric RGB+Depth Action Recognition in Industry-Like Settings
Jyoti Kini
Sarah Fleischer
I. Dave
Mubarak Shah
EgoV
16
2
0
25 Sep 2023
S3TC: Spiking Separated Spatial and Temporal Convolutions with
  Unsupervised STDP-based Learning for Action Recognition
S3TC: Spiking Separated Spatial and Temporal Convolutions with Unsupervised STDP-based Learning for Action Recognition
Mireille el Assal
Pierre Tirilly
Ioan Marius Bilasco
11
2
0
22 Sep 2023
Selective Volume Mixup for Video Action Recognition
Selective Volume Mixup for Video Action Recognition
Yi Tan
Zhaofan Qiu
Y. Hao
Ting Yao
Xiangnan He
Tao Mei
ViT
20
2
0
18 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video
  Transfer Learning
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
11
18
0
14 Sep 2023
ATM: Action Temporality Modeling for Video Question Answering
ATM: Action Temporality Modeling for Video Question Answering
Junwen Chen
Jie Zhu
Yu Kong
9
1
0
05 Sep 2023
AAN: Attributes-Aware Network for Temporal Action Detection
AAN: Attributes-Aware Network for Temporal Action Detection
Rui Dai
Srijan Das
Michael S. Ryoo
François Brémond
19
4
0
01 Sep 2023
Deep Video Codec Control for Vision Models
Deep Video Codec Control for Vision Models
Christoph Reich
Biplob K. Debnath
Deep Patel
Tim Prangemeier
Daniel Cremers
S. Chakradhar
8
1
0
30 Aug 2023
IndGIC: Supervised Action Recognition under Low Illumination
IndGIC: Supervised Action Recognition under Low Illumination
Jing-Teng Zeng
19
1
0
29 Aug 2023
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior
  Graph Reasoning
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
Jian Zhao
13
3
0
29 Aug 2023
CEFHRI: A Communication Efficient Federated Learning Framework for
  Recognizing Industrial Human-Robot Interaction
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction
Umar Khalid
Hasan Iqbal
Saeed Vahidian
Jing Hua
C. L. P. Chen
14
2
0
29 Aug 2023
LAC: Latent Action Composition for Skeleton-based Action Segmentation
LAC: Latent Action Composition for Skeleton-based Action Segmentation
Di Yang
Yaohui Wang
A. Dantcheva
Quan Kong
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
22
9
0
28 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
6
20
0
27 Aug 2023
EventTransAct: A video transformer-based framework for Event-camera
  based action recognition
EventTransAct: A video transformer-based framework for Event-camera based action recognition
Tristan de Blegiers
I. Dave
Adeel Yousaf
M. Shah
ViT
18
9
0
25 Aug 2023
An All Deep System for Badminton Game Analysis
An All Deep System for Badminton Game Analysis
Po-Yung Chou
Yu-Chun Lo
Bo Xie
Chu-Hsing Lin
Yu-Yung Kao
6
0
0
24 Aug 2023
Towards Privacy-Supporting Fall Detection via Deep Unsupervised
  RGB2Depth Adaptation
Towards Privacy-Supporting Fall Detection via Deep Unsupervised RGB2Depth Adaptation
Hejun Xiao
Kunyu Peng
Xiangsheng Huang
Alina Roitberg
Hao Li
Zhao Wang
Rainer Stiefelhagen
6
3
0
23 Aug 2023
Joint learning of images and videos with a single Vision Transformer
Joint learning of images and videos with a single Vision Transformer
Shuki Shimizu
Toru Tamaki
ViT
11
0
0
21 Aug 2023
Action Class Relation Detection and Classification Across Multiple Video
  Datasets
Action Class Relation Detection and Classification Across Multiple Video Datasets
Yuya Yoshikawa
Yutaro Shigeto
Masashi Shimbo
A. Takeuchi
11
0
0
15 Aug 2023
Temporally-Adaptive Models for Efficient Video Understanding
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
17
5
0
10 Aug 2023
PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
Faegheh Sardari
A. Mustafa
Philip J. B. Jackson
A. Hilton
ViT
11
6
0
09 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
9
7
0
09 Aug 2023
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion
  Prompts Learning
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning
Qianqian Wang
Junlong Du
Ke Yan
Shouhong Ding
VLM
16
17
0
09 Aug 2023
Objects do not disappear: Video object detection by single-frame object
  location anticipation
Objects do not disappear: Video object detection by single-frame object location anticipation
X. Liu
F. Karimi Nejadasl
J. C. V. Gemert
O. Booij
S. Pintea
11
5
0
09 Aug 2023
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
Shuangrui Ding
Peisen Zhao
Xiaopeng Zhang
Rui Qian
H. Xiong
Qi Tian
ViT
14
16
0
08 Aug 2023
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
Xiao Wang
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
64
14
0
08 Aug 2023
A Survey on Deep Learning-based Spatio-temporal Action Detection
A Survey on Deep Learning-based Spatio-temporal Action Detection
Peng Wang
Fanwei Zeng
Yu Qian
16
4
0
03 Aug 2023
An X3D Neural Network Analysis for Runner's Performance Assessment in a
  Wild Sporting Environment
An X3D Neural Network Analysis for Runner's Performance Assessment in a Wild Sporting Environment
David Freire-Obregón
J. Lorenzo-Navarro
Oliverio J. Santana
Daniel Hernández-Sosa
Modesto Castrillón-Santana
11
1
0
22 Jul 2023
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical
  Phase Recognition
TUNeS: A Temporal U-Net with Self-Attention for Video-based Surgical Phase Recognition
Isabel Funke
Dominik Rivoir
Stefanie Krell
Stefanie Speidel
16
2
0
19 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
30
8
0
18 Jul 2023
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation
Mennatullah Siam
R. Karim
Henghui Zhao
Richard P. Wildes
VOS
14
2
0
15 Jul 2023
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action
  Recognition
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
F. Khan
ViT
41
12
0
13 Jul 2023
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel
  Segmentation
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation
MD Wahiduzzaman Khan
Hong Sheng
Hu Zhang
Heming Du
Sen Wang
...
Jack Phu
A. Agar
Zichen Huang
M. Golzan
Xin Yu
11
5
0
13 Jul 2023
SwiFT: Swin 4D fMRI Transformer
SwiFT: Swin 4D fMRI Transformer
P. Y. Kim
Junbeom Kwon
Sunghwan Joo
Sang-Peel Bae
Donggyu Lee
Yoonho Jung
Shinjae Yoo
Jiook Cha
Taesup Moon
MedIm
15
20
0
12 Jul 2023
Human-to-Human Interaction Detection
Human-to-Human Interaction Detection
Zhenhua Wang
Kaining Ying
Jiajun Meng
J. Ning
22
2
0
02 Jul 2023
Streaming egocentric action anticipation: An evaluation scheme and
  approach
Streaming egocentric action anticipation: An evaluation scheme and approach
Antonino Furnari
G. Farinella
EgoV
11
3
0
29 Jun 2023
SpotEM: Efficient Video Search for Episodic Memory
SpotEM: Efficient Video Search for Episodic Memory
Santhosh Kumar Ramakrishnan
Ziad Al-Halah
Kristen Grauman
VLM
13
9
0
28 Jun 2023
Spiking Two-Stream Methods with Unsupervised STDP-based Learning for
  Action Recognition
Spiking Two-Stream Methods with Unsupervised STDP-based Learning for Action Recognition
Mireille el Assal
Pierre Tirilly
Ioan Marius Bilasco
7
3
0
23 Jun 2023
Efficient Online Processing with Deep Neural Networks
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
16
0
0
23 Jun 2023
Bullying10K: A Large-Scale Neuromorphic Dataset towards
  Privacy-Preserving Bullying Recognition
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition
Yiting Dong
Yang Li
Dongcheng Zhao
Guobin Shen
Yi Zeng
18
12
0
20 Jun 2023
A survey on deep learning approaches for data integration in autonomous
  driving system
A survey on deep learning approaches for data integration in autonomous driving system
Xi Zhu
Likang Wang
Caifa Zhou
Xiya Cao
Yue Gong
L. Chen
12
1
0
17 Jun 2023
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in
  Vision Transformers
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
Dominick Reilly
Aman Chadha
Srijan Das
ViT
17
4
0
15 Jun 2023
Point-Voxel Absorbing Graph Representation Learning for Event Stream
  based Recognition
Point-Voxel Absorbing Graph Representation Learning for Event Stream based Recognition
Bowei Jiang
Chengguo Yuan
Xiao Wang
Zhimin Bao
Lin Zhu
Yonghong Tian
Bin Luo
GNN
3DPC
10
4
0
08 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video
  Using Multiple Instances Learning
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Q. Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
8
7
0
06 Jun 2023
Diversifying Joint Vision-Language Tokenization Learning
Diversifying Joint Vision-Language Tokenization Learning
Vardaan Pahuja
A. Piergiovanni
A. Angelova
11
0
0
06 Jun 2023
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order
  Learning
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning
Jianghui Wang
Yuxuan Wang
Dongyan Zhao
Zilong Zheng
35
0
0
04 Jun 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning
  Challenges
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
J. C. V. Gemert
8
9
0
31 May 2023
FMM-X3D: FPGA-based modeling and mapping of X3D for Human Action
  Recognition
FMM-X3D: FPGA-based modeling and mapping of X3D for Human Action Recognition
Petros Toupas
C. Bouganis
Dimitrios Tzovaras
10
3
0
29 May 2023
Cross-view Action Recognition Understanding From Exocentric to
  Egocentric Perspective
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Thanh-Dat Truong
Khoa Luu
EgoV
19
9
0
25 May 2023
Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos
Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos
Błażej Leporowski
Arian Bakhtiarnia
Nicole Bonnici
A. Muscat
Luca Zanella
Yiming Wang
Alexandros Iosifidis
14
1
0
24 May 2023
Previous
12345...91011
Next