ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.04730
  4. Cited By
X3D: Expanding Architectures for Efficient Video Recognition

X3D: Expanding Architectures for Efficient Video Recognition

9 April 2020
Christoph Feichtenhofer
ArXivPDFHTML

Papers citing "X3D: Expanding Architectures for Efficient Video Recognition"

50 / 526 papers shown
Title
EZSR: Event-based Zero-Shot Recognition
EZSR: Event-based Zero-Shot Recognition
Yan Yang
Sehwan Kim
Dongxu Li
Y. Sun
20
0
0
31 Jul 2024
PEAR: Phrase-Based Hand-Object Interaction Anticipation
PEAR: Phrase-Based Hand-Object Interaction Anticipation
Zichen Zhang
Hongcheng Luo
Wei Zhai
N. A. Ushakov
Yu Kang
28
5
0
31 Jul 2024
Classification Matters: Improving Video Action Detection with
  Class-Specific Attention
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee
Taeoh Kim
Inwoong Lee
Minho Shim
Dongyoon Wee
Minsu Cho
Suha Kwak
29
0
0
29 Jul 2024
Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Habib Hajimolahoseini
Walid Ahmed
Austin Wen
Yang Liu
14
0
0
23 Jul 2024
Human-Centric Transformer for Domain Adaptive Action Recognition
Human-Centric Transformer for Domain Adaptive Action Recognition
Kun-Yu Lin
Jiaming Zhou
Wei-Shi Zheng
18
6
0
15 Jul 2024
C2C: Component-to-Composition Learning for Zero-Shot Compositional
  Action Recognition
C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition
Rongchang Li
Zhenhua Feng
Tianyang Xu
Linze Li
Xiao-Jun Wu
Muhammad Awais
Sara Atito
Josef Kittler
CoGe
37
5
0
08 Jul 2024
DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action
  Recognition
DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition
Qi Wang
Zhou Xu
Yuming Lin
Jingtao Ye
Hongsheng Li
Guangming Zhu
Syed Afaq Ali Shah
Mohammed Bennamoun
Liang Zhang
AI4TS
25
5
0
06 Jul 2024
PosMLP-Video: Spatial and Temporal Relative Position Encoding for
  Efficient Video Recognition
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition
Y. Hao
Diansong Zhou
Zhicai Wang
Chong-Wah Ngo
Meng Wang
ViT
19
1
0
03 Jul 2024
Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion
  Approach for Event Stream Recognition
Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition
Lan Chen
Dong Li
Xiao Wang
Pengpeng Shao
Wei Zhang
Yaowei Wang
Yonghong Tian
Jin Tang
57
2
0
27 Jun 2024
EgoVideo: Exploring Egocentric Foundation Model and Downstream
  Adaptation
EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation
Baoqi Pei
Guo Chen
Jilan Xu
Yuping He
Yicheng Liu
...
Yifei Huang
Yali Wang
Tong Lu
Limin Wang
Yu Qiao
EgoV
16
10
0
26 Jun 2024
Expressive Keypoints for Skeleton-based Action Recognition via Skeleton
  Transformation
Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation
Yijie Yang
Jinlu Zhang
Jiaxu Zhang
Zhigang Tu
21
4
0
26 Jun 2024
ViLCo-Bench: VIdeo Language COntinual learning Benchmark
ViLCo-Bench: VIdeo Language COntinual learning Benchmark
Tianqi Tang
Shohreh Deldari
Hao Xue
Celso De Melo
Flora D. Salim
CLL
19
2
0
19 Jun 2024
Recognition of Dynamic Hand Gestures in Long Distance using a Web-Camera
  for Robot Guidance
Recognition of Dynamic Hand Gestures in Long Distance using a Web-Camera for Robot Guidance
Eran Bamani Beeri
Eden Nissinman
A. Sintov
16
0
0
18 Jun 2024
FCA-RAC: First Cycle Annotated Repetitive Action Counting
FCA-RAC: First Cycle Annotated Repetitive Action Counting
Jiada Lu
Weiwei Zhou
Xiang Qian
Dongze Lian
Yanyu Xu
Weifeng Wang
Lina Cao
Shenghua Gao
16
0
0
18 Jun 2024
Skim then Focus: Integrating Contextual and Fine-grained Views for
  Repetitive Action Counting
Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting
Zhengqi Zhao
Xiaohu Huang
Hao Zhou
Kun Yao
Errui Ding
Jingdong Wang
Xinggang Wang
Wenyu Liu
Bin Feng
18
0
0
13 Jun 2024
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow
  Understanding
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
Ming Hu
Peng Xia
Lin Wang
Siyuan Yan
Feilong Tang
...
Xuelian Cheng
Jun Cheng
Chi Liu
Kaijing Zhou
Zongyuan Ge
27
1
0
11 Jun 2024
Video-based Exercise Classification and Activated Muscle Group
  Prediction with Hybrid X3D-SlowFast Network
Video-based Exercise Classification and Activated Muscle Group Prediction with Hybrid X3D-SlowFast Network
Manvik Pasula
Pramit Saha
18
0
0
10 Jun 2024
An Effective-Efficient Approach for Dense Multi-Label Action Detection
An Effective-Efficient Approach for Dense Multi-Label Action Detection
Faegheh Sardari
Armin Mustafa
Philip J. B. Jackson
Adrian Hilton
22
0
0
10 Jun 2024
AFF-ttention! Affordances and Attention models for Short-Term Object
  Interaction Anticipation
AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation
Lorenzo Mur-Labadia
Ruben Martinez-Cantin
Jose J. Guerrero
G. Farinella
Antonino Furnari
19
4
0
03 Jun 2024
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a
  Hybrid Model
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
Khaled Alomar
Halil Ibrahim Aysel
Xiaohao Cai
MedIm
ViT
17
7
0
02 Jun 2024
From Forest to Zoo: Great Ape Behavior Recognition with ChimpBehave
From Forest to Zoo: Great Ape Behavior Recognition with ChimpBehave
Michael Fuchs
Emilie Genty
Adrian Bangerter
Klaus Zuberbühler
Paul Cotofrei
14
2
0
30 May 2024
PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse
  PreTrained Models from the Wild
PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild
Kun Yuan
Hongbo Liu
Mading Li
Muyi Sun
Ming-hui Sun
Jiachao Gong
Jinhua Hao
Chao Zhou
Yansong Tang
ViT
36
5
0
28 May 2024
BaboonLand Dataset: Tracking Primates in the Wild and Automating
  Behaviour Recognition from Drone Videos
BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos
Isla Duporge
Maksim Kholiavchenko
Roi Harel
Scott Wolf
Daniel Rubenstein
...
Stephen Lee
Julie Barreau
Jenna Kline
Michelle Ramirez
Charles V. Stewart
14
7
0
27 May 2024
Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to
  Biological Motion Perception
Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception
Shuangpeng Han
Ziyu Wang
Mengmi Zhang
24
0
0
26 May 2024
Generative Artificial Intelligence: A Systematic Review and Applications
Generative Artificial Intelligence: A Systematic Review and Applications
S. S. Sengar
Affan Bin Hasan
Sanjay Kumar
Fiona Carroll
MedIm
23
46
0
17 May 2024
No Time to Waste: Squeeze Time into Channel for Mobile Video
  Understanding
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding
Yingjie Zhai
Wenshuo Li
Yehui Tang
Xinghao Chen
Yunhe Wang
ViT
17
0
0
14 May 2024
A Semantic and Motion-Aware Spatiotemporal Transformer Network for
  Action Detection
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection
Matthew Korban
Peter Youngs
Scott T. Acton
ViT
14
5
0
13 May 2024
A Survey on Backbones for Deep Video Action Recognition
A Survey on Backbones for Deep Video Action Recognition
Zixuan Tang
Youjun Zhao
Yuhang Wen
Mengyuan Liu
17
0
0
09 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World
  Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
76
35
0
06 May 2024
TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for
  Dynamic UAV-based Scenes
TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes
Christopher Maxey
Jaehoon Choi
Yonghan Lee
Hyungtae Lee
Dinesh Manocha
Heesung Kwon
19
1
0
04 May 2024
Multi-view Action Recognition via Directed Gromov-Wasserstein
  Discrepancy
Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy
Hoang-Quan Nguyen
Thanh-Dat Truong
Khoa Luu
27
1
0
02 May 2024
SFMViT: SlowFast Meet ViT in Chaotic World
SFMViT: SlowFast Meet ViT in Chaotic World
Jiaying Lin
Jiajun Wen
Mengyuan Liu
Jinfu Liu
Baiqiao Yin
Yue Li
ViT
25
1
0
25 Apr 2024
Mamba-360: Survey of State Space Models as Transformer Alternative for
  Long Sequence Modelling: Methods, Applications, and Challenges
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
19
37
0
24 Apr 2024
DeepLocalization: Using change point detection for Temporal Action
  Localization
DeepLocalization: Using change point detection for Temporal Action Localization
Mohammed Shaiqur Rahman
Ibne Farabi Shihab
Lynna Chu
Anuj Sharma
32
1
0
18 Apr 2024
Simultaneous Detection and Interaction Reasoning for Object-Centric
  Action Recognition
Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition
Xunsong Li
Pengzhan Sun
Yangcen Liu
Lixin Duan
Wen Li
25
1
0
18 Apr 2024
Application of Deep Learning Methods to Processing of Noisy Medical
  Video Data
Application of Deep Learning Methods to Processing of Noisy Medical Video Data
Danil Afonchikov
E. Kornaeva
Irina Makovik
Alexey Kornaev
18
0
0
16 Apr 2024
The 8th AI City Challenge
The 8th AI City Challenge
Shuo Wang
D. Anastasiu
Zhenghang Tang
Ming-Ching Chang
Yue Yao
...
Xunlei Wu
S. Pusegaonkar
Yizhou Wang
Sujit Biswas
Rama Chellappa
20
31
0
15 Apr 2024
Multimodal Attack Detection for Action Recognition Models
Multimodal Attack Detection for Action Recognition Models
Furkan Mumcu
Yasin Yılmaz
AAML
20
1
0
13 Apr 2024
An Animation-based Augmentation Approach for Action Recognition from
  Discontinuous Video
An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video
Xingyu Song
Zhan Li
Shi Chen
Xin-Qiang Cai
K. Demachi
21
2
0
10 Apr 2024
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
Yingsen Zeng
Yujie Zhong
Chengjian Feng
Lin Ma
50
7
0
07 Apr 2024
Koala: Key frame-conditioned long video-LLM
Koala: Key frame-conditioned long video-LLM
Reuben Tan
Ximeng Sun
Ping Hu
Jui-hsien Wang
Hanieh Deilamsalehy
Bryan A. Plummer
Bryan C. Russell
Kate Saenko
25
35
0
05 Apr 2024
Learning Correlation Structures for Vision Transformers
Learning Correlation Structures for Vision Transformers
Manjin Kim
Paul Hongsuck Seo
Cordelia Schmid
Minsu Cho
ViT
16
7
0
05 Apr 2024
A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection
A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection
Chih-Chung Hsu
Chia-Ming Lee
Chiang Fan Yang
Yi-Shiuan Chou
Chih-Yu Jiang
Shen-Chieh Tai
Chin-Han Tsai
20
0
0
02 Apr 2024
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction
Petros Toupas
Zhewen Yu
C. Bouganis
Dimitrios Tzovaras
25
0
0
27 Mar 2024
OmniVid: A Generative Framework for Universal Video Understanding
OmniVid: A Generative Framework for Universal Video Understanding
Junke Wang
Dongdong Chen
Chong Luo
Bo He
Lu Yuan
Zuxuan Wu
Yu-Gang Jiang
VLM
VGen
63
14
0
26 Mar 2024
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for
  Faster Inference
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference
Tanvir Mahmud
Burhaneddin Yaman
Chun-Hao Liu
Diana Marculescu
23
2
0
24 Mar 2024
Selective, Interpretable, and Motion Consistent Privacy Attribute
  Obfuscation for Action Recognition
Selective, Interpretable, and Motion Consistent Privacy Attribute Obfuscation for Action Recognition
Filip Ilic
Henghui Zhao
T. Pock
Richard P. Wildes
PICV
AAML
23
2
0
19 Mar 2024
ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation
  for Event-based Action Recognition and More
ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation for Event-based Action Recognition and More
Jiazhou Zhou
Xueye Zheng
Yuanhuiyi Lyu
Lin Wang
64
17
0
19 Mar 2024
IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video
  Action Counting
IVAC-P2L: Leveraging Irregular Repetition Priors for Improving Video Action Counting
Hang Wang
Zhi-Qi Cheng
Youtian Du
Lei Zhang
11
1
0
18 Mar 2024
On the Utility of 3D Hand Poses for Action Recognition
On the Utility of 3D Hand Poses for Action Recognition
Md Salman Shamil
Dibyadip Chatterjee
Fadime Sener
Shugao Ma
Angela Yao
27
5
0
14 Mar 2024
Previous
12345...91011
Next