X3D: Expanding Architectures for Efficient Video Recognition

9 April 2020
Christoph Feichtenhofer

Papers citing "X3D: Expanding Architectures for Efficient Video Recognition"

50 / 526 papers shown
Title
Transformers in Action Recognition: A Review on Temporal Modeling
Elham Shabaninia
Hossein Nezamabadi-pour
Fatemeh Shafizadegan
ViT
19
8
0
29 Dec 2022
Joint Engagement Classification using Video Augmentation Techniques for Multi-person Human-robot Interaction
Y. Kim
Huili Chen
Sharifa Alghowinem
C. Breazeal
Hae Won Park
8
2
0
28 Dec 2022
A Survey on Human Action Recognition
Zhou Shuchang
16
0
0
20 Dec 2022
Policy Adaptation from Foundation Model Feedback
Yuying Ge
Annabella Macaluso
Erran L. Li
Ping Luo
Xiaolong Wang
LM&Ro
14
11
0
14 Dec 2022
Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action Recognition
Dawei Du
Ameya Shringi
A. Hoogs
Christopher Funk
13
2
0
12 Dec 2022
Cross-Modal Learning with 3D Deformable Attention for Action Recognition
Sangwon Kim
Dasom Ahn
ByoungChul Ko
ViT
3DPC
20
22
0
12 Dec 2022
VindLU: A Recipe for Effective Video-and-Language Pretraining
Feng Cheng
Xizi Wang
Jie Lei
David J. Crandall
Mohit Bansal
Gedas Bertasius
VLM
9
78
0
09 Dec 2022
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
8
2
0
09 Dec 2022
Deep Architectures for Content Moderation and Movie Content Rating
Fatih Çagatay Akyön
A. Temizel
20
4
0
08 Dec 2022
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Lu Yuan
Yu-Gang Jiang
VGen
11
86
0
08 Dec 2022
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning
A. Piergiovanni
Weicheng Kuo
A. Angelova
ViT
21
54
0
06 Dec 2022
VLG: General Video Recognition with Web Textual Knowledge
Jintao Lin
Zhaoyang Liu
Wenhai Wang
Wayne Wu
Limin Wang
24
0
0
03 Dec 2022
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Chen Zhao
Shuming Liu
K. Mangalam
Bernard Ghanem
8
17
0
25 Nov 2022
Towards Good Practices for Missing Modality Robust Action Recognition
Sangmin Woo
Sumin Lee
Yeonju Park
Muhammad Adi Nugroho
Changick Kim
22
42
0
25 Nov 2022
Video Test-Time Adaptation for Action Recognition
Wei Lin
M. Jehanzeb Mirza
Mateusz Koziński
Horst Possegger
Hilde Kuehne
Horst Bischof
TTA
29
31
0
24 Nov 2022
SVFormer: Semi-supervised Video Transformer for Action Recognition
Zhen Xing
Qi Dai
Hang-Rui Hu
Jingjing Chen
Zuxuan Wu
Yu-Gang Jiang
ViT
14
67
0
23 Nov 2022
Query Efficient Cross-Dataset Transferable Black-Box Attack on Action Recognition
Rohit Gupta
Naveed Akhtar
Gaurav Kumar Nayak
Ajmal Saeed Mian
M. Shah
AAML
16
1
0
23 Nov 2022
Dynamic Appearance: A Video Representation for Action Recognition with Joint Training
Guoxi Huang
A. Bors
11
1
0
23 Nov 2022
MINTIME: Multi-Identity Size-Invariant Video Deepfake Detection
D. Coccomini
Giorgos Kordopatis-Zilos
Giuseppe Amato
R. Caldelli
Fabrizio Falchi
Symeon Papadopoulos
Claudio Gennaro
17
14
0
20 Nov 2022
HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors
Xiao Wang
Zong-Yao Wu
Bowei Jiang
Zhimin Bao
Lin Zhu
Guoqiu Li
Yaowei Wang
Yonghong Tian
18
36
0
17 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
10
75
0
17 Nov 2022
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Guo Chen
Sen Xing
Zhe Chen
Yi Wang
Kunchang Li
...
Hongjie Zhang
Tong Lu
Yali Wang
Liming Wang
Yu Qiao
28
46
0
17 Nov 2022
Structured Pruning Adapters
Lukas Hedegaard
Aman Alok
Juby Jose
Alexandros Iosifidis
25
10
0
17 Nov 2022
Token Turing Machines
Michael S. Ryoo
K. Gopalakrishnan
Kumara Kahatapitiya
Ted Xiao
Kanishka Rao
Austin Stone
Yao Lu
Julian Ibarz
Anurag Arnab
24
21
0
16 Nov 2022
Dynamic Temporal Filtering in Video Models
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Chong-Wah Ngo
Tao Mei
AI4TS
6
17
0
15 Nov 2022
Eat-Radar: Continuous Fine-Grained Intake Gesture Detection Using FMCW Radar and 3D Temporal Convolutional Network with Attention
C. Wang
T. S. Kumar
W. de Raedt
Guido Camps
Hans Hallez
Bart Vanrumste
8
6
0
08 Nov 2022
Bringing Online Egocentric Action Recognition into the wild
Gabriele Goletto
M. Planamente
Barbara Caputo
Giuseppe Averta
EgoV
11
3
0
06 Nov 2022
Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks
M. Kowal
Mennatullah Siam
Md. Amirul Islam
Neil D. B. Bruce
Richard P. Wildes
Konstantinos G. Derpanis
FAtt
4
3
0
03 Nov 2022
Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning
Yixuan Pei
Zhiwu Qing
Jun Cen
Xiang Wang
Shiwei Zhang
Yaxiong Wang
Mingqian Tang
Nong Sang
Xueming Qian
4
13
0
02 Nov 2022
TAMFormer: Multi-Modal Transformer with Learned Attention Mask for Early Intent Prediction
Nada Osman
Guglielmo Camporese
Lamberto Ballan
15
8
0
26 Oct 2022
GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction
Samrudhdhi B. Rangrej
Kevin J Liang
Tal Hassner
James J. Clark
17
3
0
24 Oct 2022
Holistic Interaction Transformer Network for Action Detection
Gueter Josmy Faure
Min-Hung Chen
S. Lai
31
36
0
23 Oct 2022
Linear Video Transformer with Feature Fixation
Kaiyue Lu
Zexia Liu
Jianyuan Wang
Weixuan Sun
Zhen Qin
...
Xuyang Shen
Huizhong Deng
Xiaodong Han
Yuchao Dai
Yiran Zhong
20
4
0
15 Oct 2022
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition
Dasom Ahn
Sangwon Kim
H. Hong
ByoungChul Ko
ViT
21
92
0
14 Oct 2022
Overlooked Video Classification in Weakly Supervised Video Anomaly Detection
Weijun Tan
Qi Yao
Jingfeng Liu
AI4TS
9
10
0
13 Oct 2022
DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition
Haodong Duan
Jiaqi Wang
Kai-xiang Chen
Dahua Lin
22
28
0
12 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Yuxin Song
Min Yang
Wenhao Wu
Dongliang He
Fu Li
Jingdong Wang
ViT
85
8
0
11 Oct 2022
DCVQE: A Hierarchical Transformer for Video Quality Assessment
Zu-Hua Li
Lei Yang
ViT
19
2
0
10 Oct 2022
Fast and Robust Video-Based Exercise Classification via Body Pose Tracking and Scalable Multivariate Time Series Classifiers
Ashish Singh
Antonio Bevilacqua
Thach le Nguyen
Feiyan Hu
Kevin McGuinness
Martin O'Reilly
D. Whelan
Brian Caulfield
Georgiana Ifrim
25
12
0
02 Oct 2022
AdaFocusV3: On Unified Spatial-temporal Dynamic Video Recognition
Yulin Wang
Yang Yue
Xin-Wen Xu
Ali Hassani
V. Kulikov
Nikita Orlov
S. Song
Humphrey Shi
Gao Huang
11
17
0
27 Sep 2022
Rethinking Resolution in the Context of Efficient Video Recognition
Chuofan Ma
Qiushan Guo
Yi-Xin Jiang
Zehuan Yuan
Ping Luo
Xiaojuan Qi
54
12
0
26 Sep 2022
Multi-dataset Training of Transformers for Robust Action Recognition
Junwei Liang
Enwei Zhang
Jun Zhang
Chunhua Shen
ViT
29
11
0
26 Sep 2022
Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition from Egocentric RGB Videos
Yilin Wen
Hao Pan
Lei Yang
Jia-Yu Pan
Taku Komura
Wenping Wang
35
26
0
20 Sep 2022
Real-time Online Video Detection with Temporal Smoothing Transformers
Yue Zhao
Philipp Krähenbühl
ViT
69
56
0
19 Sep 2022
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain
Francesco Ragusa
Antonino Furnari
G. Farinella
EgoV
28
23
0
19 Sep 2022
Differentiable Frequency-based Disentanglement for Aerial Video Action Recognition
D. Kothandaraman
Ming-Shun Lin
Dinesh Manocha
17
6
0
15 Sep 2022
On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition
Farrukh Rahman
Ömer Mubarek
Z. Kira
ViT
10
2
0
15 Sep 2022
Spatio-Temporal Action Detection Under Large Motion
Gurkirt Singh
Vasileios Choutas
Suman Saha
F. I. F. Richard Yu
Luc Van Gool
10
11
0
06 Sep 2022
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition
Tianjiao Li
Lin Geng Foo
Qiuhong Ke
Hossein Rahmani
Anran Wang
Jinghua Wang
J. Liu
8
21
0
03 Sep 2022
ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
16
4
0
31 Aug 2022