Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.04730
Cited By
X3D: Expanding Architectures for Efficient Video Recognition
9 April 2020
Christoph Feichtenhofer
Re-assign community
ArXiv
PDF
HTML
Papers citing
"X3D: Expanding Architectures for Efficient Video Recognition"
50 / 526 papers shown
Title
AnimalMotionCLIP: Embedding motion in CLIP for Animal Behavior Analysis
Enmin Zhong
Carlos R. del-Blanco
Daniel Berjón
F. Jaureguizar
Narciso N. García
24
0
0
30 Apr 2025
Beyond the Horizon: Decoupling UAVs Multi-View Action Recognition via Partial Order Transfer
Wenxuan Liu
X. Zhong
Zhuo Zhou
S. Yang
Chia-Wen Lin
Alex Chichung Kot
32
0
0
29 Apr 2025
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition
Jongseo Lee
Wooil Lee
Gyeong-Moon Park
Seong Tae Kim
Jinwoo Choi
28
0
0
17 Apr 2025
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results
Xin Li
Kun Yuan
B. Li
Fengbin Guan
Yizhen Shao
...
Guohua Zhang
Z. Huang
Y. Deng
Qingmiao Jiang
Lu Chen
45
7
0
17 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
98
0
0
17 Apr 2025
Exploring Video-Based Driver Activity Recognition under Noisy Labels
Linjuan Fan
Di Wen
Kunyu Peng
Kailun Yang
J. Zhang
...
Yufan Chen
Junwei Zheng
Jiamin Wu
Xudong Han
Rainer Stiefelhagen
NoLa
45
0
0
16 Apr 2025
Decision-based AI Visual Navigation for Cardiac Ultrasounds
Andy Dimnaku
Dominic Yurk
Zhiyuan Gao
Arun Padmanabhan
Mandar Aras
Yaser Abu-Mostafa
31
0
0
16 Apr 2025
LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding
Ziyi Wang
Haoran Wu
Yiming Rong
Deyang Jiang
Yixin Zhang
Y. Zhao
Shuang Xu
Bo Xu
VLM
41
0
0
09 Apr 2025
Human Activity Recognition using RGB-Event based Sensors: A Multi-modal Heat Conduction Model and A Benchmark Dataset
Shiao Wang
X. Wang
Bo Jiang
Lin Zhu
G. Li
Y. Wang
Yonghong Tian
Jin Tang
45
0
0
08 Apr 2025
CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition
Jongseo Lee
Joohyun Chang
Dongho Lee
Jinwoo Choi
46
0
0
30 Mar 2025
Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation
Jonathan Attard
Dylan Seychell
41
0
0
27 Mar 2025
ATARS: An Aerial Traffic Atomic Activity Recognition and Temporal Segmentation Dataset
Zihao Chen
Hsuanyu Wu
Chi-Hsi Kung
Yi-Ting Chen
Yan-Tsung Peng
29
0
0
24 Mar 2025
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
50
0
0
17 Mar 2025
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Shristi Das Biswas
Efstathia Soufleri
Arani Roy
Kaushik Roy
44
0
0
17 Mar 2025
VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining
Yunze Liu
Peiran Wu
C. Liang
Junxiao Shen
Limin Wang
Li Yi
Mamba
35
0
0
16 Mar 2025
STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications
Andrew Gao
Jun Liu
AI4TS
53
0
0
11 Mar 2025
VoD: Learning Volume of Differences for Video-Based Deepfake Detection
Ying Xu
Marius Pedersen
Kiran Raja
29
0
0
10 Mar 2025
Sign Language Translation using Frame and Event Stream: Benchmark Dataset and Algorithms
X. Wang
Y. Li
Fuling Wang
Bo Jiang
Y. Wang
Yonghong Tian
Jin Tang
Bin Luo
SLR
74
0
0
09 Mar 2025
Game State and Spatio-temporal Action Detection in Soccer using Graph Neural Networks and 3D Convolutional Networks
Jeremie Ochin
Guillaume Devineau
Bogdan Stanciulescu
Sotiris Manitsaris
3DPC
61
0
0
24 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
31
0
0
11 Feb 2025
Can masking background and object reduce static bias for zero-shot action recognition?
Takumi Fukuzawa
Kensho Hara
Hirokatsu Kataoka
Toru Tamaki
30
0
0
22 Jan 2025
Human Activity Recognition in an Open World
D. Prijatelj
Samuel Grieggs
Jin Huang
Dawei Du
Ameya Shringi
Christopher Funk
Adam Kaufman
Eric Robertson
Walter J. Scheirer University of Notre Dame
53
3
0
17 Jan 2025
Localization-Aware Multi-Scale Representation Learning for Repetitive Action Counting
Sujia Wang
Xiangwei Shen
Yansong Tang
Xin Luna Dong
Wenjia Geng
Lei Chen
36
0
0
13 Jan 2025
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Yulin Wang
Haoji Zhang
Yang Yue
Shiji Song
Chao Deng
Junlan Feng
Gao Huang
72
3
0
15 Dec 2024
Streaming Detection of Queried Event Start
Cristobal Eyzaguirre
Eric Tang
S. Buch
Adrien Gaidon
Jiajun Wu
Juan Carlos Niebles
69
0
0
04 Dec 2024
Progress-Aware Video Frame Captioning
Zihui Xue
Joungbin An
Xitong Yang
Kristen Grauman
90
1
0
03 Dec 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
EGVM
114
1
0
25 Nov 2024
OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions
Guanyu Zhou
Wenxuan Liu
Wenxin Huang
Xuemei Jia
X. Zhong
Chia-Wen Lin
CML
64
0
0
24 Nov 2024
When Spatial meets Temporal in Action Recognition
H. Chen
Lei Wang
Y. Chen
Tom Gedeon
Piotr Koniusz
75
0
0
22 Nov 2024
Principles of Visual Tokens for Efficient Video Understanding
Xinyue Hao
Gen Li
Shreyank N. Gowda
Robert B Fisher
Jonathan Huang
Anurag Arnab
Laura Sevilla-Lara
64
0
0
20 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Wentao Bao
K. Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLM
ObjD
32
2
0
17 Nov 2024
AM Flow: Adapters for Temporal Processing in Action Recognition
Tanay Agrawal
Abid Ali
A. Dantcheva
François Brémond
21
0
0
04 Nov 2024
First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Atomic Activity Recognition 2024
Ruyang Li
Tengfei Zhang
Heng Zhang
Tiejun Liu
Yanwei Wang
Xuelei Li
17
0
0
30 Oct 2024
Enhancing Action Recognition by Leveraging the Hierarchical Structure of Actions and Textual Context
Manuel Benavent-Lledo
David Mulero-Pérez
David Ortiz-Perez
José García Rodríguez
Antonis Argyros
19
0
0
28 Oct 2024
Detecting Adversarial Examples
Furkan Mumcu
Yasin Yilmaz
AAML
13
0
0
22 Oct 2024
SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition
Jiaqi Chen
Yan Yang
Shizhuo Deng
Da Teng
Liyuan Pan
Mamba
19
0
0
22 Oct 2024
Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024
Jiamin Cao
Lingqi Wang
Kexin Zhang
Y. Yang
Licheng Jiao
Yuwei Guo
19
0
0
21 Oct 2024
Storyboard guided Alignment for Fine-grained Video Action Recognition
Enqi Liu
Liyuan Pan
Yan Yang
Yiran Zhong
Zhijing Wu
Xinxiao Wu
Liu Liu
18
0
0
18 Oct 2024
Evaluating Model Performance with Hard-Swish Activation Function Adjustments
Sai Abhinav Pydimarry
Shekhar Madhav Khairnar
Sofia Garces Palacios
Ganesh Sankaranarayanan
Darian Hoagland
Dmitry Nepomnayshy
Huu Phong Nguyen
18
0
0
09 Oct 2024
Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for Action Detection
Zhe Luo
Weina Fu
Shuai Liu
Saeed Anwar
Muhammad Saqib
Sambit Bakshi
Khan Muhammad
24
2
0
08 Oct 2024
Enhancing Temporal Modeling of Video LLMs via Time Gating
Zi-Yuan Hu
Yiwu Zhong
Shijia Huang
M. Lyu
Liwei Wang
VLM
14
0
0
08 Oct 2024
Loose Social-Interaction Recognition in Real-world Therapy Scenarios
Abid Ali
Rui Dai
Ashish Marisetty
Guillaume Astruc
Monique Thonnat
J. Odobez
Susanne Thümmler
Francois Bremond
24
1
0
30 Sep 2024
CycleCrash: A Dataset of Bicycle Collision Videos for Collision Prediction and Analysis
Nishq Poorav Desai
Ali Etemad
Michael A. Greenspan
18
0
0
30 Sep 2024
Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment
Keyne Oei
Amr Gomaa
Anna Maria Feit
João Belo
18
0
0
06 Sep 2024
GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting
Jun Li
Jinying Wu
Qiming Li
Feifei Guo
16
0
0
31 Aug 2024
Towards Infusing Auxiliary Knowledge for Distracted Driver Detection
Ishwar B Balappanawar
Ashmit Chamoli
Ruwan Wickramarachchi
Aditya Mishra
Ponnurangam Kumaraguru
Amit P. Sheth
15
0
0
29 Aug 2024
Online pre-training with long-form videos
Itsuki Kato
Kodai Kamiya
Toru Tamaki
OnRL
16
0
0
28 Aug 2024
HabitAction: A Video Dataset for Human Habitual Behavior Recognition
Hongwu Li
Zhenliang Zhang
Wei Wang
16
0
0
24 Aug 2024
Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms
Xiao Wang
Shiao Wang
Pengpeng Shao
Bo Jiang
Lin Zhu
Yonghong Tian
55
1
0
19 Aug 2024
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition
Ahmed Abdelkawy
Asem A. Ali
Aly A. Farag
3DPC
18
0
0
10 Aug 2024
1
2
3
4
...
9
10
11
Next