Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.08680
Cited By
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
19 April 2022
Wang Zeng
Sheng Jin
Wentao Liu
Chao Qian
Ping Luo
Ouyang Wanli
Xiaogang Wang
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer"
20 / 20 papers shown
Title
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
W. Xu
Shibiao Xu
ViT
48
0
0
06 May 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
59
0
0
03 Apr 2025
CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-Resolution
Xin Liu
Jie Liu
J. Tang
Gangshan Wu
SupR
ViT
54
0
0
10 Mar 2025
Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation
Zhengwen Shen
Yulian Li
Han Zhang
Yuchen Weng
Jun Wang
35
0
0
19 Jan 2025
Brain-Inspired Stepwise Patch Merging for Vision Transformers
Yonghao Yu
Dongcheng Zhao
Guobin Shen
Yiting Dong
Yi Zeng
34
0
0
11 Sep 2024
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network
Xinyi Zhang
Qiqi Bao
Qinpeng Cui
Wenming Yang
Qingmin Liao
3DH
Mamba
26
1
0
06 Aug 2024
GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation
Haonan Wang
Jie Liu
Jie Tang
Gangshan Wu
Bo Xu
Y. Kevin Chou
Yong Wang
ViT
24
2
0
15 Jul 2024
PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation
Nermin Samet
Cédric Rommel
David Picard
Eduardo Valle
DiffM
42
0
0
14 Jul 2024
Arena: A Patch-of-Interest ViT Inference Acceleration System for Edge-Assisted Video Analytics
Haosong Peng
Wei Feng
Hao Li
Yufeng Zhan
Qihua Zhou
Yuanqing Xia
16
2
0
14 Apr 2024
Markerless human pose estimation for biomedical applications: a survey
Andrea Avogaro
Federico Cunico
Bodo Rosenhahn
Francesco Setti
3DH
14
14
0
01 Aug 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
11
0
0
29 Jul 2023
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
Peng Jin
Jinfa Huang
Pengfei Xiong
Shangxuan Tian
Chang-rui Liu
Xiang Ji
Li-ming Yuan
Jie Chen
23
48
0
25 Mar 2023
Learning Hierarchical Image Segmentation For Recognition and By Recognition
Tsung-Wei Ke
Sangwoo Mo
Stella X. Yu
VLM
22
9
0
01 Oct 2022
G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model
Pan Xie
Qipeng Zhang
Zexian Li
Hao Tang
Yao Du
Xiaohui Hu
DiffM
19
12
0
19 Aug 2022
PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation
Wentao Jiang
Sheng Jin
Wentao Liu
Chao Qian
Ping Luo
Sishuo Liu
ViT
19
23
0
16 Aug 2022
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
Whole-Body Human Pose Estimation in the Wild
Sheng Jin
Lumin Xu
Jin Xu
Can Wang
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
3DH
130
235
0
23 Jul 2020
Single-Network Whole-Body Pose Estimation
Gines Hidalgo
Yaadhav Raaj
Haroon Idrees
Donglai Xiang
Hanbyul Joo
Tomas Simon
Yaser Sheikh
3DH
115
100
0
30 Sep 2019
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
190
3,480
0
20 Aug 2019
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
279
39,083
0
01 Sep 2014
1