Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.04246
Cited By
ViTPose++: Vision Transformer for Generic Body Pose Estimation
7 December 2022
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ViTPose++: Vision Transformer for Generic Body Pose Estimation"
17 / 17 papers shown
Title
Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer
Daniel Kienzle
Robin Schon
Rainer Lienhart
ShinÍchi Satoh
63
0
0
28 Apr 2025
Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video
Runyang Feng
Haoming Chen
3DH
42
0
0
15 Feb 2025
Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images
Wei-Lun Chen
Chia-Yeh Hsieh
Yu-Hsiang Kao
Kai-Chun Liu
Sheng-Yu Peng
Yu Tsao
82
0
0
30 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLM
VLM
LRM
91
45
0
03 Jan 2025
Overview of MWE history, challenges, and horizons: standing at the 20th anniversary of the MWE workshop series via MWE-UD2024
Lifeng Han
Kilian Evang
Archna Bhatia
Gosse Bouma
A. Seza Doğruöz
Marcos Garcia
Voula Giouli
Joakim Nivre
Alexandre Rademacher
AI4TS
33
1
0
25 Dec 2024
Automatic Ultrasound Curve Angle Measurement via Affinity Clustering for Adolescent Idiopathic Scoliosis Evaluation
Yihao Zhou
T. Lee
K. Lai
Chonglin Wu
Hin Ting Lau
...
Shing-Chow Chan
W. Chu
J. C. Cheng
Tsz-Ping Lam
Yongping Zheng
25
1
0
06 May 2024
Mobile Foundation Model as Firmware
Jinliang Yuan
Chenchen Yang
Dongqi Cai
Shihe Wang
Xin Yuan
...
Di Zhang
Hanzi Mei
Xianqing Jia
Shangguang Wang
Mengwei Xu
27
19
0
28 Aug 2023
Revealing the Dark Secrets of Masked Image Modeling
Zhenda Xie
Zigang Geng
Jingcheng Hu
Zheng-Wei Zhang
Han Hu
Yue Cao
VLM
186
105
0
26 May 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
239
2,554
0
04 May 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
TransPose: Keypoint Localization via Transformer
Sen Yang
Zhibin Quan
Mu Nie
Wankou Yang
ViT
132
252
0
28 Dec 2020
Whole-Body Human Pose Estimation in the Wild
Sheng Jin
Lumin Xu
Jin Xu
Can Wang
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
3DH
130
235
0
23 Jul 2020
Learning Delicate Local Representations for Multi-Person Pose Estimation
Yuanhao Cai
Zhicheng Wang
Zhengxiong Luo
Binyi Yin
Angang Du
Haoqian Wang
X. Zhang
Xinyu Zhou
Erjin Zhou
Jian-jun Sun
103
169
0
09 Mar 2020
Towards High Performance Human Keypoint Detection
Jing Zhang
Zhe Chen
Dacheng Tao
3DH
80
70
0
03 Feb 2020
Single-Stage Multi-Person Pose Machines
Xuecheng Nie
Jianfeng Zhang
Shuicheng Yan
Jiashi Feng
3DH
106
217
0
24 Aug 2019
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi
Jose Caballero
Ferenc Huszár
J. Totz
Andrew P. Aitken
Rob Bishop
Daniel Rueckert
Zehan Wang
SupR
183
5,138
0
16 Sep 2016
1