ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.09408
  4. Cited By
HRFormer: High-Resolution Transformer for Dense Prediction

HRFormer: High-Resolution Transformer for Dense Prediction

18 October 2021
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
    ViT
ArXivPDFHTML

Papers citing "HRFormer: High-Resolution Transformer for Dense Prediction"

50 / 128 papers shown
Title
GaitRef: Gait Recognition with Refined Sequential Skeletons
GaitRef: Gait Recognition with Refined Sequential Skeletons
Haidong Zhu
Wanrong Zheng
Zhao-Heng Zheng
Ramkant Nevatia
CVBM
30
17
0
16 Apr 2023
PP-MobileSeg: Explore the Fast and Accurate Semantic Segmentation Model
  on Mobile Devices
PP-MobileSeg: Explore the Fast and Accurate Semantic Segmentation Model on Mobile Devices
Shiyu Tang
Ting Sun
Juncai Peng
Guowei Chen
Yuying Hao
Manhui Lin
Z. Xiao
Jiangbin You
Yi Liu
ViT
17
14
0
11 Apr 2023
PSLT: A Light-weight Vision Transformer with Ladder Self-Attention and
  Progressive Shift
PSLT: A Light-weight Vision Transformer with Ladder Self-Attention and Progressive Shift
Gaojie Wu
Weishi Zheng
Yutong Lu
Q. Tian
ViT
40
15
0
07 Apr 2023
All Keypoints You Need: Detecting Arbitrary Keypoints on the Body of
  Triple, High, and Long Jump Athletes
All Keypoints You Need: Detecting Arbitrary Keypoints on the Body of Triple, High, and Long Jump Athletes
K. Ludwig
Julian Lorenz
Robin Schon
Rainer Lienhart
3DH
13
9
0
06 Apr 2023
Recurrence without Recurrence: Stable Video Landmark Detection with Deep
  Equilibrium Models
Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models
P. Micaelli
Arash Vahdat
Hongxu Yin
Jan Kautz
Pavlo Molchanov
17
17
0
02 Apr 2023
Beyond Appearance: a Semantic Controllable Self-Supervised Learning
  Framework for Human-Centric Visual Tasks
Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks
Weihua Chen
Xianzhe Xu
Jian Jia
Haowen Luo
Yaohua Wang
F. Wang
Rong Jin
Xiuyu Sun
SSL
31
93
0
30 Mar 2023
Global Relation Modeling and Refinement for Bottom-Up Human Pose
  Estimation
Global Relation Modeling and Refinement for Bottom-Up Human Pose Estimation
Ruoqi Yin
Jianqin Yin
3DH
22
0
0
27 Mar 2023
MonoATT: Online Monocular 3D Object Detection with Adaptive Token
  Transformer
MonoATT: Online Monocular 3D Object Detection with Adaptive Token Transformer
Yunsong Zhou
Hongzi Zhu
Quan Liu
Shan Chang
Minyi Guo
ViT
51
25
0
23 Mar 2023
Human Pose as Compositional Tokens
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
23
47
0
21 Mar 2023
High-level Feature Guided Decoding for Semantic Segmentation
High-level Feature Guided Decoding for Semantic Segmentation
Ye Huang
Di Kang
Shenghua Gao
Wen Li
Lixin Duan
18
0
0
15 Mar 2023
TransMatting: Tri-token Equipped Transformer Model for Image Matting
TransMatting: Tri-token Equipped Transformer Model for Image Matting
Huanqia Cai
Fanglei Xue
Lele Xu
Lili Guo
ViT
15
3
0
11 Mar 2023
HumanBench: Towards General Human-centric Perception with Projector
  Assisted Pretraining
HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining
Shixiang Tang
Cheng Chen
Qingsong Xie
Meilin Chen
Yizhou Wang
...
Feng Zhu
Haiyang Yang
Li Yi
Rui Zhao
Wanli Ouyang
VLM
14
35
0
10 Mar 2023
Capturing the motion of every joint: 3D human pose and shape estimation
  with independent tokens
Capturing the motion of every joint: 3D human pose and shape estimation with independent tokens
Sen Yang
Wen Heng
Gang Liu
Guozhong Luo
Wankou Yang
Gang Yu
3DH
ViT
18
11
0
01 Mar 2023
CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction
CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction
Gang Zhang
Zi-Hua Li
Chufeng Tang
Jianmin Li
Xiaolin Hu
24
15
0
13 Feb 2023
CARD: Semantic Segmentation with Efficient Class-Aware Regularized
  Decoder
CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder
Ye Huang
Di Kang
Liang Chen
W. Jia
Xiangjian He
Lixin Duan
Xuefei Zhe
Linchao Bao
32
2
0
11 Jan 2023
HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection
HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection
Bin Tang
Zhengyi Liu
Yacheng Tan
Qian He
ViT
24
76
0
08 Jan 2023
Representation Separation for Semantic Segmentation with Vision
  Transformers
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
19
5
0
28 Dec 2022
Bridging the Domain Gap in Satellite Pose Estimation: a Self-Training
  Approach based on Geometrical Constraints
Bridging the Domain Gap in Satellite Pose Estimation: a Self-Training Approach based on Geometrical Constraints
Zi Wang
Minglin Chen
Yulan Guo
Zhang Li
Qifeng Yu
15
33
0
23 Dec 2022
Multi-Scale Feature Fusion Transformer Network for End-to-End Single
  Channel Speech Separation
Multi-Scale Feature Fusion Transformer Network for End-to-End Single Channel Speech Separation
Yinhao Xu
Jian Zhou
L. Tao
H. Kwan
12
0
0
14 Dec 2022
Video Prediction by Efficient Transformers
Video Prediction by Efficient Transformers
Xi Ye
Guillaume-Alexandre Bilodeau
ViT
28
33
0
12 Dec 2022
ViTPose++: Vision Transformer for Generic Body Pose Estimation
ViTPose++: Vision Transformer for Generic Body Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
32
40
0
07 Dec 2022
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for
  Semantic Segmentation
IncepFormer: Efficient Inception Transformer with Pyramid Pooling for Semantic Segmentation
Lihua Fu
Haoyue Tian
Xiang Zhai
Pan Gao
Xiaojiang Peng
ViT
22
9
0
06 Dec 2022
RbA: Segmenting Unknown Regions Rejected by All
RbA: Segmenting Unknown Regions Rejected by All
Nazir Nayal
Mısra Yavuz
João F. Henriques
Fatma Guney
UQCV
19
46
0
25 Nov 2022
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context
  Propagation in Transformers
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers
Zhuo Huang
Zhiyou Zhao
Banghuai Li
Jungong Han
3DPC
ViT
23
55
0
23 Oct 2022
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View
  Completion
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion
Philippe Weinzaepfel
Vincent Leroy
Thomas Lucas
Romain Brégier
Yohann Cabon
Vaibhav Arora
L. Antsfeld
Boris Chidlovskii
G. Csurka
Jérôme Revaud
SSL
31
64
0
19 Oct 2022
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
Chen Liang
Wenguan Wang
Jiaxu Miao
Yi Yang
VLM
28
117
0
05 Oct 2022
Bridged Transformer for Vision and Point Cloud 3D Object Detection
Bridged Transformer for Vision and Point Cloud 3D Object Detection
Yikai Wang
Tengqi Ye
Lele Cao
Wen-bing Huang
Fuchun Sun
Fengxiang He
Dacheng Tao
ViT
32
34
0
04 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without
  Fine-tuning
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
22
25
0
03 Oct 2022
Heatmap Distribution Matching for Human Pose Estimation
Heatmap Distribution Matching for Human Pose Estimation
Haoxuan Qu
Li Xu
Yujun Cai
Lin Geng Foo
Jun Liu
6
14
0
03 Oct 2022
Effective Vision Transformer Training: A Data-Centric Perspective
Effective Vision Transformer Training: A Data-Centric Perspective
Benjia Zhou
Pichao Wang
Jun Wan
Yan-Ni Liang
Fan Wang
24
5
0
29 Sep 2022
MAFormer: A Transformer Network with Multi-scale Attention Fusion for
  Visual Recognition
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition
Y. Wang
H. Sun
Xiaodi Wang
Bin Zhang
Chaonan Li
Ying Xin
Baochang Zhang
Errui Ding
Shumin Han
ViT
23
9
0
31 Aug 2022
TransMatting: Enhancing Transparent Objects Matting with Transformers
TransMatting: Enhancing Transparent Objects Matting with Transformers
Huanqia Cai
Fanglei Xue
Lele Xu
Lili Guo
ViT
11
20
0
05 Aug 2022
Pose for Everything: Towards Category-Agnostic Pose Estimation
Pose for Everything: Towards Category-Agnostic Pose Estimation
Lumin Xu
Sheng Jin
Wang Zeng
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
8
35
0
21 Jul 2022
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Xiaokang Chen
Fangyun Wei
Gang Zeng
Jingdong Wang
ViT
19
33
0
18 Jul 2022
Tracking Objects as Pixel-wise Distributions
Tracking Objects as Pixel-wise Distributions
Zelin Zhao
Ze Wu
Yueqing Zhuang
Boxun Li
Jiaya Jia
VOT
26
54
0
12 Jul 2022
PseudoClick: Interactive Image Segmentation with Click Imitation
PseudoClick: Interactive Image Segmentation with Click Imitation
Qin Liu
Meng Zheng
Benjamin Planche
Srikrishna Karanam
Terrence Chen
Marc Niethammer
Ziyan Wu
VLM
38
57
0
12 Jul 2022
BMD-GAN: Bone mineral density estimation using x-ray image decomposition
  into projections of bone-segmented quantitative computed tomography using
  hierarchical learning
BMD-GAN: Bone mineral density estimation using x-ray image decomposition into projections of bone-segmented quantitative computed tomography using hierarchical learning
Yidong Gu
Y. Otake
Keisuke Uemura
Mazen Soufi
Masaki Takao
Nobuhiko Sugano
Yoshinobu Sato
22
5
0
07 Jul 2022
Learning Cross-Image Object Semantic Relation in Transformer for
  Few-Shot Fine-Grained Image Classification
Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification
Bo-Wen Zhang
Jiakang Yuan
Baopu Li
Tao Chen
Jiayuan Fan
Botian Shi
ViT
9
31
0
02 Jul 2022
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object
  Detection
HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection
Tim Broedermann
Christos Sakaridis
Dengxin Dai
Luc Van Gool
50
30
0
30 Jun 2022
I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose
  Estimation
I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation
Yiwei Ding
W. Deng
Yinglin Zheng
Peng Liu
Meihong Wang
Xuan Cheng
Jianmin Bao
Dong Chen
Ming Zeng
3DH
12
13
0
22 Jun 2022
APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
Yuxiang Yang
Junjie Yang
Yufei Xu
Jing Zhang
Long Lan
Dacheng Tao
13
38
0
12 Jun 2022
FeatER: An Efficient Network for Human Reconstruction via Feature
  Map-Based TransformER
FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER
Ce Zheng
Matías Mendieta
Taojiannan Yang
Guo-Jun Qi
C. L. P. Chen
ViT
3DH
14
14
0
30 May 2022
AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation
AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation
Xu Cao
Xiaoye Li
Liya Ma
Yi Huang
X. Feng
Zening Chen
H. Zeng
Jianguo Cao
ViT
11
21
0
11 May 2022
Activating More Pixels in Image Super-Resolution Transformer
Activating More Pixels in Image Super-Resolution Transformer
Xiangyu Chen
Xintao Wang
Jiantao Zhou
Yu Qiao
Chao Dong
ViT
59
598
0
09 May 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
22
509
0
26 Apr 2022
Progressive Training of A Two-Stage Framework for Video Restoration
Progressive Training of A Two-Stage Framework for Video Restoration
Mei Zheng
Qunliang Xing
Minglang Qiao
Mai Xu
Lai Jiang
Huaida Liu
Ying Chen
30
9
0
21 Apr 2022
Not All Tokens Are Equal: Human-centric Visual Analysis via Token
  Clustering Transformer
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
Wang Zeng
Sheng Jin
Wentao Liu
Chao Qian
Ping Luo
Ouyang Wanli
Xiaogang Wang
ViT
16
119
0
19 Apr 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
Wenqiang Zhang
Zilong Huang
Guozhong Luo
Tao Chen
Xinggang Wang
Wenyu Liu
Gang Yu
Chunhua Shen
ViT
11
196
0
12 Apr 2022
DaViT: Dual Attention Vision Transformers
DaViT: Dual Attention Vision Transformers
Mingyu Ding
Bin Xiao
Noel Codella
Ping Luo
Jingdong Wang
Lu Yuan
ViT
30
240
0
07 Apr 2022
MixFormer: Mixing Features across Windows and Dimensions
MixFormer: Mixing Features across Windows and Dimensions
Qiang Chen
Qiman Wu
Jian Wang
Qinghao Hu
T. Hu
Errui Ding
Jian Cheng
Jingdong Wang
MDE
ViT
10
101
0
06 Apr 2022
Previous
123
Next