HRFormer: High-Resolution Transformer for Dense Prediction

18 October 2021
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
    ViT

Papers citing "HRFormer: High-Resolution Transformer for Dense Prediction"

50 / 128 papers shown
A Novel Hybrid Approach for Retinal Vessel Segmentation with Dynamic Long-Range Dependency and Multi-Scale Retinal Edge Fusion Enhancement
Yihao Ouyang
Xunheng Kuang
Mengjia Xiong
Zhida Wang
Yuanquan Wang
38
0
0
18 Apr 2025
SRVP: Strong Recollection Video Prediction Model Using Attention-Based Spatiotemporal Correlation Fusion
Yuseon Kim
Kyongseok Park
29
0
0
10 Apr 2025
AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements
Calvin Yeung
Tomohiro Suzuki
Ryota Tanaka
Zhuoer Yin
Keisuke Fujii
3DH
68
1
0
10 Mar 2025
Transformers with Joint Tokens and Local-Global Attention for Efficient Human Pose Estimation
K. A. Kinfu
René Vidal
ViT
26
0
0
28 Feb 2025
PolaFormer: Polarity-aware Linear Attention for Vision Transformers
Weikang Meng
Yadan Luo
Xin Li
D. Jiang
Zheng Zhang
106
0
0
25 Jan 2025
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization
Xuekang Zhu
Xiaochen Ma
Lei Su
Zhuohang Jiang
Bo Du
Xiwen Wang
Zeyu Lei
Wentao Feng
Chi-Man Pun
Jizhe Zhou
AI4CE
62
3
0
18 Dec 2024
Multi-Exposure Image Fusion via Distilled 3D LUT Grid with Editable Mode
Xin Su
Zhuoran Zheng
62
0
0
18 Dec 2024
KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension
Jie-jin Yang
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chen Qian
Ruimao Zhang
MLLM
65
2
0
04 Nov 2024
Visual-Geometric Collaborative Guidance for Affordance Learning
Hongchen Luo
Wei-dong Zhai
J. Wang
Yang Cao
Zheng-jun Zha
20
0
0
15 Oct 2024
Occluded Human Pose Estimation based on Limb Joint Augmentation
Gangtao Han
Chunxiao Song
Song Wang
Hao Wang
Enqing Chen
Guanghui Wang
3DH
35
1
0
13 Oct 2024
HRVMamba: High-Resolution Visual State Space Model for Dense Prediction
Hao Zhang
Yongqiang Ma
Wenqi Shao
Ping Luo
Nanning Zheng
Kaipeng Zhang
Mamba
28
1
0
04 Oct 2024
SkinFormer: Learning Statistical Texture Representation with Transformer for Skin Lesion Segmentation
Rongtao Xu
Changwei Wang
Jiguang Zhang
Shibiao Xu
Weiliang Meng
Xiaopeng Zhang
ViT
MedIm
26
2
0
13 Sep 2024
GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions
Liang Feng
Zhixuan Shen
Lihua Wen
Shiyao Li
Ming Xu
CVBM
28
0
0
12 Sep 2024
MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
ViT
28
1
0
05 Sep 2024
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
F. Khan
Yanwei Pang
DiffM
35
1
0
05 Sep 2024
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
Zhijian Liu
Zhuoyang Zhang
Samir Khaki
Shang Yang
Haotian Tang
Chenfeng Xu
Kurt Keutzer
Song Han
SSeg
44
1
0
26 Jul 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
44
0
0
12 Jul 2024
Greit-HRNet: Grouped Lightweight High-Resolution Network for Human Pose Estimation
Junjia Han
3DH
16
0
0
10 Jul 2024
PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions
Sihan Ma
Jing Zhang
Qiong Cao
Dacheng Tao
24
2
0
20 Jun 2024
U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation
Bingyu Li
Da Zhang
Zhiyuan Zhao
Junyu Gao
Xuelong Li
28
5
0
24 May 2024
Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation
Daniel Kienzle
Marco Kantonis
Robin Schon
Rainer Lienhart
25
2
0
23 May 2024
SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation
Xiangyu Xu
Lijuan Liu
Shuicheng Yan
32
10
0
23 Apr 2024
Adaptive Patching for High-resolution Image Segmentation with Transformers
Enzhi Zhang
Isaac Lyngaas
Peng Chen
Xiao Wang
Jun Igarashi
Yuankai Huo
M. Wahib
M. Munetomo
MedIm
19
1
0
15 Apr 2024
Implicit and Explicit Language Guidance for Diffusion-based Visual Perception
Hefeng Wang
Jiale Cao
Jin Xie
Aiping Yang
Yanwei Pang
VLM
DiffM
35
2
0
11 Apr 2024
GaitSTR: Gait Recognition with Sequential Two-stream Refinement
Wanrong Zheng
Haidong Zhu
Zhao-Heng Zheng
Ramkant Nevatia
CVBM
38
5
0
02 Apr 2024
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Ting Yao
Yehao Li
Yingwei Pan
Tao Mei
ViT
23
15
0
18 Mar 2024
EfficientMorph: Parameter-Efficient Transformer-Based Architecture for 3D Image Registration
Abu Zahid Bin Aziz
Mokshagna Sai Teja Karanam
Tushar Kataria
Shireen Elhabian
ViT
MedIm
23
1
0
16 Mar 2024
LoLiSRFlow: Joint Single Image Low-light Enhancement and Super-resolution via Cross-scale Transformer-based Conditional Flow
Ziyu Yue
Jiaxin Gao
Sihan Xie
Yang Liu
Zhixun Su
18
1
0
29 Feb 2024
NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images
Jingrui Yu
Dipankar Nandi
Roman Seidel
G. Hirtz
3DH
32
0
0
28 Feb 2024
DRSI-Net: Dual-Residual Spatial Interaction Network for Multi-Person Pose Estimation
Shang Wu
Bin Wang
27
2
0
26 Feb 2024
Boosting Semi-Supervised 2D Human Pose Estimation by Revisiting Data Augmentation and Consistency Training
Huayi Zhou
Mukun Luo
Fei Jiang
Yue Ding
Hongtao Lu
Kui Jia
44
0
0
18 Feb 2024
APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond
Yuxiang Yang
Yingqi Deng
Yufei Xu
Jing Zhang
15
4
0
25 Dec 2023
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Qiang Wan
Zilong Huang
Bingyi Kang
Jiashi Feng
Li Zhang
MDE
VLM
11
15
0
22 Dec 2023
TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding
Shuo Wang
Jing Li
Zibo Zhao
Dongze Lian
Binbin Huang
Xiaomei Wang
Zhengxin Li
Shenghua Gao
22
4
0
06 Nov 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Junkun Yuan
Xinyu Zhang
Hao Zhou
Jian Wang
Zhongwei Qiu
...
Junyu Han
Errui Ding
Lanfen Lin
Fei Wu
Jingdong Wang
30
18
0
31 Oct 2023
Human Pose-based Estimation, Tracking and Action Recognition with Deep Learning: A Survey
Lijuan Zhou
Xiang Meng
Zhihuan Liu
Mengqi Wu
Zhimin Gao
Pichao Wang
32
3
0
19 Oct 2023
Accelerating Vision Transformers Based on Heterogeneous Attention Patterns
Deli Yu
Teng Xi
Jianwei Li
Baopu Li
Gang Zhang
Haocheng Feng
Junyu Han
Jingtuo Liu
Errui Ding
Jingdong Wang
ViT
26
0
0
11 Oct 2023
PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Haibo Qiu
Baosheng Yu
Yixin Chen
Dacheng Tao
3DPC
21
1
0
11 Oct 2023
Distilling Efficient Vision Transformers from CNNs for Semantic Segmentation
Xueye Zheng
Yunhao Luo
Pengyuan Zhou
Lin Wang
27
12
0
11 Oct 2023
Context-Aware Neural Video Compression on Solar Dynamics Observatory
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
ViT
8
2
0
19 Sep 2023
Understanding Dark Scenes by Contrasting Multi-Modal Observations
Xiaoyu Dong
Naoto Yokoya
27
5
0
23 Aug 2023
Spatial Transform Decoupling for Oriented Object Detection
Hongtian Yu
Yunjie Tian
QiXiang Ye
Yunfan Liu
32
26
0
21 Aug 2023
Joint Coordinate Regression and Association For Multi-Person Pose Estimation, A Pure Neural Network Approach
Dongyang Yu
Yun-Hao Xie
Wangpeng An
Li Zhang
Yufeng Yao
3DV
19
5
0
03 Jul 2023
Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation
Zhongwei Qiu
Qiansheng Yang
Jian Wang
Xiyu Wang
Chang Xu
Dongmei Fu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
DiffM
22
13
0
29 Jun 2023
InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Hanrong Ye
Dan Xu
ViT
23
10
0
08 Jun 2023
Efficient Vision Transformer for Human Pose Estimation via Patch Selection
K. A. Kinfu
René Vidal
ViT
31
4
0
07 Jun 2023
DFormer: Diffusion-guided Transformer for Universal Image Segmentation
Hefeng Wang
Jiale Cao
Rao Muhammad Anwer
J. Xie
F. Khan
Yanwei Pang
DiffM
23
18
0
06 Jun 2023
Gated Stereo: Joint Depth Estimation from Gated and Wide-Baseline Active Stereo Cues
Stefanie Walz
Mario Bijelic
Andrea Ramazzina
Amanpreet Walia
Fahim Mannan
Felix Heide
MDE
20
6
0
22 May 2023
VisiTherS: Visible-thermal infrared stereo disparity estimation of human silhouette
Noreen Anwar
Philippe Duplessis-Guindon
Guillaume-Alexandre Bilodeau
W. Bouachir
6
0
0
22 Apr 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
25
91
0
20 Apr 2023