Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.09408
Cited By
HRFormer: High-Resolution Transformer for Dense Prediction
18 October 2021
Yuhui Yuan
Rao Fu
Lang Huang
Weihong Lin
Chao Zhang
Xilin Chen
Jingdong Wang
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"HRFormer: High-Resolution Transformer for Dense Prediction"
28 / 128 papers shown
Title
ITTR: Unpaired Image-to-Image Translation with Transformers
Wanfeng Zheng
Qiang Li
Guoxin Zhang
Pengfei Wan
Zhong-ming Wang
ViT
27
17
0
30 Mar 2022
VPTR: Efficient Transformers for Video Prediction
Xi Ye
Guillaume-Alexandre Bilodeau
ViT
19
18
0
29 Mar 2022
Rethinking Semantic Segmentation: A Prototype View
Tianfei Zhou
Wenguan Wang
E. Konukoglu
Luc Van Gool
SSeg
23
259
0
28 Mar 2022
ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer
Rui Yang
Hailong Ma
Jie Wu
Yansong Tang
Xuefeng Xiao
Min Zheng
Xiu Li
ViT
19
53
0
21 Mar 2022
Hyperbolic Uncertainty Aware Semantic Segmentation
Bike Chen
Wei Peng
Xiaofeng Cao
Juha Roning
UQCV
16
15
0
16 Mar 2022
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
Hanrong Ye
Dan Xu
ViT
11
84
0
15 Mar 2022
Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution
Jinsu Yoo
Taehoon Kim
Sihaeng Lee
Seunghyeon Kim
H. Lee
Tae Hyun Kim
SupR
ViT
31
51
0
15 Mar 2022
Self-Promoted Supervision for Few-Shot Transformer
Bowen Dong
Pan Zhou
Shuicheng Yan
W. Zuo
ViT
22
28
0
14 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
21
295
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
19
13
0
08 Mar 2022
Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Chaoxiang Ma
Simon Reiß
Kunyu Peng
Rainer Stiefelhagen
ViT
22
72
0
02 Mar 2022
Single UHD Image Dehazing via Interpretable Pyramid Network
Boxue Xiao
Zhuoran Zheng
Xiang Chen
Chengfeng Lv
Yunliang Zhuang
Tao Wang
14
26
0
17 Feb 2022
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
142
361
0
24 Jan 2022
Poseur: Direct Human Pose Regression with Transformers
Wei Mao
Yongtao Ge
Chunhua Shen
Zhi Tian
Xinlong Wang
Zhibin Wang
A. Hengel
ViT
25
81
0
19 Jan 2022
Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
S. Song
Li Erran Li
Gao Huang
ViT
22
452
0
03 Jan 2022
ELSA: Enhanced Local Self-Attention for Vision Transformer
Jingkai Zhou
Pichao Wang
Fan Wang
Qiong Liu
Hao Li
Rong Jin
ViT
21
37
0
23 Dec 2021
iSegFormer: Interactive Segmentation via Transformers with Application to 3D Knee MR Images
Qin Liu
Zhenlin Xu
Yining Jiao
Marc Niethammer
ViT
MedIm
34
35
0
21 Dec 2021
MPViT: Multi-Path Vision Transformer for Dense Prediction
Youngwan Lee
Jonghee Kim
Jeffrey Willette
Sung Ju Hwang
ViT
13
243
0
21 Dec 2021
Vision Transformer Based Video Hashing Retrieval for Tracing the Source of Fake Videos
Pengfei Pei
Xianfeng Zhao
Yun Cao
Jinchuan Li
Xiaowei Yi
ViT
19
8
0
15 Dec 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
69
330
0
11 Nov 2021
Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation
Jiaqi Gu
Hyoukjun Kwon
Dilin Wang
Wei Ye
Meng Li
Yu-Hsin Chen
Liangzhen Lai
Vikas Chandra
D. Pan
ViT
11
182
0
01 Nov 2021
On the Connection between Local Attention and Dynamic Depth-wise Convolution
Qi Han
Zejia Fan
Qi Dai
Lei-huan Sun
Ming-Ming Cheng
Jiaying Liu
Jingdong Wang
ViT
8
104
0
08 Jun 2021
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
282
1,523
0
27 Feb 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,622
0
24 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,981
0
09 Feb 2021
Video Transformer Network
Daniel Neimark
Omri Bar
Maya Zohar
Dotan Asselmann
ViT
193
421
0
01 Feb 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
270
979
0
27 Jan 2021
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Vijay Badrinarayanan
Alex Kendall
R. Cipolla
SSeg
435
15,631
0
02 Nov 2015
Previous
1
2
3