Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.06278
Cited By
Per-Pixel Classification is Not All You Need for Semantic Segmentation
13 July 2021
Bowen Cheng
A. Schwing
Alexander Kirillov
VLM
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Per-Pixel Classification is Not All You Need for Semantic Segmentation"
50 / 261 papers shown
Title
Temporal Saliency Query Network for Efficient Video Recognition
Boyang Xia
Zhihao Wang
Wenhao Wu
Haoran Wang
Jungong Han
45
15
0
21 Jul 2022
NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds
Weiwei Sun
Daniel Rebain
Renjie Liao
V. Tankovich
S. Yazdani
K. M. Yi
Andrea Tagliasacchi
3DPC
15
13
0
20 Jul 2022
Zero-Shot Temporal Action Detection via Vision-Language Prompting
Sauradip Nag
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
VLM
30
65
0
17 Jul 2022
Online Video Instance Segmentation via Robust Context Fusion
Xiang Li
Jinglu Wang
Xiaohao Xu
Bhiksha Raj
Yan Lu
35
5
0
12 Jul 2022
Tracking Objects as Pixel-wise Distributions
Zelin Zhao
Ze Wu
Yueqing Zhuang
Boxun Li
Jiaya Jia
VOT
26
54
0
12 Jul 2022
SFNet: Faster and Accurate Semantic Segmentation via Semantic Flow
Xiangtai Li
Jiangning Zhang
Yibo Yang
Guangliang Cheng
Kuiyuan Yang
Yu Tong
Dacheng Tao
SSeg
AI4TS
46
28
0
10 Jul 2022
Dual Decision Improves Open-Set Panoptic Segmentation
Hainan Xu
Hao Chen
Lingqiao Liu
Yufei Yin
VLM
21
6
0
06 Jul 2022
Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention
Gary Leung
Jun Gao
Xiaohui Zeng
Sanja Fidler
18
3
0
05 Jul 2022
PhotoScene: Photorealistic Material and Lighting Transfer for Indoor Scenes
Yu-Ying Yeh
Zhengqin Li
Yannick Hold-Geoffroy
Rui Zhu
Zexiang Xu
Miloš Hašan
Kalyan Sunkavalli
Manmohan Chandraker
3DV
27
30
0
02 Jul 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Yibo Yang
Yong Liu
Dacheng Tao
ViT
34
32
0
19 Jun 2022
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu
Huiyu Wang
Dahun Kim
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
MedIm
32
89
0
17 Jun 2022
Learning Implicit Feature Alignment Function for Semantic Segmentation
Hanzhe Hu
Yinbo Chen
Jiarui Xu
Shubhankar Borse
H. Cai
Fatih Porikli
X. Wang
31
47
0
17 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
44
366
0
06 Jun 2022
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Bruno Sauvalle
A. de La Fortelle
3DPC
49
12
0
26 May 2022
Weakly-supervised segmentation of referring expressions
Robin Strudel
Ivan Laptev
Cordelia Schmid
19
21
0
10 May 2022
Region-level Contrastive and Consistency Learning for Semi-Supervised Semantic Segmentation
Jianrong Zhang
Tianyi Wu
Chuan-Yong Ding
Hongwei Zhao
Guodong Guo
ISeg
20
15
0
28 Apr 2022
Joint Forecasting of Panoptic Segmentations with Difference Attention
Colin Graber
Cyril Jazra
Wenjie Luo
Liangyan Gui
A. Schwing
AI4TS
24
1
0
14 Apr 2022
Panoptic, Instance and Semantic Relations: A Relational Context Encoder to Enhance Panoptic Segmentation
Shubhankar Borse
Hyojin Park
H. Cai
Debasmit Das
Risheek Garrepalli
Fatih Porikli
ISeg
33
13
0
11 Apr 2022
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
Shilin Xu
Xiangtai Li
Jingbo Wang
Guangliang Cheng
Yunhai Tong
Dacheng Tao
ViT
23
27
0
10 Apr 2022
Learning Local and Global Temporal Contexts for Video Semantic Segmentation
Guolei Sun
Yun Liu
Henghui Ding
Min Wu
Luc Van Gool
25
32
0
07 Apr 2022
Towards An End-to-End Framework for Flow-Guided Video Inpainting
Z. Li
Cheng Lu
Jia Qin
Chunle Guo
Mingg-Ming Cheng
41
149
0
06 Apr 2022
BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning
Zhi Hou
Baosheng Yu
Chaoyue Wang
Yibing Zhan
Dacheng Tao
ViT
20
11
0
04 Apr 2022
Dynamic Focus-aware Positional Queries for Semantic Segmentation
Haoyu He
Jianfei Cai
Zizheng Pan
Jing Liu
Jing Zhang
Dacheng Tao
Bohan Zhuang
34
17
0
04 Apr 2022
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
Zhenyu Li
Xuyang Wang
Xianming Liu
Junjun Jiang
MDE
24
191
0
03 Apr 2022
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
Estelle Aflalo
Meng Du
Shao-Yen Tseng
Yongfei Liu
Chenfei Wu
Nan Duan
Vasudev Lal
23
45
0
30 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
F. Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViT
MedIm
27
28
0
24 Mar 2022
Sparse Instance Activation for Real-Time Instance Segmentation
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Wenqiang Zhang
Qian Zhang
Chang Huang
Zhaoxiang Zhang
Wenyu Liu
ISeg
33
125
0
24 Mar 2022
Unsupervised Salient Object Detection with Spectral Cluster Voting
Gyungin Shin
Samuel Albanie
Weidi Xie
13
65
0
23 Mar 2022
GOSS: Towards Generalized Open-set Semantic Segmentation
Jie Hong
Weihong Li
Junlin Han
Jiyang Zheng
Pengfei Fang
Mehrtash Harandi
L. Petersson
VLM
19
19
0
23 Mar 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
24
263
0
22 Mar 2022
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers
Jiaming Zhang
Huayao Liu
Kailun Yang
Xinxin Hu
Ruiping Liu
Rainer Stiefelhagen
ViT
23
297
0
09 Mar 2022
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
Hao He
Yuhui Yuan
Xiangyu Yue
Han Hu
VOS
VLM
22
13
0
08 Mar 2022
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
Lang Peng
Zhirong Chen
Zhang-Hua Fu
Pengpeng Liang
Erkang Cheng
32
131
0
08 Mar 2022
Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
Lixiang Ru
Yibing Zhan
Baosheng Yu
Bo Du
ViT
39
181
0
05 Mar 2022
FedDrive: Generalizing Federated Learning to Semantic Segmentation in Autonomous Driving
Lidia Fantauzzo
Eros Fani
Debora Caldarola
A. Tavera
Fabio Cermelli
Marco Ciccone
Barbara Caputo
FedML
21
52
0
28 Feb 2022
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
25
328
0
16 Feb 2022
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
147
361
0
24 Jan 2022
Language as Queries for Referring Video Object Segmentation
Jiannan Wu
Yi-Xin Jiang
Pei Sun
Zehuan Yuan
Ping Luo
23
141
0
03 Jan 2022
SeMask: Semantically Masked Transformers for Semantic Segmentation
Jitesh Jain
Anukriti Singh
Nikita Orlov
Zilong Huang
Jiachen Li
Steven Walton
Humphrey Shi
ViT
29
92
0
23 Dec 2021
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
Golnaz Ghiasi
Xiuye Gu
Yin Cui
Tsung-Yi Lin
VLM
32
370
0
22 Dec 2021
Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
Yi Zhou
Hui Zhang
Hana Lee
Shuyang Sun
Pingjun Li
Yangguang Zhu
ByungIn Yoo
Xiaojuan Qi
Jae-Joon Han
VOS
25
26
0
16 Dec 2021
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
Rui Dai
Srijan Das
Kumara Kahatapitiya
Michael S. Ryoo
F. Brémond
ViT
36
73
0
07 Dec 2021
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Haobo Yuan
Xiangtai Li
Yibo Yang
Guangliang Cheng
Jing Zhang
Yunhai Tong
Lefei Zhang
Dacheng Tao
MDE
38
42
0
05 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
87
2,265
0
02 Dec 2021
Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis
Yucheng Tang
Dong Yang
Wenqi Li
H. Roth
Bennett Landman
Daguang Xu
V. Nath
Ali Hatamizadeh
ViT
MedIm
30
517
0
29 Nov 2021
High Quality Segmentation for Ultra High-resolution Images
Tiancheng Shen
Yuechen Zhang
Lu Qi
Jason Kuen
Xingyu Xie
Jianlong Wu
Zhe-nan Lin
Jiaya Jia
71
43
0
29 Nov 2021
Pruning Self-attentions into Convolutional Layers in Single Path
Haoyu He
Jianfei Cai
Jing Liu
Zizheng Pan
Jing Zhang
Dacheng Tao
Bohan Zhuang
ViT
31
40
0
23 Nov 2021
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run
Bichen Wu
Chaojian Li
Hang Zhang
Xiaoliang Dai
Peizhao Zhang
Matthew Yu
Jialiang Wang
Yingyan Lin
Peter Vajda
ViT
27
23
0
19 Nov 2021
Swin Transformer V2: Scaling Up Capacity and Resolution
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
...
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
ViT
49
1,746
0
18 Nov 2021
Multimodal Virtual Point 3D Detection
Tianwei Yin
Xingyi Zhou
Philipp Krahenbuhl
3DPC
160
245
0
12 Nov 2021
Previous
1
2
3
4
5
6
Next