ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.06278
  4. Cited By
Per-Pixel Classification is Not All You Need for Semantic Segmentation

Per-Pixel Classification is Not All You Need for Semantic Segmentation

13 July 2021
Bowen Cheng
A. Schwing
Alexander Kirillov
    VLM
    ViT
ArXivPDFHTML

Papers citing "Per-Pixel Classification is Not All You Need for Semantic Segmentation"

50 / 262 papers shown
Title
CancerUniT: Towards a Single Unified Model for Effective Detection,
  Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection
  of CT Scans
CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans
Jieneng Chen
Yingda Xia
Jiawen Yao
K. Yan
Jianpeng Zhang
...
Xin Chen
Jingren Zhou
Alan Yuille
Zai-De Liu
Ling Zhang
ViT
MedIm
28
15
0
28 Jan 2023
Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic
  Segmentation
Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation
S. D. Dao
Hengcan Shi
Dinh Q. Phung
Jianfei Cai
VLM
34
0
0
18 Jan 2023
Dyna-DepthFormer: Multi-frame Transformer for Self-Supervised Depth
  Estimation in Dynamic Scenes
Dyna-DepthFormer: Multi-frame Transformer for Self-Supervised Depth Estimation in Dynamic Scenes
Songchun Zhang
Chunhui Zhao
ViT
MDE
28
4
0
14 Jan 2023
Text to Point Cloud Localization with Relation-Enhanced Transformer
Text to Point Cloud Localization with Relation-Enhanced Transformer
Guangzhi Wang
Hehe Fan
Mohan S. Kankanhalli
3DPC
25
13
0
13 Jan 2023
Towards Real-Time Panoptic Narrative Grounding by an End-to-End
  Grounding Network
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Haowei Wang
Jiayi Ji
Yiyi Zhou
Yongjian Wu
Xiaoshuai Sun
27
15
0
09 Jan 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance
  Segmentation
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
34
17
0
03 Jan 2023
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part
  Segmentation
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Xiangtai Li
Shilin Xu
Yibo Yang
Haobo Yuan
Guangliang Cheng
Yu Tong
Zhouchen Lin
Ming-Hsuan Yang
Dacheng Tao
ViT
37
21
0
03 Jan 2023
Deep Learning Technique for Human Parsing: A Survey and Outlook
Deep Learning Technique for Human Parsing: A Survey and Outlook
Lu Yang
Wenhe Jia
Shane Li
Q. Song
ViT
41
17
0
01 Jan 2023
PanDepth: Joint Panoptic Segmentation and Depth Completion
PanDepth: Joint Panoptic Segmentation and Depth Completion
J. Lagos
Esa Rahtu
3DPC
VLM
28
1
0
29 Dec 2022
Representation Separation for Semantic Segmentation with Vision
  Transformers
Representation Separation for Semantic Segmentation with Vision Transformers
Yuanduo Hong
Huihui Pan
Weichao Sun
Xinghu Yu
Huijun Gao
ViT
23
5
0
28 Dec 2022
DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
Xiaoyang Kang
Tao Yang
Wenqi Ouyang
Peiran Ren
Lingzhi Li
Xuansong Xie
DiffM
MQ
27
37
0
22 Dec 2022
Improving Unsupervised Video Object Segmentation with Motion-Appearance
  Synergy
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy
Long Lian
Zhirong Wu
Stella X. Yu
VOS
14
0
0
17 Dec 2022
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with
  Class-Aware Cross-Domain Transformers
One-Shot Domain Adaptive and Generalizable Semantic Segmentation with Class-Aware Cross-Domain Transformers
R. Gong
Qin Wang
Dengxin Dai
Luc Van Gool
ViT
27
4
0
14 Dec 2022
Look Before You Match: Instance Understanding Matters in Video Object
  Segmentation
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Junke Wang
Dongdong Chen
Zuxuan Wu
Chong Luo
Chuanxin Tang
Xiyang Dai
Yucheng Zhao
Yujia Xie
Lu Yuan
Yu-Gang Jiang
VOS
36
39
0
13 Dec 2022
NMS Strikes Back
NMS Strikes Back
Jeffrey Ouyang-Zhang
Jang Hyun Cho
Xingyi Zhou
Philipp Krahenbuhl
32
37
0
12 Dec 2022
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Ziqi Zhou
Bowen Zhang
Yinjie Lei
Lingqiao Liu
Yifan Liu
VLM
21
167
0
07 Dec 2022
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution
Wentong Li
Wenyu Liu
Jianke Zhu
Miaomiao Cui
Risheng Yu
Xia Hua
Lei Zhang
ISeg
26
30
0
03 Dec 2022
Shape-Guided Diffusion with Inside-Outside Attention
Shape-Guided Diffusion with Inside-Outside Attention
Dong Huk Park
Grace Luo
C. Toste
S. Azadi
Xihui Liu
M. Karalashvili
Anna Rohrbach
Trevor Darrell
DiffM
32
44
0
01 Dec 2022
NOPE-SAC: Neural One-Plane RANSAC for Sparse-View Planar 3D
  Reconstruction
NOPE-SAC: Neural One-Plane RANSAC for Sparse-View Planar 3D Reconstruction
Bin Tan
Nan Xue
Tianfu Wu
Guisong Xia
22
15
0
30 Nov 2022
Superpoint Transformer for 3D Scene Instance Segmentation
Superpoint Transformer for 3D Scene Instance Segmentation
Jiahao Sun
Chunmei Qing
Junpeng Tan
Xiangmin Xu
3DPC
39
104
0
28 Nov 2022
FsaNet: Frequency Self-attention for Semantic Segmentation
FsaNet: Frequency Self-attention for Semantic Segmentation
Fengyu Zhang
Ashkan Panahi
Guangjun Gao
AI4TS
26
28
0
28 Nov 2022
Prototype as Query for Few Shot Semantic Segmentation
Prototype as Query for Few Shot Semantic Segmentation
Leilei Cao
Yibo Guo
Ye Yuan
Qiangguo Jin
ViT
25
10
0
27 Nov 2022
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Mean Shift Mask Transformer for Unseen Object Instance Segmentation
Ya Lu
Yuqiao Chen
Nicholas Ruozzi
Yu Xiang
16
23
0
21 Nov 2022
Visual Programming: Compositional visual reasoning without training
Visual Programming: Compositional visual reasoning without training
Tanmay Gupta
Aniruddha Kembhavi
ReLM
VLM
LRM
76
401
0
18 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video
  UniFormer
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
27
106
0
17 Nov 2022
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
Jiaheng Liu
Tong He
Honghui Yang
Rui Su
Jiayi Tian
Junran Wu
Hongcheng Guo
Ke Xu
Wanli Ouyang
ISeg
26
14
0
17 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
16
18
0
16 Nov 2022
Robust Online Video Instance Segmentation with Track Queries
Robust Online Video Instance Segmentation with Track Queries
Zitong Zhan
Daniel McKee
Svetlana Lazebnik
26
9
0
16 Nov 2022
Mining Unseen Classes via Regional Objectness: A Simple Baseline for
  Incremental Segmentation
Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation
Zekang Zhang
Guangyu Gao
Zhiyuan Fang
Jianbo Jiao
Yunchao Wei
CLL
20
31
0
13 Nov 2022
Enhancing Few-shot Image Classification with Cosine Transformer
Enhancing Few-shot Image Classification with Cosine Transformer
Quang-Huy Nguyen
Cuong Q. Nguyen
Dung D. Le
Hieu H. Pham
ViT
24
12
0
13 Nov 2022
OneFormer: One Transformer to Rule Universal Image Segmentation
OneFormer: One Transformer to Rule Universal Image Segmentation
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
ViT
24
326
0
10 Nov 2022
Efficient Unsupervised Video Object Segmentation Network Based on Motion
  Guidance
Efficient Unsupervised Video Object Segmentation Network Based on Motion Guidance
Chao Hu
Liqiang Zhu
VOS
14
2
0
10 Nov 2022
Dynamic loss balancing and sequential enhancement for road-safety
  assessment and traffic scene classification
Dynamic loss balancing and sequential enhancement for road-safety assessment and traffic scene classification
Marin Kavcan
Marko Sevrovic
Sinivsa vSegvić
19
1
0
08 Nov 2022
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Large Scale Radio Frequency Wideband Signal Detection & Recognition
Luke Boegner
Garrett M. Vanhoy
Phillip Vallance
Manbir Gulati
Dresden Feitzinger
B. Comar
Rob Miller
AI4TS
10
6
0
04 Nov 2022
Pointly-Supervised Panoptic Segmentation
Pointly-Supervised Panoptic Segmentation
Junsong Fan
Zhaoxiang Zhang
T. Tan
27
23
0
25 Oct 2022
Token-Label Alignment for Vision Transformers
Token-Label Alignment for Vision Transformers
Han Xiao
Wenzhao Zheng
Zhengbiao Zhu
Jie Zhou
Jiwen Lu
11
4
0
12 Oct 2022
SaiT: Sparse Vision Transformers through Adaptive Token Pruning
SaiT: Sparse Vision Transformers through Adaptive Token Pruning
Ling Li
D. Thorsley
Joseph Hassoun
ViT
25
17
0
11 Oct 2022
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised
  Instance Segmentation
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation
Tianheng Cheng
Xinggang Wang
Shaoyu Chen
Qian Zhang
Wenyu Liu
ISeg
35
42
0
11 Oct 2022
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Feng Liang
Bichen Wu
Xiaoliang Dai
Kunpeng Li
Yinan Zhao
Hang Zhang
Peizhao Zhang
Peter Vajda
Diana Marculescu
CLIP
VLM
32
433
0
09 Oct 2022
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models
Chen Liang
Wenguan Wang
Jiaxu Miao
Yi Yang
VLM
33
117
0
05 Oct 2022
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision
  Models
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models
Chenglin Yang
Siyuan Qiao
Qihang Yu
Xiaoding Yuan
Yukun Zhu
Alan Yuille
Hartwig Adam
Liang-Chieh Chen
ViT
MoE
33
58
0
04 Oct 2022
Learning Equivariant Segmentation with Instance-Unique Querying
Learning Equivariant Segmentation with Instance-Unique Querying
Wenguan Wang
James Liang
Dongfang Liu
ISeg
43
48
0
03 Oct 2022
PointScatter: Point Set Representation for Tubular Structure Extraction
PointScatter: Point Set Representation for Tubular Structure Extraction
Dong Wang
Zhao Zhang
Zi-Long Zhao
Yuhang Liu
Yihong Chen
Liwei Wang
3DPC
36
10
0
13 Sep 2022
Articulated 3D Human-Object Interactions from RGB Videos: An Empirical
  Analysis of Approaches and Challenges
Articulated 3D Human-Object Interactions from RGB Videos: An Empirical Analysis of Approaches and Challenges
Sanjay Haresh
Xiaohao Sun
Hanxiao Jiang
Angel X. Chang
Manolis Savva
38
10
0
12 Sep 2022
Detecting Network-based Internet Censorship via Latent Feature
  Representation Learning
Detecting Network-based Internet Censorship via Latent Feature Representation Learning
Shawn P. Duncan
Hui Chen
25
1
0
12 Sep 2022
Single-Stage Open-world Instance Segmentation with Cross-task
  Consistency Regularization
Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization
Xizhe Xue
Dongdong Yu
Lingqiao Liu
Yu Liu
Satoshi Tsutsui
Ying Li
Zehuan Yuan
Ping Song
Mike Zheng Shou
ISeg
21
4
0
18 Aug 2022
L3: Accelerator-Friendly Lossless Image Format for High-Resolution,
  High-Throughput DNN Training
L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training
Jonghyun Bae
W. Baek
Tae Jun Ham
Jae W. Lee
15
1
0
18 Aug 2022
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative
  Grounding
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding
Zixiang Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Si Liu
12
12
0
11 Aug 2022
Occlusion-Aware Instance Segmentation via BiLayer Network Architectures
Occlusion-Aware Instance Segmentation via BiLayer Network Architectures
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
ISeg
27
11
0
08 Aug 2022
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision
  Transformers for Panoramic Semantic Segmentation
Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
Jiaming Zhang
Kailun Yang
Haowen Shi
Simon Reiß
Kunyu Peng
Chaoxiang Ma
Haodong Fu
Philip H. S. Torr
Kaiwei Wang
Rainer Stiefelhagen
ViT
MDE
31
35
0
25 Jul 2022
Previous
123456
Next