Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.06278
Cited By
Per-Pixel Classification is Not All You Need for Semantic Segmentation
13 July 2021
Bowen Cheng
A. Schwing
Alexander Kirillov
VLM
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Per-Pixel Classification is Not All You Need for Semantic Segmentation"
50 / 262 papers shown
Title
CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes
Danial Qashqai
Emad Mousavian
S. B. Shokouhi
S. Mirzakuchaki
40
0
0
01 Jul 2024
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation
Ahmad Mohammadshirazi
Ali Nosrati Firoozsalari
Mengxi Zhou
Dheeraj Kulshrestha
R. Ramnath
31
0
0
25 Jun 2024
GMT: Guided Mask Transformer for Leaf Instance Segmentation
Feng Chen
Sotirios A. Tsaftaris
M. Giuffrida
20
1
0
24 Jun 2024
LOGCAN++: Adaptive Local-global class-aware network for semantic segmentation of remote sensing imagery
Xiaowen Ma
Rongrong Lian
Zhenkai Wu
Hongbo Guo
Mengting Ma
Sensen Wu
Zhenhong Du
Siyang Song
Wei Zhang
44
4
0
24 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
42
0
0
17 Jun 2024
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation
Bingfeng Zhang
Siyue Yu
Yunchao Wei
Yao Zhao
Jimin Xiao
VLM
35
8
0
17 Jun 2024
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
40
0
0
14 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
78
12
0
09 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Lu Yang
Pu Cao
Liulei Li
Huadong Ma
43
1
0
06 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
F. Khan
VLM
ISeg
77
6
0
04 Jun 2024
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Nicolas Dufour
Victor Besnier
Vicky Kalogeiton
David Picard
DiffM
51
2
0
30 May 2024
GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Xin Tan
Wenbin Wu
Zhiwei Zhang
Chaojie Fan
Yong Peng
Zhizhong Zhang
Yuan Xie
Lizhuang Ma
70
9
0
17 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
63
0
0
15 May 2024
A Partial Replication of MaskFormer in TensorFlow on TPUs for the TensorFlow Model Garden
Vishal Purohit
Wenxin Jiang
Akshath R. Ravikiran
James C. Davis
32
1
0
29 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
56
0
0
23 Apr 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge
Lingxi Xie
Hongtao Xie
Pandeng Li
Xiaopeng Zhang
Yongdong Zhang
Qi Tian
VLM
23
3
0
08 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
27
2
0
02 Apr 2024
Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation
Qi Bi
Shaodi You
Theo Gevers
45
1
0
29 Mar 2024
Part-aware Personalized Segment Anything Model for Patient-Specific Segmentation
Chenhui Zhao
Liyue Shen
VLM
32
3
0
08 Mar 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
30
5
0
06 Mar 2024
End-to-End Human Instance Matting
Qinglin Liu
Shengping Zhang
Quanling Meng
Bineng Zhong
Peiqiang Liu
H. Yao
3DH
37
5
0
03 Mar 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
28
2
0
23 Feb 2024
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey
Fabio Tosi
Youming Zhang
Ziren Gong
Erik Sandström
S. Mattoccia
Martin R. Oswald
Matteo Poggi
3DGS
63
53
0
20 Feb 2024
ISCUTE: Instance Segmentation of Cables Using Text Embedding
Shir Kozlovsky
O. Joglekar
Dotan Di Castro
32
2
0
19 Feb 2024
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition
Xu Hu
Yuxi Wang
Lue Fan
Junsong Fan
Junran Peng
Zhen Lei
Qing Li
Zhaoxiang Zhang
Zhaoxiang Zhang
3DGS
42
8
0
31 Jan 2024
PACE: A Pragmatic Agent for Enhancing Communication Efficiency Using Large Language Models
Jiaxuan Li
Minxi Yang
Dahua Gao
Wenlong Xu
Guangming Shi
35
0
0
30 Jan 2024
Learning to Manipulate Artistic Images
Wei Guo
Yuqi Zhang
De Ma
Qian Zheng
28
0
0
25 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
31
14
0
25 Jan 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
40
11
0
18 Jan 2024
PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields
Zheng Chen
Qingan Yan
Huangying Zhan
Changjiang Cai
Xiangyu Xu
Yuzhong Huang
Weihan Wang
Ziyue Feng
Lantao Liu
Yi Tian Xu
3DV
48
3
0
30 Dec 2023
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving
Tianyu Li
Peijin Jia
Bangjun Wang
Li Chen
Kun Jiang
Junchi Yan
Hongyang Li
38
34
0
26 Dec 2023
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation
Sangyun Shin
Kaichen Zhou
M. Vankadari
Andrew Markham
Niki Trigoni
3DPC
27
8
0
18 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM
MLLM
34
18
0
13 Dec 2023
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie-jin Yang
Yun Gu
46
1
0
13 Dec 2023
PEEKABOO: Interactive Video Generation via Masked-Diffusion
Yash Jain
Anshul Nasery
Vibhav Vineet
Harkirat Singh Behl
VGen
28
30
0
12 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
27
3
0
12 Dec 2023
Adaptive Human Trajectory Prediction via Latent Corridors
Neerja Thakkar
K. Mangalam
Andrea V. Bajcsy
Jitendra Malik
20
4
0
11 Dec 2023
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting
Yankai Jiang
Zhongzhen Huang
Rongzhao Zhang
Xiaofan Zhang
Shaoting Zhang
VLM
34
10
0
07 Dec 2023
GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models
Haicheng Liao
Huanming Shen
Zhenning Li
Chengyue Wang
Guofa Li
Yiming Bie
Chengzhong Xu
34
50
0
06 Dec 2023
Predicting Scores of Various Aesthetic Attribute Sets by Learning from Overall Score Labels
Heng Huang
Xin Jin
Yaqi Liu
Hao Lou
Chaoen Xiao
Shuai Cui
Xinning Li
Dongqing Zou
25
1
0
06 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding
Xiang Li
Jian Ding
Zhaoyang Chen
Mohamed Elhoseiny
30
3
0
05 Dec 2023
Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation
Yuan Wang
Naisong Luo
Tianzhu Zhang
30
11
0
29 Nov 2023
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
Thanh-Dat Truong
Utsav Prabhu
Bhiksha Raj
Jackson Cothren
Khoa Luu
CLL
37
3
0
27 Nov 2023
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Hao Li
Dingwen Zhang
Yalun Dai
Nian Liu
Lechao Cheng
Jingfeng Li
Jingdong Wang
Junwei Han
25
14
0
20 Nov 2023
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
28
14
0
09 Nov 2023
OmniVec: Learning robust representations with cross modal sharing
Siddharth Srivastava
Gaurav Sharma
SSL
24
64
0
07 Nov 2023
OpenForest: A data catalogue for machine learning in forest monitoring
Arthur Ouaknine
T. Kattenborn
Etienne Laliberté
David Rolnick
43
5
0
01 Nov 2023
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Kai Li
Yupeng Deng
Yun-long Kong
Diyou Liu
Jingbo Chen
Yu Meng
Junxian Ma
Chenhao Wang
51
1
0
25 Oct 2023
Minimalist and High-Performance Semantic Segmentation with Plain Vision Transformers
Yuanduo Hong
Jue Wang
Weichao Sun
Huihui Pan
VLM
ViT
37
7
0
19 Oct 2023
3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers
Jieneng Chen
Jieru Mei
Xianhang Li
Yongyi Lu
Qihang Yu
...
M. Lungren
Lei Xing
Le Lu
Alan L. Yuille
Yuyin Zhou
MedIm
ViT
33
36
0
11 Oct 2023
Previous
1
2
3
4
5
6
Next