2112.01527
Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,359 papers shown
Title
Grounded 3D-LLM with Referent Tokens
Yilun Chen
Shuai Yang
Haifeng Huang
Tai Wang
Ruiyuan Lyu
Runsen Xu
Dahua Lin
Jiangmiao Pang
45
22
0
16 May 2024
4D Panoptic Scene Graph Generation
Jingkang Yang
Jun Cen
Wenxuan Peng
Shuai Liu
Fangzhou Hong
Xiangtai Li
Kaiyang Zhou
Qifeng Chen
Ziwei Liu
39
13
0
16 May 2024
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
Chengxiang Fan
Muzhi Zhu
Hao Chen
Yang Liu
Weijia Wu
Huaqi Zhang
Chunhua Shen
DiffM
54
11
0
16 May 2024
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
Jie-Kai Liang
Radu Timofte
Qiaosi Yi
Shuai Liu
Lingchen Sun
Rongyuan Wu
Xindong Zhang
Huiyu Zeng
Lei Zhang
42
18
0
16 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
58
0
0
15 May 2024
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving
Daniel Bogdoll
Iramm Hamdard
Lukas Namgyu Rößler
Felix Geisler
Muhammed Bayram
...
Miguel de Campos
Anushervon Tabarov
Yitian Yang
Hanno Gottschalk
J. Marius Zöllner
34
5
0
13 May 2024
Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach
Elham Ravanbakhsh
Cheng Niu
Yongqing Liang
J. Ramanujam
Xin Li
VLM
52
0
0
10 May 2024
Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation
Zhenliang Ni
Xinghao Chen
Yingjie Zhai
Yehui Tang
Yunhe Wang
33
15
0
10 May 2024
Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview
Yuhang Ming
Xingrui Yang
Weihan Wang
Zheng Chen
Jinglun Feng
Yifan Xing
Guofeng Zhang
27
11
0
09 May 2024
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
Minh-Triet Tran
Adrian de Luis
Haitao Liao
Ying Huang
Roy McCann
Alan Mantooth
Jack Cothren
Ngan Le
82
0
0
07 May 2024
Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
Yanan Zhang
Jinqing Zhang
Zengran Wang
Junhao Xu
Di Huang
23
14
0
04 May 2024
Multi-Space Alignments Towards Universal LiDAR Segmentation
You-Chen Liu
Lingdong Kong
Xiaoyang Wu
Runnan Chen
Xin Li
Liang Pan
Ziwei Liu
Yuexin Ma
3DPC
37
17
0
02 May 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
34
6
0
02 May 2024
Garbage Segmentation and Attribute Analysis by Robotic Dogs
Nuo Xu
Jianfeng Liao
Qiwei Meng
Wei Song
11
0
0
28 Apr 2024
Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes
Zelong Zeng
Kaname Tomite
27
1
0
27 Apr 2024
Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM
Laksh Nanwani
Kumaraditya Gupta
Aditya Mathur
Swayam Agrawal
A. H. A. Hafez
K. M. Krishna
32
0
0
27 Apr 2024
Instance-free Text to Point Cloud Localization with Relative Position Awareness
Lichao Wang
Zhihao Yuan
Jinke Ren
Shuguang Cui
Zhen Li
34
0
0
27 Apr 2024
Features Fusion for Dual-View Mammography Mass Detection
Arina Varlamova
Valery Belotsky
Grigory Novikov
Anton Konushin
Evgeny Sidorov
MedIm
22
1
0
25 Apr 2024
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
Haotian Yan
Ming Wu
Chuang Zhang
26
12
0
25 Apr 2024
MaGGIe: Masked Guided Gradual Human Instance Matting
Chuong Huynh
Seoung Wug Oh
Abhinav Shrivastava
Joon-Young Lee
VOS
30
8
0
24 Apr 2024
Efficient Transformer Encoders for Mask2Former-style models
Manyi Yao
Abhishek Aich
Yumin Suh
Amit Roy-Chowdhury
Christian Shelton
Manmohan Chandraker
36
0
0
23 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
56
0
0
23 Apr 2024
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
Zhangjing Yang
Dun Liu
Wensheng Cheng
Jinqiao Wang
Yi Wu
VLM
29
2
0
22 Apr 2024
HOIST-Former: Hand-held Objects Identification, Segmentation, and Tracking in the Wild
Supreeth Narasimhaswamy
Huy Anh Nguyen
Lihan Huang
Minh Hoai
30
4
0
22 Apr 2024
Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation
Guanlong Jiao
Chenyangguang Zhang
Haonan Yin
Yu Mo
Biqing Huang
Hui Pan
Yi Luo
Jingxian Liu
33
0
0
21 Apr 2024
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio
Yun Chang
Nathan Hughes
Matthew Trang
Dan Griffith
Carlyn Dougherty
Eric Cristofalo
Lukas Schmid
Luca Carlone
3DV
36
32
0
21 Apr 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Chuofan Ma
Yi-Xin Jiang
Jiannan Wu
Zehuan Yuan
Xiaojuan Qi
VLM
ObjD
37
51
0
19 Apr 2024
FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving
Xingtai Gui
Tengteng Huang
Haonan Shao
Haotian Yao
Chi Zhang
40
3
0
19 Apr 2024
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
Shengcao Cao
Jiuxiang Gu
Jason Kuen
Hao Tan
Ruiyi Zhang
Handong Zhao
A. Nenkova
Liangyan Gui
Tong Sun
Yu-Xiong Wang
VLM
OCL
33
3
0
18 Apr 2024
How to Benchmark Vision Foundation Models for Semantic Segmentation?
Tommie Kerssies
Daan de Geus
Gijs Dubbelman
VLM
32
7
0
18 Apr 2024
MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification
Weikang Yu
Xiaokang Zhang
Samiran Das
Xiao Xiang Zhu
Pedram Ghamisi
31
9
0
18 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
29
8
0
18 Apr 2024
Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Mir Rayat Imtiaz Hossain
Mennatullah Siam
Leonid Sigal
James J. Little
VLM
38
5
0
17 Apr 2024
CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect
Minh Q. Tran
Sang Truong
Arthur F. A. Fernandes
Michael Kidd
Ngan Le
ViT
32
3
0
17 Apr 2024
StyleCity: Large-Scale 3D Urban Scenes Stylization
Yingshu Chen
Huajian Huang
Tuan-Anh Vu
Ka Chun Shum
Sai-Kit Yeung
41
0
0
16 Apr 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
44
4
0
15 Apr 2024
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation
Yu-Ju Tsai
Jin-Cheng Jhang
Jingjing Zheng
Wei Wang
Albert Y. C. Chen
Min Sun
Cheng-Hao Kuo
Ming-Hsuan Yang
3DV
22
4
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li-Na Song
Wenjun Zhang
Zhiwu Huang
MLLM
36
0
0
15 Apr 2024
The revenge of BiSeNet: Efficient Multi-Task Image Segmentation
Gabriele Rosi
Claudia Cuttano
Niccolò Cavagnero
Giuseppe Averta
Fabio Cermelli
SSeg
56
3
0
15 Apr 2024
SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction
Pin Tang
Zhongdao Wang
Guoqing Wang
Jilai Zheng
Xiangxuan Ren
Bailan Feng
Chao Ma
44
37
0
15 Apr 2024
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
26
0
0
14 Apr 2024
Coreset Selection for Object Detection
Hojun Lee
Suyoung Kim
Junhoo Lee
Jaeyoung Yoo
Nojun Kwak
30
4
0
14 Apr 2024
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
Binghua Li
Jie Mao
Zhe Sun
Chao Li
Qibin Zhao
Toshihisa Tanaka
28
0
0
13 Apr 2024
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
40
16
0
12 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
38
14
0
12 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
C. L. P. Chen
35
62
0
11 Apr 2024
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
Jihao Liu
Jinliang Zheng
Yu Liu
Hongsheng Li
VLM
27
3
0
11 Apr 2024
Transferable and Principled Efficiency for Open-Vocabulary Segmentation
Jingxuan Xu
Wuyang Chen
Yao-Min Zhao
Yunchao Wei
VLM
31
2
0
11 Apr 2024
Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
Poulami Sinhamahapatra
Franziska Schwaiger
Shirsha Bose
Huiyu Wang
Karsten Roscher
Stephan Guennemann
20
3
0
11 Apr 2024
Identification of Fine-grained Systematic Errors via Controlled Scene Generation
Valentyn Boreiko
Matthias Hein
J. H. Metzen
27
1
0
10 Apr 2024