Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
Title
4D Panoptic Scene Graph Generation
Neural Information Processing Systems (NeurIPS), 2024
Jingkang Yang
Jun Cen
Wenxuan Peng
Shuai Liu
Fangzhou Hong
Xiangtai Li
Kaiyang Zhou
Qifeng Chen
Ziwei Liu
158
23
0
16 May 2024
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
Computer Vision and Pattern Recognition (CVPR), 2024
Chengxiang Fan
Huanyi Zheng
Hao Chen
Yang Liu
Weijia Wu
Huaqi Zhang
Chunhua Shen
DiffM
270
14
0
16 May 2024
NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
Jie Liang
Radu Timofte
Qiaosi Yi
Shuai Liu
Lingchen Sun
Rongyuan Wu
Xindong Zhang
Huiyu Zeng
Lei Zhang
254
21
0
16 May 2024
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation
Yachan Guo
Yi Xiao
Danna Xue
Jose Luis Gomez Zurita
Antonio M. López
376
0
0
15 May 2024
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving
Daniel Bogdoll
Iramm Hamdard
Lukas Namgyu Rößler
Felix Geisler
Muhammed Bayram
...
Miguel de Campos
Anushervon Tabarov
Yitian Yang
Hanno Gottschalk
J. Marius Zöllner
277
8
0
13 May 2024
Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach
Elham Ravanbakhsh
Cheng Niu
Yongqing Liang
J. Ramanujam
Xin Li
VLM
335
1
0
10 May 2024
Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation
European Conference on Computer Vision (ECCV), 2024
Zhenliang Ni
Xinghao Chen
Yingjie Zhai
Yehui Tang
Yunhe Wang
293
64
0
10 May 2024
Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview
Engineering Applications of Artificial Intelligence (EAAI), 2024
Yuhang Ming
Xingrui Yang
Weihan Wang
Zheng Chen
Jinglun Feng
Yifan Xing
Guofeng Zhang
263
19
0
09 May 2024
S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling
Minh-Triet Tran
Adrian de Luis
Haitao Liao
Ying Huang
Roy McCann
Alan Mantooth
Jack Cothren
Ngan Le
424
0
0
07 May 2024
Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
Yanan Zhang
Jinqing Zhang
Zengran Wang
Junhao Xu
Di Huang
313
25
0
04 May 2024
Multi-Space Alignments Towards Universal LiDAR Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
You-Chen Liu
Lingdong Kong
Xiaoyang Wu
Runnan Chen
Xin Li
Liang Pan
Ziwei Liu
Yuexin Ma
3DPC
265
30
0
02 May 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
Xiaoqi Zhao
Youwei Pang
Wei Ji
Baicheng Sheng
Jiaming Zuo
Lihe Zhang
Huchuan Lu
310
14
0
02 May 2024
Garbage Segmentation and Attribute Analysis by Robotic Dogs
Nuo Xu
Jianfeng Liao
Qiwei Meng
Wei Song
197
0
0
28 Apr 2024
Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes
Zelong Zeng
Kaname Tomite
247
1
0
27 Apr 2024
Instance-free Text to Point Cloud Localization with Relative Position Awareness
Lichao Wang
Zhihao Yuan
Jinke Ren
Shuguang Cui
Zhen Li
267
0
0
27 Apr 2024
Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM
Laksh Nanwani
Kumaraditya Gupta
Aditya Mathur
Swayam Agrawal
A. H. A. Hafez
K. M. Krishna
301
2
0
27 Apr 2024
Features Fusion for Dual-View Mammography Mass Detection
Arina Varlamova
Valery Belotsky
Grigory Novikov
Anton Konushin
Evgeny Sidorov
MedIm
144
2
0
25 Apr 2024
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation
Haotian Yan
Ming Wu
Chuang Zhang
251
28
0
25 Apr 2024
MaGGIe: Masked Guided Gradual Human Instance Matting
Chuong Huynh
Seoung Wug Oh
Abhinav Shrivastava
Joon-Young Lee
VOS
186
13
0
24 Apr 2024
Efficient Transformer Encoders for Mask2Former-style models
Manyi Yao
Abhishek Aich
Yumin Suh
Amit Roy-Chowdhury
Christian Shelton
Manmohan Chandraker
208
0
0
23 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
418
0
0
23 Apr 2024
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
Zhangjing Yang
Dun Liu
Wensheng Cheng
Jinqiao Wang
Yi Wu
VLM
226
2
0
22 Apr 2024
HOIST-Former: Hand-held Objects Identification, Segmentation, and Tracking in the Wild
Supreeth Narasimhaswamy
Huy Anh Nguyen
Lihan Huang
Minh Hoai
225
7
0
22 Apr 2024
Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation
Guanlong Jiao
Chenyangguang Zhang
Haonan Yin
Yu Mo
Biqing Huang
Hui Pan
Yi Luo
Jingxian Liu
288
2
0
21 Apr 2024
Clio: Real-time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio
Yun Chang
Nathan Hughes
Matthew Trang
Dan Griffith
Carlyn Dougherty
Eric Cristofalo
Lukas Schmid
Luca Carlone
3DV
289
70
0
21 Apr 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Chuofan Ma
Yi Jiang
Jiannan Wu
Zehuan Yuan
Xiaojuan Qi
VLM
ObjD
209
104
0
19 Apr 2024
FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving
Xingtai Gui
Tengteng Huang
Haonan Shao
Haotian Yao
Chi Zhang
209
7
0
19 Apr 2024
How to Benchmark Vision Foundation Models for Semantic Segmentation?
Tommie Kerssies
Daan de Geus
Gijs Dubbelman
VLM
247
11
0
18 Apr 2024
MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification
Weikang Yu
Xiaokang Zhang
Samiran Das
Xiao Xiang Zhu
Pedram Ghamisi
112
13
0
18 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
177
24
0
18 Apr 2024
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
International Conference on Learning Representations (ICLR), 2024
Shengcao Cao
J. Gu
Jason Kuen
Hao Tan
Ruiyi Zhang
Handong Zhao
A. Nenkova
Liangyan Gui
Tong Sun
Yu Wang
VLM
OCL
332
3
0
18 Apr 2024
Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Mir Rayat Imtiaz Hossain
Mennatullah Siam
Leonid Sigal
James J. Little
VLM
229
18
0
17 Apr 2024
CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect
Minh Q. Tran
Sang Truong
Arthur F. A. Fernandes
Michael Kidd
Ngan Le
ViT
244
10
0
17 Apr 2024
StyleCity: Large-Scale 3D Urban Scenes Stylization
Yingshu Chen
Huajian Huang
Tuan-Anh Vu
Ka Chun Shum
Sai-Kit Yeung
231
3
0
16 Apr 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
188
5
0
15 Apr 2024
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation
Yu-Ju Tsai
Jin-Cheng Jhang
Jingjing Zheng
Wei Wang
Albert Y. C. Chen
Min Sun
Cheng-Hao Kuo
Ming-Hsuan Yang
3DV
170
7
0
15 Apr 2024
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
Han Xue
Qianru Sun
Li Song
Wenjun Zhang
Zhiwu Huang
MLLM
157
0
0
15 Apr 2024
The revenge of BiSeNet: Efficient Multi-Task Image Segmentation
Gabriele Rosi
Claudia Cuttano
Niccolò Cavagnero
Giuseppe Averta
Fabio Cermelli
SSeg
219
4
0
15 Apr 2024
SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction
Pin Tang
Zhongdao Wang
Guoqing Wang
Jilai Zheng
Xiangxuan Ren
Bailan Feng
Chao Ma
197
74
0
15 Apr 2024
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
259
1
0
14 Apr 2024
Coreset Selection for Object Detection
Hojun Lee
Suyoung Kim
Junhoo Lee
Jaeyoung Yoo
Nojun Kwak
195
15
0
14 Apr 2024
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
Binghua Li
Jie Mao
Zhe Sun
Chao Li
Qibin Zhao
Toshihisa Tanaka
184
1
0
13 Apr 2024
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
182
19
0
12 Apr 2024
LaSagnA: Language-based Segmentation Assistant for Complex Queries
Cong Wei
Haoxian Tan
Yujie Zhong
Yujiu Yang
Lin Ma
249
37
0
12 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
Chong Chen
221
140
0
11 Apr 2024
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
Jihao Liu
Jinliang Zheng
Yu Liu
Jiaming Song
VLM
171
6
0
11 Apr 2024
Transferable and Principled Efficiency for Open-Vocabulary Segmentation
Jingxuan Xu
Wuyang Chen
Yao-Min Zhao
Yunchao Wei
VLM
229
1
0
11 Apr 2024
Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Poulami Sinhamahapatra
Franziska Schwaiger
Shirsha Bose
Huiyu Wang
Karsten Roscher
Stephan Guennemann
210
1
0
11 Apr 2024
Identification of Fine-grained Systematic Errors via Controlled Scene Generation
Valentyn Boreiko
Matthias Hein
J. H. Metzen
171
1
0
10 Apr 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge
Lingxi Xie
Hongtao Xie
Nianzu Yang
Xiaopeng Zhang
Yongdong Zhang
Qi Tian
VLM
252
5
0
08 Apr 2024