Masked-attention Mask Transformer for Universal Image Segmentation
Versions: v1, v2, v3 (latest)

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,648 papers shown
Title
Multiscale Video Transformers for Class Agnostic Segmentation in Autonomous Driving
Leila Cheshmi
Mennatullah Siam
ViT
124
0
0
20 Aug 2025
EventSSEG: Event-driven Self-Supervised Segmentation with Probabilistic Attention
Lakshmi Annamalai
Chetan Singh Thakur
80
0
0
20 Aug 2025
A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives
Lixian Zhang
Zi Ye
Yibin Wen
Jianxi Huang
Zhiwei Zhang
Qingmei Li
Qiong Hu
Baodong Xu
Lingyuan Zhao
Haohuan Fu
88
1
0
20 Aug 2025
LENS: Learning to Segment Anything with Unified Reinforced Reasoning
Lianghui Zhu
Bin Ouyang
Yuxuan Zhang
Tianheng Cheng
R. Hu
...
Longjin Ran
Xiaoxin Chen
Li Yu
Wenyu Liu
Xinggang Wang
VLM, LRM
92
0
0
19 Aug 2025
SIS-Challenge: Event-based Spatio-temporal Instance Segmentation Challenge at the CVPR 2025 Event-based Vision Workshop
Friedhelm Hamann
Emil Mededovic
Fabian Gülhan
Yuli Wu
Johannes Stegmaier
...
Kanghan Oh
Gi Hyun Lim
Boxuan Yang
Bowen Du
Guillermo Gallego
ISeg
149
0
0
18 Aug 2025
Temporal Grounding as a Learning Signal for Referring Video Object Segmentation
Seunghun Lee
Jiwan Seo
Jeonghoon Kim
S. Kim
Siwon Kim
...
Wonhyeok Choi
Jaehoon Jeong
Zane Durante
Sang Hyun Park
Sunghoon Im
VOS
128
0
0
16 Aug 2025
EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models
Wenhui Zhu
Xiwen Chen
Zhipeng Wang
Shao Tang
Sayan Ghosh
Xuanzhao Dong
Rajat Koner
Yalin Wang
VLM
84
0
0
16 Aug 2025
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
T. Zemskova
A. Staroverov
Dmitry A. Yudin
Aleksandr I. Panov
56
0
0
15 Aug 2025
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception
Junjie Wang
Keyu Chen
Yulin Li
Bin Chen
Hengshuang Zhao
Xiaojuan Qi
Zhuotao Tian
CLIP, VLM
82
1
0
15 Aug 2025
Unlocking Robust Semantic Segmentation Performance via Label-only Elastic Deformations against Implicit Label Noise
Yechan Kim
DongHo Yoon
Younkwan Lee
Unse Fatima
Hong Kook Kim
Songjae Lee
Sanga Park
Jeong Ho Park
Seonjong Kang
M. Jeon
VLM
131
1
0
14 Aug 2025
CRISP: Contrastive Residual Injection and Semantic Prompting for Continual Video Instance Segmentation
Baichen Liu
Qi Lyu
Xudong Wang
Jiahua Dong
Lianqing Liu
Zhi Han
CLL, VLM
100
0
0
14 Aug 2025
From Pixel to Mask: A Survey of Out-of-Distribution Segmentation
Wenjie Zhao
Jia Li
Yunhui Guo
84
0
0
14 Aug 2025
FM4NPP: A Scaling Foundation Model for Nuclear and Particle Physics
David K. Park
Shuhang Li
Y. Huang
Xihaier Luo
Haiwang Yu
...
Lu Ma
Shinjae Yoo
Joseph Osborn
Jin-zhi Huang
Zhongjing Jiang
AI4CE
56
1
0
13 Aug 2025
TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos
Jinxi Li
Ziyang Song
Bo Yang
3DV, AI4CE
96
2
0
13 Aug 2025
RelayFormer: A Unified Local-Global Attention Framework for Scalable Image and Video Manipulation Localization
Wen Huang
Jiarui Yang
Tao Dai
Jiawei Li
Shaoxiong Zhan
Bin Wang
Shu-Tao Xia
84
0
0
13 Aug 2025
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
Shi-Chen Zhang
Yunheng Li
Yu-Huan Wu
Qibin Hou
Ming-Ming Cheng
SSeg
152
1
0
12 Aug 2025
Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation
Jiahua Dong
Hui Yin
Wenqi Liang
Hanbin Zhao
Henghui Ding
Nicu Sebe
Salman Khan
Fahad Shahbaz Khan
VLM
109
0
0
12 Aug 2025
SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA)
Computers & Graphics (Comput. Graph.), 2025
T. Nguyen
Viet-Tham Huynh
Quang-Thuc Nguyen
H. Nguyen
Long Le Bao
...
Dinh-Khoi Vo
Van-Loc Nguyen
Trung-Truc Huynh-Le
Tam V. Nguyen
Minh-Triet Tran
3DV
76
1
0
12 Aug 2025
Planner-Refiner: Dynamic Space-Time Refinement for Vision-Language Alignment in Videos
Tuyen Tran
T. Hoang Ngan Le
Quang-Hung Le
Truyen Tran
96
0
0
10 Aug 2025
EventRR: Event Referential Reasoning for Referring Video Object Segmentation
Huihui Xu
Jiashi Lin
Haoyu Chen
Junjun He
Lei Zhu
VOS
215
0
0
10 Aug 2025
S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision
Huihui Xu
Jin Ye
Hongqiu Wang
Changkai Ji
Jiashi Lin
...
Chenglong Ma
Tianbin Li
Lihao Liu
Junjun He
Lei Zhu
126
0
0
09 Aug 2025
Decoupling Continual Semantic Segmentation
Yifu Guo
Y. Lu
Wentao Zhang
Zishan Xu
Dexia Chen
Siyu Zhang
Yizhe Zhang
Ruixuan Wang
CLL
103
2
0
07 Aug 2025
SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion
Xiaoyang Zhang
Zhen Hua
Yakun Ju
Wei Zhou
Jun Liu
Alex C. Kot
96
0
0
07 Aug 2025
SPEX: A Vision-Language Model for Land Cover Extraction on Spectral Remote Sensing Images
Dongchen Si
Di Wang
Erzhong Gao
Xiaolei Qin
Liu Zhao
...
Jianbo Zhan
Jianshe Wang
Lin Liu
Bo Du
Liangpei Zhang
64
1
0
07 Aug 2025
Prototype-Driven Structure Synergy Network for Remote Sensing Images Segmentation
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025
Junyi Wang
Jinjiang Li
Guodong Fan
Yakun Ju
Xiang Fang
Alex C. Kot
111
1
0
06 Aug 2025
TNet: Terrace Convolutional Decoder Network for Remote Sensing Image Semantic Segmentation
Chengqian Dai
Yonghong Guo
Hongzhao Xiang
Yigui Luo
ViT
127
0
0
06 Aug 2025
DOMR: Establishing Cross-View Segmentation via Dense Object Matching
Jitong Liao
Yulu Gao
Shaofei Huang
Jialin Gao
Jie Lei
Ronghua Liang
Si Liu
137
1
0
06 Aug 2025
Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis
Angang Zhang
Fang Deng
Hao Chen
Zhongjian Chen
Junyan Li
DiffM
74
0
0
06 Aug 2025
What Holds Back Open-Vocabulary Segmentation?
Josip Šarić
Ivan Martinović
Matej Kristan
Sinisa Segvic
VLM
101
0
0
06 Aug 2025
X-SAM: From Segment Anything to Any Segmentation
Hao Wang
Limeng Qiao
Zequn Jie
Zhijian Huang
Chengjian Feng
Qingfang Zheng
Lin Ma
X. Lan
Xiaodan Liang
VLM
105
3
0
06 Aug 2025
Multi-Granularity Feature Calibration via VFM for Domain Generalized Semantic Segmentation
Xinhui Li
Xiaojie Guo
120
0
0
05 Aug 2025
ParticleSAM: Small Particle Segmentation for Material Quality Monitoring in Recycling Processes
Yu Zhou
Pelle Thielmann
Ayush Chamoli
B. Mirbach
D. Stricker
J. Rambach
56
0
0
05 Aug 2025
MetaScope: Optics-Driven Neural Network for Ultra-Micro Metalens Endoscopy
Wuyang Li
W. Pan
Xiaoyuan Liu
Zhendong Luo
Chenxin Li
Hengyu Liu
Din Ping Tsai
Mu Ku Chen
Yixuan Yuan
121
0
0
05 Aug 2025
AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding
Yidan Wang
Chenyi Zhuang
Wutao Liu
Pan Gao
Nicu Sebe
ObjD
170
0
0
05 Aug 2025
Set Pivot Learning: Redefining Generalized Segmentation with Vision Foundation Models
Xinhui Li
Xinyu He
Qiming Hu
Xiaojie Guo
103
0
0
03 Aug 2025
IAUNet: Instance-Aware U-Net
Yaroslav Prytula
Illia Tsiporenko
Ali Zeynalli
Dmytro Fishman
SSeg
156
0
0
03 Aug 2025
Rein++: Efficient Generalization and Adaptation for Semantic Segmentation with Vision Foundation Models
Zhixiang Wei
Xiaoxiao Ma
Ruishen Yan
Tao Tu
Huajun Chen
Jinjin Zheng
Yi-jing Jin
Enhong Chen
VLM
131
1
0
03 Aug 2025
LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving
Luqi Cheng
Zhangshuo Qi
Zijie Zhou
Chao Lu
Guangming Xiong
3DGS
91
2
0
03 Aug 2025
RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving
Jiayuan Wang
Q. M. Jonathan Wu
Katsuya Suto
Ning Zhang
105
2
0
02 Aug 2025
UIS-Mamba: Exploring Mamba for Underwater Instance Segmentation via Dynamic Tree Scan and Hidden State Weaken
Runmin Cong
Zongji Yu
Hao Fang
Haoyan Sun
Sam Kwong
Mamba
93
2
0
01 Aug 2025
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
298
10
0
01 Aug 2025
SDMatte: Grafting Diffusion Models for Interactive Matting
Daigang Xu
Yu Liang
H. Zhang
Jinwei Chen
Wei Dong
L. Chen
Wanyu Liu
Bo Li
P. Jiang
DiffM
139
1
0
01 Aug 2025
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Qi Chen
Lingxiao Yang
Yun Chen
Nailong Zhao
Jianhuang Lai
Jie Shao
Xiaohua Xie
VLM
107
1
0
01 Aug 2025
MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting
Xingyue Peng
Yuandong Lyu
Lang Zhang
Jian Zhu
S. Wang
...
Songxin Lu
Weiliang Ma
Dangen She
Fu Liu
Xianpeng Lang
3DGS, 3DV
85
0
0
31 Jul 2025
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Kaining Ying
Henghui Ding
Guangquan Jie
Yu Jiang
VOS
233
5
0
30 Jul 2025
Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery
Deepak Joshi
Mayukha Pal
ViT
44
0
0
28 Jul 2025
Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation
I-Hsiang Chen
Hua-En Chang
Wei-Ting Chen
Jenq-Neng Hwang
Sy-Yen Kuo
DiffM
126
0
0
28 Jul 2025
SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models
AAAI Conference on Artificial Intelligence (AAAI), 2025
J. Park
Kumju Jo
Sungyong Baik
DiffM
134
0
0
26 Jul 2025
Latest Object Memory Management for Temporally Consistent Video Instance Segmentation
Seunghun Lee
Jiwan Seo
Minwoo Choi
Kiljoon Han
Jaehoon Jeong
Zane Durante
Ehsan Adeli
Sang Hyun Park
Sunghoon Im
VOS
150
0
0
26 Jul 2025
FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving
Tao Lian
J. Gómez
Antonio M. López
FedML
87
1
0
26 Jul 2025