ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
Title
LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving
LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving
Luqi Cheng
Zhangshuo Qi
Zijie Zhou
Chao Lu
Guangming Xiong
3DGS
103
2
0
03 Aug 2025
RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving
RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving
Jiayuan Wang
Q. M. Jonathan Wu
Katsuya Suto
Ning Zhang
137
2
0
02 Aug 2025
SDMatte: Grafting Diffusion Models for Interactive Matting
SDMatte: Grafting Diffusion Models for Interactive Matting
Daigang Xu
Yu Liang
H. Zhang
Jinwei Chen
Wei Dong
L. Chen
Wanyu Liu
Bo Li
P. Jiang
DiffM
155
1
0
01 Aug 2025
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Qi Chen
Lingxiao Yang
Yun Chen
Nailong Zhao
Jianhuang Lai
Jie Shao
Xiaohua Xie
VLM
131
1
0
01 Aug 2025
UIS-Mamba: Exploring Mamba for Underwater Instance Segmentation via Dynamic Tree Scan and Hidden State Weaken
UIS-Mamba: Exploring Mamba for Underwater Instance Segmentation via Dynamic Tree Scan and Hidden State Weaken
Runmin Cong
Zongji Yu
Hao Fang
Haoyan Sun
Sam Kwong
Mamba
93
2
0
01 Aug 2025
Multimodal Referring Segmentation: A Survey
Multimodal Referring Segmentation: A Survey
Henghui Ding
Song Tang
Shuting He
Chang-rui Liu
Zuxuan Wu
Yu-Gang Jiang
330
10
0
01 Aug 2025
MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting
MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting
Xingyue Peng
Yuandong Lyu
Lang Zhang
Jian Zhu
S. Wang
...
Songxin Lu
Weiliang Ma
Dangen She
Fu Liu
Xianpeng Lang
3DGS3DV
117
0
0
31 Jul 2025
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Kaining Ying
Henghui Ding
Guangquan Jie
Yu Jiang
VOS
265
5
0
30 Jul 2025
Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery
Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery
Deepak Joshi
Mayukha Pal
ViT
44
0
0
28 Jul 2025
Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation
Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation
I-Hsiang Chen
Hua-En Chang
Wei-Ting Chen
Jenq-Neng Hwang
Sy-Yen Kuo
DiffM
162
0
0
28 Jul 2025
SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models
SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion ModelsAAAI Conference on Artificial Intelligence (AAAI), 2025
J. Park
Kumju Jo
Sungyong Baik
DiffM
150
0
0
26 Jul 2025
Latest Object Memory Management for Temporally Consistent Video Instance Segmentation
Latest Object Memory Management for Temporally Consistent Video Instance Segmentation
Seunghun Lee
Jiwan Seo
Minwoo Choi
Kiljoon Han
Jaehoon Jeong
Zane Durante
Ehsan Adeli
Sang Hyun Park
Sunghoon Im
VOS
190
0
0
26 Jul 2025
FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving
FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving
Tao Lian
J. Gómez
Antonio M. López
FedML
103
1
0
26 Jul 2025
MixA-Q: Revisiting Activation Sparsity for Vision Transformers from a Mixed-Precision Quantization Perspective
MixA-Q: Revisiting Activation Sparsity for Vision Transformers from a Mixed-Precision Quantization Perspective
Weitian Wang
Rai Shubham
Cecilia De La Parra
Akash Kumar
MQ
138
1
0
25 Jul 2025
SurgPIS: Surgical-instrument-level Instances and Part-level Semantics for Weakly-supervised Part-aware Instance Segmentation
SurgPIS: Surgical-instrument-level Instances and Part-level Semantics for Weakly-supervised Part-aware Instance Segmentation
Meng Wei
Charlie Budd
Oluwatosin O. Alabi
Miaojing Shi
Tom Vercauteren
187
0
0
25 Jul 2025
GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences
GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences
Gabriel Jarry
Ramon Dalmau
Philippe Very
Franck Ballerini
Stephania-Denisa Bocu
186
1
0
24 Jul 2025
Object segmentation in the wild with foundation models: application to vision assisted neuro-prostheses for upper limbs
Object segmentation in the wild with foundation models: application to vision assisted neuro-prostheses for upper limbs
Bolutife Atoki
J. Benois-Pineau
Renaud Péteri
Fabien Baldacci
A. Rugy
VLM
209
0
0
24 Jul 2025
Synthetic Data Augmentation for Enhanced Chicken Carcass Instance Segmentation
Synthetic Data Augmentation for Enhanced Chicken Carcass Instance Segmentation
Yihong Feng
Chaitanya Pallerla
Xiaomin Lin
Pouya Sohrabipour Sr
Philip Glen Crandall
Wan Shou
Yu She
Dongyi Wang
101
0
0
24 Jul 2025
IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird's-Eye View Perception
IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird's-Eye View Perception
Haichuan Li
Changda Tian
Panos Trahanias
Tomi Westerlund
3DPC
81
0
0
23 Jul 2025
A2Mamba: Attention-augmented State Space Models for Visual Recognition
A2Mamba: Attention-augmented State Space Models for Visual Recognition
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
161
0
0
22 Jul 2025
Advancing Visual Large Language Model for Multi-granular Versatile Perception
Advancing Visual Large Language Model for Multi-granular Versatile Perception
Wentao Xiang
Haoxian Tan
Cong Wei
Yujie Zhong
Dengjie Li
Yujiu Yang
VLM
181
1
0
22 Jul 2025
Part Segmentation of Human Meshes via Multi-View Human Parsing
Part Segmentation of Human Meshes via Multi-View Human Parsing
James Dickens
Kamyar Hamad
3DH
330
1
0
22 Jul 2025
SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction
SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction
Zhixiong Zhang
Shuangrui Ding
Xiaoyi Dong
Songxin He
Jianfan Lin
Junsong Tang
Yuhang Zang
Yuhang Cao
Dahua Lin
Jiaqi Wang
VOS
228
10
0
21 Jul 2025
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
Ruijie Zhu
Mulin Yu
Linning Xu
Lihan Jiang
Shouqing Yang
Tianzhu Zhang
Jiangmiao Pang
Bo Dai
3DGS
98
3
0
21 Jul 2025
Conditional Video Generation for High-Efficiency Video Compression
Conditional Video Generation for High-Efficiency Video Compression
Fangqiu Yi
Jingyu Xu
Jiawei Shao
Chi Zhang
Xuelong Li
DiffMVGen
283
1
0
21 Jul 2025
GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset
GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset
Zhiwei Zhang
Zi Ye
Yibin Wen
Shuai Yuan
Haohuan Fu
Jianxi Huang
Lixian Zhang
250
1
0
19 Jul 2025
Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation
Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation
Masahiro Ogawa
Qi An
Atsushi Yamashita
105
0
0
18 Jul 2025
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation
Shiqi Huang
Shuting He
Huaiyuan Qin
Bihan Wen
271
1
0
17 Jul 2025
AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving
AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving
Jiawei Xu
K. Deng
Zexin Fan
Shenlong Wang
Jin Xie
Zhiqiang Wang
3DGS
282
6
0
16 Jul 2025
Spatial Frequency Modulation for Semantic Segmentation
Spatial Frequency Modulation for Semantic SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Linwei Chen
Ying Fu
Lin Gu
Dezhi Zheng
Jifeng Dai
357
2
0
16 Jul 2025
Frequency-Dynamic Attention Modulation for Dense Prediction
Frequency-Dynamic Attention Modulation for Dense Prediction
Linwei Chen
Lin Gu
Ying Fu
474
1
0
16 Jul 2025
Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline
Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline
Shiyi Mu
Zichong Gu
Hanqi Lyu
Yilin Gao
Shugong Xu
3DPC
145
0
0
12 Jul 2025
All in One: Visual-Description-Guided Unified Point Cloud Segmentation
All in One: Visual-Description-Guided Unified Point Cloud Segmentation
Zongyan Han
Mohamed El Amine Boudjoghra
Jiahua Dong
Jinhong Wang
Rao Muhammad Anwer
170
0
0
07 Jul 2025
OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts
OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts
Shiting Xiao
Rishabh Kabra
Yuhang Li
Donghyun Lee
João Carreira
Priyadarshini Panda
VLM
239
0
0
07 Jul 2025
Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation
Move to Understand a 3D Scene: Bridging Visual Grounding and Exploration for Efficient and Versatile Embodied Navigation
Ziyu Zhu
Xilin Wang
Yixuan Li
Zhuofan Zhang
Xiaojian Ma
...
Wei Liang
Qian Yu
Zhidong Deng
Siyuan Huang
Qing Li
LM&Ro
240
17
0
05 Jul 2025
PhenoBench: A Comprehensive Benchmark for Cell Phenotyping
PhenoBench: A Comprehensive Benchmark for Cell Phenotyping
Fabian H. Reith
Claudia Winklmayr
Jerome Luescher
Nora Koreuber
Jannik Franzen
Elias Baumann
Christian M. Schuerch
Dagmar Kainmueller
J. L. Rumberger
216
0
0
04 Jul 2025
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
Qi Xu
Dongxu Wei
Lingzhe Zhao
Wenpu Li
Zhangchi Huang
Shunping Ji
Peidong Liu
3DV
235
0
0
03 Jul 2025
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
Ming Dai
Wenxuan Cheng
Jiang-Jiang Liu
Sen Yang
Wenxiao Cai
Yanpeng Sun
Wankou Yang
164
6
0
02 Jul 2025
Learning an Ensemble Token from Task-driven Priors in Facial Analysis
Learning an Ensemble Token from Task-driven Priors in Facial Analysis
Sunyong Seo
Huisu Yoon
Jongha Lee
ViTFedML
261
0
0
02 Jul 2025
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement
Yuqi Liu
Bohao Peng
Zhisheng Zhong
Zihao Yue
Fanbin Lu
Bei Yu
Jiaya Jia
LRMVLM
335
46
0
01 Jul 2025
Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts
Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts
Shiu-hong Kao
Yu-Wing Tai
Chi-Keung Tang
MLLMLRM
529
2
0
01 Jul 2025
Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection
Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection
Yuchu Jiang
Jiaming Chu
Jian Zhao
Xin Zhang
Xu Yang
Lei Jin
C. Zhang
Xuelong Li
167
0
0
20 Jun 2025
FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation
FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation
Fan Yang
Yousong Zhu
Xin Li
Yufei Zhan
Hongyin Zhao
Shurong Zheng
Yaowei Wang
Ming Tang
Jinqiao Wang
MLLMVLM
206
0
0
20 Jun 2025
Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation
Stepping Out of Similar Semantic Space for Open-Vocabulary Segmentation
Yong-Jin Liu
SongLi Wu
Sule Bai
Jiahao Wang
Yitong Wang
Yansong Tang
VLMVOS
254
1
0
19 Jun 2025
FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception
FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception
Mingyuan Li
Tong Jia
Han Gu
Hui Lu
Hao Wang
Bowen Ma
Shuyang Lin
Shiyi Guo
Shizhuo Deng
Dongyue Chen
144
1
0
16 Jun 2025
Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning
Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning
Rohit Mohan
Julia Hindel
F. Drews
Claudius Gläser
Daniele Cattaneo
Abhinav Valada
UQCVEDL
390
1
0
16 Jun 2025
Segmenting Visuals With Querying Words: Language Anchors For Semi-Supervised Image Segmentation
Segmenting Visuals With Querying Words: Language Anchors For Semi-Supervised Image Segmentation
Numair Nadeem
Saeed Anwar
Muhammad Asad
Abdul Bais
VLM
204
0
0
16 Jun 2025
Unleashing Diffusion and State Space Models for Medical Image Segmentation
Unleashing Diffusion and State Space Models for Medical Image Segmentation
Rong Wu
Ziqi Chen
Liming Zhong
Heng Li
Hai Shu
MedIm
265
1
0
15 Jun 2025
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling
MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling
Liang Yin
Xudong Xie
Zhang Li
Xiang Bai
Yuliang Liu
LRM
274
0
0
12 Jun 2025
Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models
Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models
Francisco Caetano
Christiaan Viviers
Peter H. N. de With
Fons van der Sommen
DiffM
307
1
0
12 Jun 2025
Previous
123456...323334
Next