ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
Title
Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation
Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation
Edgar Heinert
Thomas Gottwald
Annika Mütze
Matthias Rottmann
336
1
0
16 Mar 2025
E-SAM: Training-Free Segment Every Entity Model
E-SAM: Training-Free Segment Every Entity Model
Weiming Zhang
Dingwen Xiao
Lei Chen
Lin Wang
VLM
149
1
0
15 Mar 2025
SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets
Hao Liu
Pengyu Guo
Siyuan Yang
Zeqing Jiang
Qinglei Hu
Dongyu Li
103
1
0
14 Mar 2025
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
Christopher Xie
A. Avetisyan
Henry Howard-Jenkins
Yawar Siddiqui
Julian Straub
Richard Newcombe
Vasileios Balntas
Jakob Julian Engel
3DH3DV
329
1
0
14 Mar 2025
VGGT: Visual Geometry Grounded TransformerComputer Vision and Pattern Recognition (CVPR), 2025
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
427
459
0
14 Mar 2025
High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation
High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation
Quan-Sheng Zeng
Yunheng Li
Daquan Zhou
Guanbin Li
Qibin Hou
Ming-Ming Cheng
CLIPVLM
291
2
0
13 Mar 2025
Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation
Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation
Máté Tóth
Péter Kovács
Zoltán Bendefy
Zoltán Bendefy
Zoltán Hortsin
Balázs Teréki
Tamás Matuszka
3DGSAI4CE
814
0
0
12 Mar 2025
Learning Appearance and Motion Cues for Panoptic Tracking
Juana Valeria Hurtado
Sajad Marvi
Rohit Mohan
Abhinav Valada
257
0
0
12 Mar 2025
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
Lehan Yang
Jincen Song
Tianlong Wang
Daiqing Qi
Weili Shi
Yuheng Liu
Sheng Li
DiffMVOSVGen
286
1
0
11 Mar 2025
From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans
Qinji Yu
Yirui Wang
K. Yan
Dandan Zheng
Dashan Ai
...
N. Shen
Xiaowei Ding
Le Lu
X. Ye
Dakai Jin
ViTMedIm
407
0
0
11 Mar 2025
TRACE: Your Diffusion Model is Secretly an Instance Edge Detector
TRACE: Your Diffusion Model is Secretly an Instance Edge Detector
Sanghyun Jo
Ziseok Lee
Wooyeol Lee
Jonghyun Choi
Jaesik Park
Kyungsu Kim
438
1
0
11 Mar 2025
SAS: Segment Any 3D Scene with Integrated 2D Priors
Hao Sun
Jiahao Lu
Jiacheng Deng
Hanzhi Chang
Lifan Wu
Yanzhe Liang
Tianzhu Zhang
237
2
0
11 Mar 2025
Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework
Zhuo Zhi
Chen Feng
Adam Daneshmend
Mine Orlu
Andreas Demosthenous
L. Yin
Da Li
Ziquan Liu
Miguel R. D. Rodrigues
LRM
232
8
0
11 Mar 2025
MegaSR: Mining Customized Semantics and Expressive Guidance for Real-World Image Super-Resolution
MegaSR: Mining Customized Semantics and Expressive Guidance for Real-World Image Super-Resolution
Xiaochen Li
Yue Yu
Xinchuan Huang
C. L. Philip Chen
Weili Guan
Xian-Sheng Hua
DiffM
242
0
0
11 Mar 2025
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation
Anzhe Cheng
Chenzhong Yin
Yu Chang
Heng Ping
Shixuan Li
Shahin Nazarian
Paul Bogdan
SSeg
540
1
0
11 Mar 2025
3D Medical Imaging Segmentation on Non-Contrast CT
Canxuan Gang
Yuhan Peng
222
0
0
11 Mar 2025
Visual Attention Graph
Kai-Fu Yang
Yong-Jie Li
162
0
0
11 Mar 2025
TrackOcc: Camera-based 4D Panoptic Occupancy TrackingIEEE International Conference on Robotics and Automation (ICRA), 2025
Zhuoguang Chen
Kenan Li
Xiuyu Yang
Tao Jiang
Yongqian Li
Hang Zhao
313
1
0
11 Mar 2025
Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion PathwaysComputer Vision and Pattern Recognition (CVPR), 2025
Yi Liu
Hao Zhou
Wenxiang Shang
Ran Lin
Benlei Cui
DiffM
101
7
0
10 Mar 2025
FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection
Takeru Inoue
Ryusuke Miyamoto
177
0
0
10 Mar 2025
Dynamic Dictionary Learning for Remote Sensing Image Segmentation
Xuechao Zou
Yue Li
Shun Zhang
Kai Li
Shiying Wang
Pin Tao
Junliang Xing
Congyan Lang
286
1
0
09 Mar 2025
Golden Cudgel Network for Real-Time Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2025
Guoyu Yang
Yuan Wang
Daming Shi
Yanjie Wang
192
6
0
05 Mar 2025
COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation
Aurelio Noca
Xianmei Lei
Jonathan Becktor
J. Edlund
Anna Sabel
Patrick Spieler
Curtis Padgett
Alexandre Alahi
Deegan Atha
308
0
0
05 Mar 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Xiangwei Zhu
423
0
0
05 Mar 2025
Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art
Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art
Youssef Shoeb
Azarm Nowzad
Hanno Gottschalk
UQCV
536
6
0
04 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Boltzmann Attention Sampling for Image Analysis with Small ObjectsComputer Vision and Pattern Recognition (CVPR), 2025
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
376
1
0
04 Mar 2025
Object-Aware Video Matting with Cross-Frame Guidance
Han Zhang
Dongyue Wu
Yuanjie Shao
Nong Sang
Changxin Gao
VOS
221
1
0
03 Mar 2025
One-shot In-context Part SegmentationACM Multimedia (MM), 2024
Zhenqi Dai
Ting Liu
Xinyu Zhang
Y. X. Wei
Yanning Zhang
VLM
431
2
0
03 Mar 2025
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang
Chenwei Xie
Haiyang Wang
Xiaoyi Bao
Tingyu Weng
Nianzu Yang
Yun Zheng
Liwei Wang
ObjDVLM
365
12
0
03 Mar 2025
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word EmphasisAAAI Conference on Artificial Intelligence (AAAI), 2025
Yun Wang
Jingchen Ni
Yong-Jin Liu
Chun Yuan
Yansong Tang
251
13
0
02 Mar 2025
Training-Free Dataset Pruning for Instance SegmentationInternational Conference on Learning Representations (ICLR), 2025
Yalun Dai
Lingao Xiao
Ivor W. Tsang
Yang He
ISeg
285
4
0
02 Mar 2025
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
Zhongyang Li
Ziyue Li
Wanrong Zhu
MoE
409
3
0
27 Feb 2025
Open-Vocabulary Semantic Part Segmentation of 3D Human
Open-Vocabulary Semantic Part Segmentation of 3D HumanInternational Conference on 3D Vision (3DV), 2025
Keito Suzuki
Bang Du
Girish Krishnan
Kunyao Chen
Runfa Li
Truong Thao Nguyen
3DHVLM
336
6
0
27 Feb 2025
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating ObjectsAAAI Conference on Artificial Intelligence (AAAI), 2025
Elkhan Ismayilzada
MD Khalequzzaman Chowdhury Sayem
Yihalem Yimolal Tiruneh
Mubarrat Chowdhury
Muhammadjon Boboev
Seungryul Baek
ViT
285
2
0
27 Feb 2025
Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach
Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach
Anton Backhaus
Thorsten Luettel
Mirko Maehlisch
246
0
0
26 Feb 2025
A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images
A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images
N. Shvetsov
T. Kilvaer
M. Tafavvoghi
Anders Sildnes
Kajsa Møllersen
Lill-ToveRasmussen Busund
L. A. Bongo
VLM
308
1
0
26 Feb 2025
CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation
CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation
Vishal Thengane
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
L. Yin
Xiatian Zhu
Salman Khan
CLL
965
0
0
24 Feb 2025
Vision-LSTM: xLSTM as Generic Vision Backbone
Vision-LSTM: xLSTM as Generic Vision BackboneInternational Conference on Learning Representations (ICLR), 2024
Benedikt Alkin
M. Beck
Korbinian Poppel
Sepp Hochreiter
Johannes Brandstetter
VLM
424
80
0
24 Feb 2025
Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement
Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement
Rui Liu
216
0
0
24 Feb 2025
OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving
OG-Gaussian: Occupancy Based Street Gaussians for Autonomous DrivingIEEE International Conference on Robotics and Automation (ICRA), 2025
Yedong Shen
Xinran Zhang
YiFan Duan
Shiqi Zhang
Heng Li
Yilong Wu
Jianmin Ji
Yanyong Zhang
3DGS
108
3
0
20 Feb 2025
NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing
NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing
Shutong Zhang
293
1
0
15 Feb 2025
A Survey on Mamba Architecture for Vision Applications
A Survey on Mamba Architecture for Vision Applications
Fady Ibrahim
Guangjun Liu
Guanghui Wang
Mamba
383
9
0
11 Feb 2025
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing
Sicen Guo
Tianyou Wen
Chuang-Wei Liu
Qijun Chen
Rui Fan
364
0
0
10 Feb 2025
A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation
Canxuan Gang
MedImViT
225
1
0
08 Feb 2025
Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation
Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation
Jiahao Lu
Jiacheng Deng
Tianzhu Zhang
428
3
0
06 Feb 2025
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic SegmentationInternational Conference on Learning Representations (ICLR), 2025
Tao Zhang
Jinyong Wen
Zhen Chen
Kun Ding
Di Zhang
Chunhong Pan
401
2
0
04 Feb 2025
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis
B. Alawode
I. I. Ganapathi
S. Javed
Naoufel Werghi
Mohammed Bennamoun
Arif Mahmood
CLIPVLM
376
5
0
03 Feb 2025
Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation
Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation
Lin Chen
Qi Yang
Kun Ding
Tianying Wang
Gang Shen
Fei Li
Qiyuan Cao
Shiming Xiang
VLM
175
2
0
29 Jan 2025
Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification
Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identificationIEEE Transactions on Image Processing (IEEE TIP), 2025
Lanyun Zhu
Tianrun Chen
Deyi Ji
Jieping Ye
Jing Liu
390
7
0
28 Jan 2025
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
An Item is Worth a Prompt: Versatile Image Editing with Disentangled ControlAAAI Conference on Artificial Intelligence (AAAI), 2024
Aosong Feng
Weikang Qiu
Jinbin Bai
Xiao Zhang
Zhen Dong
Kaicheng Zhou
Rex Ying
Leandros Tassiulas
DiffM
269
8
0
28 Jan 2025
Previous
123...8910...323334
Next