arXiv:2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,359 papers shown
Title
Universal Scene Graph Generation
Shengqiong Wu
Hao Fei
Tat-Seng Chua
36
0
0
19 Mar 2025
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
Runsong Zhu
Shi Qiu
Zhengzhe Liu
Ka-Hei Hui
Qianyi Wu
Pheng Ann Heng
Chi-Wing Fu
3DGS
3DV
88
1
0
18 Mar 2025
The Power of Context: How Multimodality Improves Image Super-Resolution
Kangfu Mei
Hossein Talebi
Mojtaba Ardakani
Vishal M. Patel
P. Milanfar
M. Delbracio
DiffM
77
1
0
18 Mar 2025
Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
Chen Liu
Liying Yang
Peike Li
Dadong Wang
Lincheng Li
Xin Yu
VOS
94
0
0
17 Mar 2025
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
Tao Wang
Changxu Cheng
Lingfeng Wang
Senda Chen
Wuyue Zhao
VLM
67
0
0
17 Mar 2025
MTGS: Multi-Traversal Gaussian Splatting
Tianyu Li
Yihang Qiu
Zhenhua Wu
Carl Lindström
Peng Su
Matthias Nießner
Hongyang Li
3DGS
62
0
0
16 Mar 2025
Shape Bias and Robustness Evaluation via Cue Decomposition for Image Classification and Segmentation
Edgar Heinert
Thomas Gottwald
Annika Mütze
Matthias Rottmann
60
0
0
16 Mar 2025
Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding
Imran Kabir
Md. Alimoor Reza
Syed Masum Billah
ReLM
VLM
LRM
78
0
0
16 Mar 2025
E-SAM: Training-Free Segment Every Entity Model
Weiming Zhang
Dingwen Xiao
Lei Chen
Lin Wang
VLM
57
0
0
15 Mar 2025
SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets
Hao Liu
Pengyu Guo
Siyuan Yang
Zeqing Jiang
Qinglei Hu
Dongyu Li
40
0
0
14 Mar 2025
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
Christopher Xie
A. Avetisyan
Henry Howard-Jenkins
Yawar Siddiqui
Julian Straub
Richard A. Newcombe
Vasileios Balntas
Jakob Julian Engel
3DH
3DV
65
0
0
14 Mar 2025
VGGT: Visual Geometry Grounded Transformer
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
48
7
0
14 Mar 2025
Learning Appearance and Motion Cues for Panoptic Tracking
Juana Valeria Hurtado
Sajad Marvi
Rohit Mohan
Abhinav Valada
68
0
0
12 Mar 2025
Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation
Máté Tóth
Péter Kovács
Zoltán Bendefy
Zoltán Hortsin
Balázs Teréki
Tamás Matuszka
3DGS
AI4CE
126
0
0
12 Mar 2025
3D Medical Imaging Segmentation on Non-Contrast CT
Canxuan Gang
Yuhan Peng
55
0
0
11 Mar 2025
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
Lehan Yang
Jincen Song
Tianlong Wang
Daiqing Qi
Weili Shi
Yuheng Liu
Sheng Li
DiffM
VOS
VGen
69
0
0
11 Mar 2025
From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans
Qinji Yu
Yirui Wang
K. Yan
Dandan Zheng
Dashan Ai
...
N. Shen
Xiaowei Ding
Le Lu
X. Ye
Dakai Jin
ViT
MedIm
57
0
0
11 Mar 2025
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation
Anzhe Cheng
Chenzhong Yin
Yu Chang
Heng Ping
Shixuan Li
Shahin Nazarian
Paul Bogdan
SSeg
86
0
0
11 Mar 2025
SAS: Segment Any 3D Scene with Integrated 2D Priors
Z. Li
Jiahao Lu
Jiacheng Deng
Hanzhi Chang
Lifan Wu
Yanzhe Liang
Tianzhu Zhang
55
0
0
11 Mar 2025
TrackOcc: Camera-based 4D Panoptic Occupancy Tracking
Zhuoguang Chen
Kenan Li
Xiuyu Yang
Tao Jiang
Y. Li
Hang Zhao
52
0
0
11 Mar 2025
MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution
X. Li
Jianlong Wu
Xinchuan Huang
C. L. Philip Chen
Weili Guan
Xian-Sheng Hua
Liqiang Nie
DiffM
51
0
0
11 Mar 2025
Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework
Zhuo Zhi
Chen Feng
Adam Daneshmend
Mine Orlu
Andreas Demosthenous
L. Yin
Da Li
Ziquan Liu
Miguel R. D. Rodrigues
LRM
64
1
0
11 Mar 2025
DiffEGG: Diffusion-Driven Edge Generation as a Pixel-Annotation-Free Alternative for Instance Annotation
Sanghyun Jo
Ziseok Lee
Wooyeol Lee
Kyungsu Kim
34
0
0
11 Mar 2025
Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts
Shiu-hong Kao
Yu-Wing Tai
Chi-Keung Tang
LRM
MLLM
52
0
0
10 Mar 2025
FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection
Takeru Inoue
Ryusuke Miyamoto
44
0
0
10 Mar 2025
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement
Yuqi Liu
Bohao Peng
Zhisheng Zhong
Zihao Yue
Fanbin Lu
Bei Yu
Jiaya Jia
LRM
VLM
45
10
0
09 Mar 2025
Dynamic Dictionary Learning for Remote Sensing Image Segmentation
Xuechao Zou
Yue Li
Shun Zhang
Kai Li
Shiying Wang
Pin Tao
Junliang Xing
Congyan Lang
48
0
0
09 Mar 2025
COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation
Aurelio Noca
Xianmei Lei
Jonathan Becktor
J. Edlund
Anna Sabel
Patrick Spieler
Curtis Padgett
Alexandre Alahi
Deegan Atha
50
0
0
05 Mar 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Naoto Yokoya
48
0
0
05 Mar 2025
Golden Cudgel Network for Real-Time Semantic Segmentation
Guoyu Yang
Yuan Wang
Daming Shi
Y. Wang
45
0
0
05 Mar 2025
Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art
Youssef Shoeb
Azarm Nowzad
Hanno Gottschalk
UQCV
78
2
0
04 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
71
0
0
04 Mar 2025
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang
Chenwei Xie
Haiyang Wang
Xiaoyi Bao
Tingyu Weng
Pandeng Li
Yun Zheng
Liwei Wang
ObjD
VLM
54
0
0
03 Mar 2025
Object-Aware Video Matting with Cross-Frame Guidance
H. Zhang
Dongyue Wu
Yuanjie Shao
Nong Sang
Changxin Gao
VOS
77
0
0
03 Mar 2025
One-shot In-context Part Segmentation
Zhenqi Dai
Ting Liu
X. Zhang
Y. X. Wei
Yanning Zhang
VLM
80
1
0
03 Mar 2025
Training-Free Dataset Pruning for Instance Segmentation
Yalun Dai
Lingao Xiao
Ivor W. Tsang
Yang He
ISeg
43
0
0
02 Mar 2025
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
Y. Wang
Jingchen Ni
Yong-Jin Liu
Chun Yuan
Yansong Tang
47
1
0
02 Mar 2025
Open-Vocabulary Semantic Part Segmentation of 3D Human
Keito Suzuki
Bang Du
Girish Krishnan
Kunyao Chen
Runfa Li
Truong Thao Nguyen
3DH
VLM
94
0
0
27 Feb 2025
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts
Zhongyang Li
Ziyue Li
Tianyi Zhou
MoE
46
0
0
27 Feb 2025
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
Elkhan Ismayilzada
MD Khalequzzaman Chowdhury Sayem
Yihalem Yimolal Tiruneh
Mubarrat Chowdhury
Muhammadjon Boboev
Seungryul Baek
ViT
65
1
0
27 Feb 2025
Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach
Anton Backhaus
Thorsten Luettel
Mirko Maehlisch
38
0
0
26 Feb 2025
A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images
N. Shvetsov
T. Kilvaer
M. Tafavvoghi
Anders Sildnes
Kajsa Møllersen
Lill-Tove Rasmussen Busund
L. A. Bongo
VLM
66
1
0
26 Feb 2025
CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation
Vishal G. Thengane
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
L. Yin
Xiatian Zhu
Salman Khan
CLL
103
0
0
24 Feb 2025
Vision-LSTM: xLSTM as Generic Vision Backbone
Benedikt Alkin
M. Beck
Korbinian Poppel
Sepp Hochreiter
Johannes Brandstetter
VLM
56
42
0
24 Feb 2025
Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement
Rui Liu
39
0
0
24 Feb 2025
NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing
Shutong Zhang
37
0
0
15 Feb 2025
Generalized Class Discovery in Instance Segmentation
Cuong Manh Hoang
Yeejin Lee
Byeongkeun Kang
ISeg
87
0
0
12 Feb 2025
A Survey on Mamba Architecture for Vision Applications
Fady Ibrahim
Guangjun Liu
Guanghui Wang
Mamba
48
2
0
11 Feb 2025
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing
Sicen Guo
Tianyou Wen
Chuang-Wei Liu
Qijun Chen
Rui Fan
55
0
0
10 Feb 2025
A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation
Canxuan Gang
MedIm
ViT
47
0
0
08 Feb 2025