Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,661 papers shown
Title
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
Kaiyu Li
Xiangyong Cao
Yupeng Deng
Chao Pang
Zepeng Xin
Deyu Meng
Zhi Wang
ObjD
290
6
0
22 Jan 2025
Neural Radiance Fields for the Real World: A Survey
Wenhui Xiao
Remi Chierchia
Rodrigo Santa Cruz
Xuesong Li
David Ahmedt-Aristizabal
Olivier Salvado
Clinton Fookes
Léo Lebrat
AI4CE
374
9
0
22 Jan 2025
Towards Accurate Unified Anomaly Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Wenxin Ma
Qingsong Yao
Xiang Zhang
Zhelong Huang
Zihang Jiang
S. Kevin Zhou
320
10
0
21 Jan 2025
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
Benjamin Kiefer
Lojze Žust
Jon Muhovič
Matej Kristan
J. Pers
...
Ashraf Saleem
Ching-Heng Cheng
Yu-Fan Lin
Tzu-Yu Lin
Chih-Chung Hsu
164
6
0
20 Jan 2025
Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation
Xingxin He
Yifan Hu
Zhaoye Zhou
Mohamed Jarraya
Fang Liu
VLM
MedIm
252
5
0
17 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
IEEE Reviews in Biomedical Engineering (RBME), 2024
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
666
66
0
17 Jan 2025
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding
IEEE transactions on multimedia (TMM), 2025
Haomiao Xiong
Yunzhi Zhuge
Jiawen Zhu
Lu Zhang
Huchuan Lu
209
10
0
14 Jan 2025
Static Segmentation by Tracking: A Label-Efficient Approach for Fine-Grained Specimen Image Segmentation
Zhenyang Feng
Zihe Wang
Saul Ibaven Bueno
Saul Ibaven Bueno
Tomasz Frelek
...
Hilmar Lapp
Charles V. Stewart
T. Berger-Wolf
Yu-Chuan Su
Wei-Lun Chao
252
0
0
12 Jan 2025
Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance
AAAI Conference on Artificial Intelligence (AAAI), 2024
Duc-Hai Pham
Duc Dung Nguyen
Anh Pham
Ho Lai Tuan
P. Nguyen
Khoi Duc Minh Nguyen
Rang Nguyen
3DPC
514
3
0
10 Jan 2025
AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish
Nejc Novak
Daniel Lehotský
Vasiliki Ismiroglou
Niels Madsen
T. Moeslund
Malte Pedersen
160
2
0
08 Jan 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
Xianrui Li
Tao Zhang
Zilong Huang
Shilin Xu
...
Yunhai Tong
Lu Qi
Jiashi Feng
Ming-Hsuan Yang
Ming-Hsuan Yang
VLM
526
81
0
07 Jan 2025
A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE JSTARS), 2024
Dawen Yu
Shunping Ji
ViT
266
5
0
03 Jan 2025
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2025
S. Park
Subeen Lee
Hyun Seok Seong
Jaejoon Yoo
Jae-Pil Heo
285
4
0
03 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Neural Information Processing Systems (NeurIPS), 2024
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLM
VLM
LRM
735
117
0
03 Jan 2025
Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves
Madeleine Darbyshire
Elizabeth I. Sklar
Simon Parsons
266
0
0
03 Jan 2025
PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Runnan Chen
Zhaoqing Wang
Jiepeng Wang
Yuexin Ma
Mingming Gong
Wenping Wang
Tongliang Liu
3DGS
272
5
0
03 Jan 2025
Unlocking adaptive digital pathology through dynamic feature learning
Jiawen Li
Tian Guan
Qingxin Xia
Yanjie Wang
Xitong Ling
...
Xiu-Wu Bian
Liang Luo
Lingchuan Guo
Chao He
Yonghong He
AI4CE
156
1
0
31 Dec 2024
LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training
International Conference on Intelligent Computing and its Emerging Applications (ICIEA), 2024
Fardin Ayar
Ehsan Javanmardi
Manabu Tsukada
Mahdi Javanmardi
Mohammad Rahmati
VOS
361
0
0
31 Dec 2024
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
452
5
0
29 Dec 2024
Towards Visual Grounding: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
863
27
0
28 Dec 2024
Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection and Simulation
Yikang Zhang
Chuang-Wei Liu
Jiahang Li
Yingbing Chen
Jie Cheng
Rui Fan
196
0
0
23 Dec 2024
Segmentation of arbitrary features in very high resolution remote sensing imagery
Henry Cording
Yves Plancherel
Pablo Brito-Parada
293
1
0
20 Dec 2024
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xinyue Chen
Miaojing Shi
Zijian Zhou
Lianghua He
Sophia Tsoka
225
2
0
20 Dec 2024
FashionComposer: Compositional Fashion Image Generation
S. Ji
Yiyang Wang
Xi Chen
Xiaohan Li
Hao Luo
Hengshuang Zhao
327
0
0
18 Dec 2024
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation
International Conference on Artificial Neural Networks (ICANN), 2024
J. Zhang
Li Zhang
Shijian Li
VLM
332
0
0
18 Dec 2024
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
Cong Wei
Yujie Zhong
Haoxian Tan
Yingsen Zeng
Yong Liu
Zheng Zhao
Yujiu Yang
MLLM
VLM
VOS
252
11
0
18 Dec 2024
Locate n' Rotate: Two-stage Openable Part Detection with Foundation Model Priors
Asian Conference on Computer Vision (ACCV), 2024
Siqi Li
Xiaoxue Chen
Haoyu Cheng
Guyue Zhou
Hao Zhao
Guanzhong Tian
377
1
0
17 Dec 2024
Open-World Panoptic Segmentation
Matteo Sodano
Federico Magistri
Jens Behley
Cyrill Stachniss
VLM
313
2
0
17 Dec 2024
Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Hongwei Niu
Linhuang Xie
Jianghang Lin
Shengchuan Zhang
290
11
0
16 Dec 2024
Expanded Comprehensive Robotic Cholecystectomy Dataset (CRCD)
K. Oh
Leonardo Borgioli
Alberto Mangano
Valentina Valle
Marco Di Pangrazio
...
Luciano Ambrosini
Alvaro Ducas
Milos Zefran
Liaohai Chen
P. Giulianotti
217
1
0
16 Dec 2024
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
287
4
0
16 Dec 2024
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zijian Gu
Jianwei Ma
Yan Huang
Honghao Wei
Zhanye Chen
Huatian Zhang
Wei Hong
312
10
0
16 Dec 2024
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Yunxiang Fu
Meng Lou
Yizhou Yu
600
18
0
16 Dec 2024
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
524
14
0
16 Dec 2024
Mask Enhanced Deeply Supervised Prostate Cancer Detection on B-mode Micro-Ultrasound
Lichun Zhang
Steve Zhou
Moon Hyung Choi
Jeong Hoon Lee
Shengtian Sang
...
Wei Shao
Ahmed N. El Kaffas
Richard E. Fan
G. Sonn
M. Rusu
MedIm
214
0
0
14 Dec 2024
Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation
Jurica Runtas
Tomislav Petkovic
UQCV
261
0
0
14 Dec 2024
MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance
Wenjun Huang
Jianguo Hu
199
0
0
14 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
271
1
0
13 Dec 2024
Coherent 3D Scene Diffusion From a Single RGB Image
Neural Information Processing Systems (NeurIPS), 2024
Manuel Dahnert
Angela Dai
Norman Muller
Matthias Nießner
204
2
0
13 Dec 2024
Continual Learning for Segment Anything Model Adaptation
Jinglong Yang
Yichen Wu
Jun Cen
Wenjian Huang
Hong Wang
Jianguo Zhang
TTA
CLL
201
3
0
09 Dec 2024
A Pipeline and NIR-Enhanced Dataset for Parking Lot Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Shirin Qiam
Saipraneeth Devunuri
Lewis J. Lehe
127
1
0
09 Dec 2024
Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Maxime Fontana
Michael W. Spratling
Miaojing Shi
MoE
VLM
342
0
0
04 Dec 2024
SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection
Joongwon Chae
Zhenyu Wang
Peiwu Qin
VLM
260
0
0
03 Dec 2024
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
Computer Vision and Pattern Recognition (CVPR), 2024
Byung-Kwan Lee
Ryo Hachiuma
Yu-Chiang Frank Wang
Y. Ro
Yueh-Hua Wu
VLM
365
4
0
02 Dec 2024
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Anton Voronov
Denis Kuznedelev
Mikhail Khoroshikh
Valentin Khrulkov
Dmitry Baranchuk
560
17
0
02 Dec 2024
Referring Video Object Segmentation via Language-aligned Track Selection
Seongchan Kim
Woojeong Jin
Sangbeom Lim
Heeji Yoon
Hyunwook Choi
Seungryong Kim
VOS
365
4
0
02 Dec 2024
SyncVIS: Synchronized Video Instance Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
281
1
0
01 Dec 2024
EDTformer: An Efficient Decoder Transformer for Visual Place Recognition
Tong Jin
Feng Lu
Shuyu Hu
Chun Yuan
Yunpeng Liu
ViT
405
3
0
01 Dec 2024
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLM
VLM
3DV
552
1
0
29 Nov 2024
Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation
Finlay G. C. Hudson
W. Smith
VOS
VLM
321
0
0
28 Nov 2024
Previous
1
2
3
...
9
10
11
...
32
33
34
Next