Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
Kaiyu Li
Xiangyong Cao
Yupeng Deng
Chao Pang
Zepeng Xin
Deyu Meng
Zhi Wang
ObjD
290
6
0
22 Jan 2025
Neural Radiance Fields for the Real World: A Survey
Wenhui Xiao
Remi Chierchia
Rodrigo Santa Cruz
Xuesong Li
David Ahmedt-Aristizabal
Olivier Salvado
Clinton Fookes
Léo Lebrat
AI4CE
374
9
0
22 Jan 2025
Towards Accurate Unified Anomaly Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Wenxin Ma
Qingsong Yao
Xiang Zhang
Zhelong Huang
Zihang Jiang
S. Kevin Zhou
320
10
0
21 Jan 2025
3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results
Benjamin Kiefer
Lojze Žust
Jon Muhovič
Matej Kristan
J. Pers
...
Ashraf Saleem
Ching-Heng Cheng
Yu-Fan Lin
Tzu-Yu Lin
Chih-Chung Hsu
164
6
0
20 Jan 2025
Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation
Xingxin He
Yifan Hu
Zhaoye Zhou
Mohamed Jarraya
Fang Liu
VLM, MedIm
252
5
0
17 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
IEEE Reviews in Biomedical Engineering (RBME), 2024
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE, LM&MA, VLM
666
66
0
17 Jan 2025
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding
IEEE Transactions on Multimedia (TMM), 2025
Haomiao Xiong
Yunzhi Zhuge
Jiawen Zhu
Lu Zhang
Huchuan Lu
209
10
0
14 Jan 2025
Static Segmentation by Tracking: A Label-Efficient Approach for Fine-Grained Specimen Image Segmentation
Zhenyang Feng
Zihe Wang
Saul Ibaven Bueno
Tomasz Frelek
...
Hilmar Lapp
Charles V. Stewart
T. Berger-Wolf
Yu-Chuan Su
Wei-Lun Chao
252
0
0
12 Jan 2025
Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance
AAAI Conference on Artificial Intelligence (AAAI), 2024
Duc-Hai Pham
Duc Dung Nguyen
Anh Pham
Ho Lai Tuan
P. Nguyen
Khoi Duc Minh Nguyen
Rang Nguyen
3DPC
514
3
0
10 Jan 2025
AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish
Nejc Novak
Daniel Lehotský
Vasiliki Ismiroglou
Niels Madsen
T. Moeslund
Malte Pedersen
160
2
0
08 Jan 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
Xianrui Li
Tao Zhang
Zilong Huang
Shilin Xu
...
Yunhai Tong
Lu Qi
Jiashi Feng
Ming-Hsuan Yang
VLM
526
81
0
07 Jan 2025
A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (IEEE JSTARS), 2024
Dawen Yu
Shunping Ji
ViT
266
5
0
03 Jan 2025
Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2025
S. Park
Subeen Lee
Hyun Seok Seong
Jaejoon Yoo
Jae-Pil Heo
285
4
0
03 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
Neural Information Processing Systems (NeurIPS), 2024
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLM, VLM, LRM
735
117
0
03 Jan 2025
Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves
Madeleine Darbyshire
Elizabeth I. Sklar
Simon Parsons
266
0
0
03 Jan 2025
PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Runnan Chen
Zhaoqing Wang
Jiepeng Wang
Yuexin Ma
Mingming Gong
Wenping Wang
Tongliang Liu
3DGS
272
5
0
03 Jan 2025
Unlocking adaptive digital pathology through dynamic feature learning
Jiawen Li
Tian Guan
Qingxin Xia
Yanjie Wang
Xitong Ling
...
Xiu-Wu Bian
Liang Luo
Lingchuan Guo
Chao He
Yonghong He
AI4CE
156
1
0
31 Dec 2024
LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training
International Conference on Intelligent Computing and its Emerging Applications (ICIEA), 2024
Fardin Ayar
Ehsan Javanmardi
Manabu Tsukada
Mahdi Javanmardi
Mohammad Rahmati
VOS
361
0
0
31 Dec 2024
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
452
5
0
29 Dec 2024
Towards Visual Grounding: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
863
27
0
28 Dec 2024
Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection and Simulation
Yikang Zhang
Chuang-Wei Liu
Jiahang Li
Yingbing Chen
Jie Cheng
Rui Fan
196
0
0
23 Dec 2024
Segmentation of arbitrary features in very high resolution remote sensing imagery
Henry Cording
Yves Plancherel
Pablo Brito-Parada
293
1
0
20 Dec 2024
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xinyue Chen
Miaojing Shi
Zijian Zhou
Lianghua He
Sophia Tsoka
225
2
0
20 Dec 2024
FashionComposer: Compositional Fashion Image Generation
S. Ji
Yiyang Wang
Xi Chen
Xiaohan Li
Hao Luo
Hengshuang Zhao
327
0
0
18 Dec 2024
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation
International Conference on Artificial Neural Networks (ICANN), 2024
J. Zhang
Li Zhang
Shijian Li
VLM
332
0
0
18 Dec 2024
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
Cong Wei
Yujie Zhong
Haoxian Tan
Yingsen Zeng
Yong Liu
Zheng Zhao
Yujiu Yang
MLLM, VLM, VOS
252
11
0
18 Dec 2024
Locate n' Rotate: Two-stage Openable Part Detection with Foundation Model Priors
Asian Conference on Computer Vision (ACCV), 2024
Siqi Li
Xiaoxue Chen
Haoyu Cheng
Guyue Zhou
Hao Zhao
Guanzhong Tian
377
1
0
17 Dec 2024
Open-World Panoptic Segmentation
Matteo Sodano
Federico Magistri
Jens Behley
Cyrill Stachniss
VLM
313
2
0
17 Dec 2024
Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Hongwei Niu
Linhuang Xie
Jianghang Lin
Shengchuan Zhang
290
11
0
16 Dec 2024
Expanded Comprehensive Robotic Cholecystectomy Dataset (CRCD)
K. Oh
Leonardo Borgioli
Alberto Mangano
Valentina Valle
Marco Di Pangrazio
...
Luciano Ambrosini
Alvaro Ducas
Milos Zefran
Liaohai Chen
P. Giulianotti
217
1
0
16 Dec 2024
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
287
4
0
16 Dec 2024
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zijian Gu
Jianwei Ma
Yan Huang
Honghao Wei
Zhanye Chen
Huatian Zhang
Wei Hong
312
10
0
16 Dec 2024
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Yunxiang Fu
Meng Lou
Yizhou Yu
600
18
0
16 Dec 2024
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
524
14
0
16 Dec 2024
Mask Enhanced Deeply Supervised Prostate Cancer Detection on B-mode Micro-Ultrasound
Lichun Zhang
Steve Zhou
Moon Hyung Choi
Jeong Hoon Lee
Shengtian Sang
...
Wei Shao
Ahmed N. El Kaffas
Richard E. Fan
G. Sonn
M. Rusu
MedIm
214
0
0
14 Dec 2024
Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation
Jurica Runtas
Tomislav Petkovic
UQCV
261
0
0
14 Dec 2024
MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance
Wenjun Huang
Jianguo Hu
199
0
0
14 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
271
1
0
13 Dec 2024
Coherent 3D Scene Diffusion From a Single RGB Image
Neural Information Processing Systems (NeurIPS), 2024
Manuel Dahnert
Angela Dai
Norman Muller
Matthias Nießner
204
2
0
13 Dec 2024
Continual Learning for Segment Anything Model Adaptation
Jinglong Yang
Yichen Wu
Jun Cen
Wenjian Huang
Hong Wang
Jianguo Zhang
TTA, CLL
201
3
0
09 Dec 2024
A Pipeline and NIR-Enhanced Dataset for Parking Lot Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Shirin Qiam
Saipraneeth Devunuri
Lewis J. Lehe
127
1
0
09 Dec 2024
Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Maxime Fontana
Michael W. Spratling
Miaojing Shi
MoE, VLM
342
0
0
04 Dec 2024
SJTU: Spatial judgments in multimodal models towards unified segmentation through coordinate detection
Joongwon Chae
Zhenyu Wang
Peiwu Qin
VLM
260
0
0
03 Dec 2024
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
Computer Vision and Pattern Recognition (CVPR), 2024
Byung-Kwan Lee
Ryo Hachiuma
Yu-Chiang Frank Wang
Y. Ro
Yueh-Hua Wu
VLM
365
4
0
02 Dec 2024
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Anton Voronov
Denis Kuznedelev
Mikhail Khoroshikh
Valentin Khrulkov
Dmitry Baranchuk
560
17
0
02 Dec 2024
Referring Video Object Segmentation via Language-aligned Track Selection
Seongchan Kim
Woojeong Jin
Sangbeom Lim
Heeji Yoon
Hyunwook Choi
Seungryong Kim
VOS
365
4
0
02 Dec 2024
SyncVIS: Synchronized Video Instance Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
281
1
0
01 Dec 2024
EDTformer: An Efficient Decoder Transformer for Visual Place Recognition
Tong Jin
Feng Lu
Shuyu Hu
Chun Yuan
Yunpeng Liu
ViT
405
3
0
01 Dec 2024
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLM, VLM, 3DV
552
1
0
29 Nov 2024
Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation
Finlay G. C. Hudson
W. Smith
VOS, VLM
321
0
0
28 Nov 2024