ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
Title
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and
  Self-Prompting
ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting
Yankai Jiang
Zhongzhen Huang
Rongzhao Zhang
Xiaofan Zhang
Shaoting Zhang
VLM
247
19
0
07 Dec 2023
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
Yong Liu
Sule Bai
Guanbin Li
Yitong Wang
Yansong Tang
VLM
192
47
0
07 Dec 2023
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
255
14
0
06 Dec 2023
Foundation Model Assisted Weakly Supervised Semantic Segmentation
Foundation Model Assisted Weakly Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
326
38
0
06 Dec 2023
AI-SAM: Automatic and Interactive Segment Anything Model
AI-SAM: Automatic and Interactive Segment Anything Model
Yimu Pan
Sitao Zhang
Alison D. Gernand
Jeffery A. Goldstein
J. Z. Wang
VLM
186
10
0
05 Dec 2023
RotaTR: Detection Transformer for Dense and Rotated Object
RotaTR: Detection Transformer for Dense and Rotated Object
Yuke Zhu
Yumeng Ruan
Lei Yang
Sheng Guo
212
1
0
05 Dec 2023
Uni3DL: Unified Model for 3D and Language Understanding
Uni3DL: Unified Model for 3D and Language Understanding
Xiang Li
Jian Ding
Zhaoyang Chen
Mohamed Elhoseiny
304
9
0
05 Dec 2023
Lenna: Language Enhanced Reasoning Detection Assistant
Lenna: Language Enhanced Reasoning Detection Assistant
Fei Wei
Xinyu Zhang
Ailing Zhang
Bo Zhang
Xiangxiang Chu
MLLMLRM
234
29
0
05 Dec 2023
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty AwarenessComputer Vision and Pattern Recognition (CVPR), 2023
Anh-Quan Cao
Angela Dai
Raoul de Charette
UQCV
217
35
0
04 Dec 2023
Aligning and Prompting Everything All at Once for Universal Visual
  Perception
Aligning and Prompting Everything All at Once for Universal Visual PerceptionComputer Vision and Pattern Recognition (CVPR), 2023
Chunjiang Ge
Chaoyou Fu
Peixian Chen
Mengdan Zhang
Ke Li
Xing Sun
Yunsheng Wu
Shaohui Lin
Rongrong Ji
VLMObjD
255
63
0
04 Dec 2023
UniGS: Unified Representation for Image Generation and Segmentation
UniGS: Unified Representation for Image Generation and SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Lu Qi
Lehan Yang
Weidong Guo
Yu-Syuan Xu
Bo Du
Varun Jampani
Ming-Hsuan Yang
229
25
0
04 Dec 2023
Unveiling Objects with SOLA: An Annotation-Free Image Search on the
  Object Level for Automotive Data Sets
Unveiling Objects with SOLA: An Annotation-Free Image Search on the Object Level for Automotive Data Sets
Philipp Rigoll
Jacob Langner
Eric Sax
200
5
0
04 Dec 2023
Effective Adapter for Face Recognition in the Wild
Effective Adapter for Face Recognition in the Wild
Yunhao Liu
Yu-Ju Tsai
Kelvin C. K. Chan
Xiangtai Li
Lu Qi
Ming-Hsuan Yang
CVBM
210
1
0
04 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language
  Instruction
Universal Segmentation at Arbitrary Granularity with Language InstructionComputer Vision and Pattern Recognition (CVPR), 2023
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLMVOS
233
27
0
04 Dec 2023
SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
SCLIP: Rethinking Self-Attention for Dense Vision-Language InferenceEuropean Conference on Computer Vision (ECCV), 2023
Feng Wang
Jieru Mei
Yaoyao Liu
VLM
329
116
0
04 Dec 2023
SANeRF-HQ: Segment Anything for NeRF in High Quality
SANeRF-HQ: Segment Anything for NeRF in High QualityComputer Vision and Pattern Recognition (CVPR), 2023
Yichen Liu
Benran Hu
Chi-Keung Tang
Yu-Wing Tai
247
22
0
03 Dec 2023
Segment and Caption Anything
Segment and Caption AnythingComputer Vision and Pattern Recognition (CVPR), 2023
Xiaoke Huang
Jianfeng Wang
Yansong Tang
Zheng Zhang
Han Hu
Jiwen Lu
Lijuan Wang
Zicheng Liu
MLLMVLM
198
32
0
01 Dec 2023
Sequential Modeling Enables Scalable Learning for Large Vision Models
Sequential Modeling Enables Scalable Learning for Large Vision ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Yutong Bai
Xinyang Geng
K. Mangalam
Amir Bar
Alan Yuille
Trevor Darrell
Jitendra Malik
Alexei A. Efros
MLLMVLM
300
222
0
01 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment
  Anything
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment AnythingComputer Vision and Pattern Recognition (CVPR), 2023
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
308
230
0
01 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion
  Models
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGenDiffM
146
0
0
01 Dec 2023
InstructSeq: Unifying Vision Tasks with Instruction-conditioned
  Multi-modal Sequence Generation
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation
Rongyao Fang
Shilin Yan
Zhaoyang Huang
Jingqiu Zhou
Hao Tian
Jifeng Dai
Jiaming Song
MLLM
184
16
0
30 Nov 2023
A Lightweight Clustering Framework for Unsupervised Semantic
  Segmentation
A Lightweight Clustering Framework for Unsupervised Semantic Segmentation
Yau Shing Jonathan Cheung
Xi Chen
Lihe Yang
Hengshuang Zhao
293
1
0
30 Nov 2023
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
Ju He
Qihang Yu
Inkyu Shin
XueQing Deng
Yaoyao Liu
Xiaohui Shen
Liang-Chieh Chen
VOS
302
3
0
30 Nov 2023
One-Shot Open Affordance Learning with Foundation Models
One-Shot Open Affordance Learning with Foundation ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Gen Li
Deqing Sun
Laura Sevilla-Lara
Varun Jampani
VLM
298
47
0
29 Nov 2023
Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation
Focus on Query: Adversarial Mining Transformer for Few-Shot SegmentationNeural Information Processing Systems (NeurIPS), 2023
Yuan Wang
Naisong Luo
Tianzhu Zhang
211
21
0
29 Nov 2023
Continual Learning for Image Segmentation with Dynamic Query
Continual Learning for Image Segmentation with Dynamic Query
Weijia Wu
Yuzhong Zhao
Zhuang Li
Lianlei Shan
Hong Zhou
Mike Zheng Shou
VLMCLL
266
28
0
29 Nov 2023
How does spatial structure affect psychological restoration? A method
  based on Graph Neural Networks and Street View Imagery
How does spatial structure affect psychological restoration? A method based on Graph Neural Networks and Street View ImageryLandscape and Urban Planning (LUP), 2023
Haoran Ma
Yan Zhang
Pengyuan Liu
Fan Zhang
Pengyu Zhu
123
30
0
29 Nov 2023
Panoptic Video Scene Graph Generation
Panoptic Video Scene Graph GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Jingkang Yang
Wen-Hsiao Peng
Xiangtai Li
Zujin Guo
Liangyu Chen
...
Zheng Ma
Kaiyang Zhou
Wayne Zhang
Chen Change Loy
Ziwei Liu
VOS
271
53
0
28 Nov 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
TransNeXt: Robust Foveal Visual Perception for Vision TransformersComputer Vision and Pattern Recognition (CVPR), 2023
Dai Shi
ViT
228
241
0
28 Nov 2023
LLaFS: When Large Language Models Meet Few-Shot Segmentation
LLaFS: When Large Language Models Meet Few-Shot SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Lanyun Zhu
Tianrun Chen
Deyi Ji
Jieping Ye
Jun Liu
VLM
474
71
0
28 Nov 2023
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation
Zijian Zhou
Miaojing Shi
Holger Caesar
VLM
257
14
0
27 Nov 2023
SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using
  Neural Radiance Fields
SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance FieldsComputer Vision and Pattern Recognition (CVPR), 2023
Quentin Herau
Nathan Piasco
Moussâb Bennehar
Luis Roldão
D. Tsishkou
Cyrille Migniot
Pascal Vasseur
C. Demonceaux
205
15
0
27 Nov 2023
Stable Segment Anything Model
Stable Segment Anything ModelInternational Conference on Learning Representations (ICLR), 2023
Qi Fan
Xin Tao
Lei Ke
Mingqiao Ye
Yuanhui Zhang
Pengfei Wan
Zhong-ming Wang
Yu-Wing Tai
Chi-Keung Tang
VLM
201
10
0
27 Nov 2023
RISAM: Referring Image Segmentation via Mutual-Aware Attention Features
RISAM: Referring Image Segmentation via Mutual-Aware Attention Features
Mengxi Zhang
Yiming Liu
Xiangjun Yin
Huanjing Yue
Jingyu Yang
352
1
0
27 Nov 2023
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Bin Xie
Jiale Cao
Jin Xie
Fahad Shahbaz Khan
Yanwei Pang
VLM
225
86
0
27 Nov 2023
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023
Thanh-Dat Truong
Utsav Prabhu
Bhiksha Raj
Jackson Cothren
Khoa Luu
CLL
370
8
0
27 Nov 2023
SEGIC: Unleashing the Emergent Correspondence for In-Context
  Segmentation
SEGIC: Unleashing the Emergent Correspondence for In-Context SegmentationEuropean Conference on Computer Vision (ECCV), 2023
Lingchen Meng
Shiyi Lan
Hengduo Li
Jose M. Alvarez
Zuxuan Wu
Yu-Gang Jiang
VLMISegMLLM
209
14
0
24 Nov 2023
OneFormer3D: One Transformer for Unified Point Cloud Segmentation
OneFormer3D: One Transformer for Unified Point Cloud SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Maksim Kolodiazhnyi
Anna Vorontsova
Anton Konushin
D. Rukhovich
ViT
227
101
0
24 Nov 2023
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
Benjamin Kiefer
Lojze Žust
Matej Kristan
J. Pers
Matija Tersek
...
Magdalena Šumunec
Nadir Kapetanović
A. Michel
Wolfgang Gross
Martin Weinmann
165
8
0
23 Nov 2023
Visual In-Context Prompting
Visual In-Context PromptingComputer Vision and Pattern Recognition (CVPR), 2023
Feng Li
Qing Jiang
Hao Zhang
Tianhe Ren
Shilong Liu
...
Hongyang Li
Chun-yue Li
Jianwei Yang
Lei Zhang
Jianfeng Gao
VLMLRMMLLM
164
51
0
22 Nov 2023
T-Rex: Counting by Visual Prompting
T-Rex: Counting by Visual Prompting
Qing Jiang
Feng Li
Tianhe Ren
Shilong Liu
Zhaoyang Zeng
Kent Yu
Lei Zhang
181
20
0
22 Nov 2023
Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for
  Enhanced Dataset Pruning
Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset PruningComputer Vision and Pattern Recognition (CVPR), 2023
Xin Zhang
Jiawei Du
Yunsong Li
Weiying Xie
Qiufeng Wang
311
28
0
22 Nov 2023
Exploring Lip Segmentation Techniques in Computer Vision: A Comparative
  Analysis
Exploring Lip Segmentation Techniques in Computer Vision: A Comparative AnalysisLatin American Conference on Computational Intelligence (LACCI), 2023
Pietro Masur
Francisco Braulio Oliveira
Lucas Moreira Medino
Emanuel Huber
Milene Haraguchi Padilha
Cássio De Alcantara
Renata Sellaro
125
1
0
20 Nov 2023
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene
  Understanding
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding
Hao Li
Dingwen Zhang
Yalun Dai
Nian Liu
Lechao Cheng
Jingfeng Li
Jingdong Wang
Junwei Han
254
27
0
20 Nov 2023
OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive
  Learning
OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning
Haiyang Ying
Yixuan Yin
Jinzhi Zhang
Fan Wang
Tao Yu
Ruqi Huang
Lu Fang
115
59
0
20 Nov 2023
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based
  Latent Diffusion Model
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model
Chunming He
Chengyu Fang
Yulun Zhang
Chenyu You
Kai Li
Longxiang Tang
Fengyang Xiao
Xiu Li
Z. Guo
357
57
0
20 Nov 2023
Open-Vocabulary Camouflaged Object Segmentation
Open-Vocabulary Camouflaged Object Segmentation
Youwei Pang
Xiaoqi Zhao
Jiaming Zuo
Lihe Zhang
Huchuan Lu
VLMObjD
289
11
0
19 Nov 2023
Enhancing Transformer-Based Segmentation for Breast Cancer Diagnosis
  using Auto-Augmentation and Search Optimisation Techniques
Enhancing Transformer-Based Segmentation for Breast Cancer Diagnosis using Auto-Augmentation and Search Optimisation Techniques
Leon Hamnett
M. Adewunmi
M. Abayomi
Kayode Raheem
Fahad Ahmed
ViT
68
1
0
18 Nov 2023
Segment Anything in Defect Detection
Segment Anything in Defect Detection
Bozhen Hu
Bin Gao
Cheng Tan
Tongle Wu
Stan Z. Li
84
7
0
17 Nov 2023
Towards Open-Ended Visual Recognition with Large Language Model
Towards Open-Ended Visual Recognition with Large Language Model
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
VLM
202
8
0
14 Nov 2023
Previous
123...212223...323334
Next