Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields
IEEE International Conference on Robotics and Automation (ICRA), 2023
Zheng Chen
Qingan Yan
Huangying Zhan
Changjiang Cai
Xiangyu Xu
Yuzhong Huang
Weihan Wang
Ziyue Feng
Lantao Liu
Yi Tian Xu
3DV
237
6
0
30 Dec 2023
Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation
Tuan-Anh Vu
Duc Thanh Nguyen
Qing Guo
Binh-Son Hua
N. Chung
Ivor W. Tsang
Sai-Kit Yeung
DiffM
186
5
0
29 Dec 2023
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xin Zhang
Jinheng Xie
Yuan Yuan
Michael Bi Mi
Robby T. Tan
VOS OCL VLM
377
8
0
29 Dec 2023
Amodal Ground Truth and Completion in the Wild
Computer Vision and Pattern Recognition (CVPR), 2023
Guanqi Zhan
Chuanxia Zheng
Weidi Xie
Andrew Zisserman
227
42
0
28 Dec 2023
Unsupervised Universal Image Segmentation
Dantong Niu
Xudong Wang
Xinyang Han
Long Lian
Roei Herzig
Trevor Darrell
VLM
231
35
0
28 Dec 2023
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model
Senqiao Yang
Tianyuan Qu
Xin Lai
Zhuotao Tian
Bohao Peng
Shu Liu
Jiaya Jia
VLM
356
45
0
28 Dec 2023
Fully Sparse 3D Occupancy Prediction
Haisong Liu
Yang Chen
Haiguang Wang
Zetong Yang
Tianyu Li
Jia Zeng
Li Chen
Hongyang Li
Limin Wang
342
41
0
28 Dec 2023
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving
Tianyu Li
Peijin Jia
Bangjun Wang
Li Chen
Yunlong Wang
Junchi Yan
Hongyang Li
170
61
0
26 Dec 2023
Semantic-aware SAM for Point-Prompted Instance Segmentation
Zhaoyang Wei
Pengfei Chen
Xuehui Yu
Guorong Li
Jianbin Jiao
Zhenjun Han
VLM
307
18
0
26 Dec 2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
257
25
0
25 Dec 2023
WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments
Kavisha Vidanapathirana
Joshua Knights
Stephen Hausler
Mark Cox
Milad Ramezani
...
Ethan Griffiths
Shaheer Mohamed
Sridha Sridharan
Clinton Fookes
Peyman Moghadam
3DV
225
15
0
23 Dec 2023
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Qiang Wan
Zilong Huang
Bingyi Kang
Jiashi Feng
Li Zhang
MDE VLM
243
19
0
22 Dec 2023
SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation
Wenxi Yue
Jing Zhang
Kun Hu
Qiuxia Wu
Zongyuan Ge
Yong Xia
Jiebo Luo
Zhiyong Wang
168
7
0
22 Dec 2023
UniHuman: A Unified Model for Editing Human Images in the Wild
Nannan Li
Qing Liu
Krishna Kumar Singh
Yilin Wang
Jianming Zhang
Bryan A. Plummer
Zhe Lin
154
12
0
22 Dec 2023
Leveraging Habitat Information for Fine-grained Bird Identification
Tin Nguyen
Peijie Chen
Anh Totti Nguyen
VLM
364
1
0
22 Dec 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Jitesh Jain
Jianwei Yang
Humphrey Shi
MLLM
154
47
0
21 Dec 2023
TinySAM: Pushing the Envelope for Efficient Segment Anything Model
Han Shu
Wenshuo Li
Yehui Tang
Yiman Zhang
Yihao Chen
Houqiang Li
Yunhe Wang
Xinghao Chen
VLM
370
34
0
21 Dec 2023
Unlocking Pre-trained Image Backbones for Semantic Image Synthesis
Tariq Berrada
Jakob Verbeek
Camille Couprie
Alahari Karteek
204
15
0
20 Dec 2023
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
H. Bai
Henghui Ding
Jun Hao Liew
Jiajun Liu
Yao-Min Zhao
Yunchao Wei
DiffM
183
38
0
19 Dec 2023
Mask Grounding for Referring Image Segmentation
Yong Xien Chng
Henry Zheng
Yizeng Han
Xuchong Qiu
Gao Huang
ISeg ObjD
342
38
0
19 Dec 2023
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation
Sangyun Shin
Kaichen Zhou
M. Vankadari
Andrew Markham
Niki Trigoni
3DPC
265
21
0
18 Dec 2023
MatchDet: A Collaborative Framework for Image Matching and Object Detection
Jinxiang Lai
Wenlong Wu
Bin-Bin Gao
Jun Liu
Jiawei Zhan
Congchong Nie
Yi Zeng
Chengjie Wang
VLM
260
1
0
18 Dec 2023
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
P. Nguyen
T.D. Ngo
E. Kalogerakis
Chuang Gan
Anh Tran
Cuong Pham
Khoi Duc Minh Nguyen
ISeg
396
93
0
17 Dec 2023
DETER: Detecting Edited Regions for Deterring Generative Manipulations
Sai Wang
Ye Zhu
Ruoyu Wang
Amaya Dharmasiri
Olga Russakovsky
Yu Wu
215
2
0
16 Dec 2023
Part Representation Learning with Teacher-Student Decoder for Occluded Person Re-identification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shang Gao
Chenyang Yu
Silong Yong
Huchuan Lu
289
7
0
15 Dec 2023
Collaborating Foundation Models for Domain Generalized Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2023
Yasser Benigmim
Subhankar Roy
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
355
31
0
15 Dec 2023
From-Ground-To-Objects: Coarse-to-Fine Self-supervised Monocular Depth Estimation of Dynamic Objects with Ground Contact Prior
Computer Vision and Pattern Recognition (CVPR), 2023
Jaeho Moon
J. P. Bello
Byeongjun Kwon
Munchurl Kim
203
14
0
15 Dec 2023
General Object Foundation Model for Images and Videos at Scale
Computer Vision and Pattern Recognition (CVPR), 2023
Junfeng Wu
Yi Jiang
Qihao Liu
Zehuan Yuan
Xiang Bai
Song Bai
VOS VLM
296
74
0
14 Dec 2023
Tokenize Anything via Prompting
European Conference on Computer Vision (ECCV), 2023
Ting Pan
Lulu Tang
Xinlong Wang
Shiguang Shan
VLM
211
35
0
14 Dec 2023
LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
Computer Vision and Pattern Recognition (CVPR), 2023
Yuhang Yang
Wei Zhai
Hongcheng Luo
Yang Cao
Zheng-Jun Zha
276
40
0
14 Dec 2023
TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Raghav Goyal
Wan-Cyuan Fan
Mennatullah Siam
Leonid Sigal
VOS
196
4
0
13 Dec 2023
SAM-guided Graph Cut for 3D Instance Segmentation
European Conference on Computer Vision (ECCV), 2023
Haoyu Guo
He Zhu
Sida Peng
Yuang Wang
Yujun Shen
Ruizhen Hu
Xiaowei Zhou
3DV
229
32
0
13 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Computer Vision and Pattern Recognition (CVPR), 2023
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM MLLM
279
33
0
13 Dec 2023
PnPNet: Pull-and-Push Networks for Volumetric Segmentation with Boundary Confusion
Xin You
Ming Ding
Minghui Zhang
Hanxiao Zhang
Yi Yu
Jie Yang
Yun Gu
912
5
0
13 Dec 2023
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-Resolution
AAAI Conference on Artificial Intelligence (AAAI), 2023
Qi Tang
Yao-Min Zhao
Meiqin Liu
Jian Jin
Chao Yao
203
9
0
13 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Computer Vision and Pattern Recognition (CVPR), 2023
Shuyang Sun
Runjia Li
Juil Sock
Xiuye Gu
Siyang Li
VLM CLIP
407
54
0
12 Dec 2023
Interfacing Foundation Models' Embeddings
Neural Information Processing Systems (NeurIPS), 2023
Xueyan Zou
Linjie Li
Jianfeng Wang
Jianwei Yang
Mingyu Ding
...
Hao Zhang
Shilong Liu
Arul Aravinthan
Yong Jae Lee
Lijuan Wang
50
1
0
12 Dec 2023
Toward Real Text Manipulation Detection: New Dataset and New Solution
Dongliang Luo
Yuliang Liu
Rui Yang
Xianjin Liu
Jishen Zeng
Yu Zhou
Xiang Bai
188
10
0
12 Dec 2023
Adaptive Human Trajectory Prediction via Latent Corridors
Neerja Thakkar
K. Mangalam
Andrea V. Bajcsy
Jitendra Malik
270
7
0
11 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
250
106
0
11 Dec 2023
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
317
2
0
11 Dec 2023
Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
Qi Yang
Xing Nie
Tong Li
Pengfei Gao
Ying Guo
Cheng Zhen
Pengfei Yan
Shiming Xiang
VOS
170
24
0
11 Dec 2023
NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos
Jinxi Li
Ziyang Song
Bo Yang
3DH
145
26
0
11 Dec 2023
U-MixFormer: UNet-like Transformer with Mix-Attention for Efficient Semantic Segmentation
Seul-Ki Yeom
Julian von Klitzing
ViT
136
19
0
11 Dec 2023
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Abdullah Rashwan
Jiageng Zhang
A. Taalimi
Fan Yang
Xingyi Zhou
Chaochao Yan
Liang-Chieh Chen
Yeqing Li
ViT
287
8
0
11 Dec 2023
OpenSD: Unified Open-Vocabulary Segmentation and Detection
Shuai Li
Ming-hui Li
Pengfei Wang
Lei Zhang
ObjD VLM
144
7
0
10 Dec 2023
EipFormer: Emphasizing Instance Positions in 3D Instance Segmentation
Mengnan Zhao
Lihe Zhang
Yuqiu Kong
Baocai Yin
218
1
0
09 Dec 2023
VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement
European Conference on Computer Vision (ECCV), 2023
Hanjung Kim
Jaehyun Kang
Miran Heo
Sukjun Hwang
Seoung Wug Oh
Seon Joo Kim
305
7
0
08 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Chen Ma
Ming-Ming Cheng
Ying Shan
DiffM
295
297
0
07 Dec 2023
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Zhixiang Wei
Lin Chen
Yi Jin
Xiaoxiao Ma
Tianle Liu
Pengyang Lin
Ben Wang
Huajun Chen
Jinjin Zheng
385
97
0
07 Dec 2023