ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
Title
Instance Neural Radiance Field
Instance Neural Radiance FieldIEEE International Conference on Computer Vision (ICCV), 2023
Yichen Liu
Benran Hu
Junkai Huang
Yu-Wing Tai
Chi-Keung Tang
ISeg
347
43
0
10 Apr 2023
Semantic Human Parsing via Scalable Semantic Transfer over Multiple
  Label Domains
Semantic Human Parsing via Scalable Semantic Transfer over Multiple Label DomainsComputer Vision and Pattern Recognition (CVPR), 2023
Jie Yang
Chaoqun Wang
Zhen Li
Junle Wang
Ruimao Zhang
111
19
0
09 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature
  Review
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
270
51
0
07 Apr 2023
SegGPT: Segmenting Everything In Context
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOSMLLMVLM
146
242
0
06 Apr 2023
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot
  Keypoint Detection
From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection
Changsheng Lu
Hao Zhu
Piotr Koniusz
185
14
0
06 Apr 2023
Segment Anything
Segment AnythingIEEE International Conference on Computer Vision (ICCV), 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLMVLM
842
10,757
0
05 Apr 2023
Uncertainty estimation in Deep Learning for Panoptic segmentation
Uncertainty estimation in Deep Learning for Panoptic segmentation
Michael J. Smith
F. Ferrie
OODUQCV
162
0
0
04 Apr 2023
Video Instance Segmentation in an Open-World
Video Instance Segmentation in an Open-WorldInternational Journal of Computer Vision (IJCV), 2023
Omkar Thawakar
Sanath Narayan
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Jorma T. Laaksonen
M. Shah
Fahad Shahbaz Khan
VLM
143
6
0
03 Apr 2023
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass NetworkIEEE International Conference on Computer Vision (ICCV), 2023
Cong Han
Yujie Zhong
Dengjie Li
Kai Han
Lin Ma
VLMSSeg
219
42
0
03 Apr 2023
RegionPLC: Regional Point-Language Contrastive Learning for Open-World
  3D Scene Understanding
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023
Jihan Yang
Runyu Ding
Weipeng Deng
Zhe Wang
Xiaojuan Qi
268
97
0
03 Apr 2023
Devil is in the Queries: Advancing Mask Transformers for Real-world
  Medical Image Segmentation and Out-of-Distribution Localization
Devil is in the Queries: Advancing Mask Transformers for Real-world Medical Image Segmentation and Out-of-Distribution LocalizationComputer Vision and Pattern Recognition (CVPR), 2023
Mingze Yuan
Yingda Xia
Hexin Dong
Zi Chen
Jiawen Yao
...
Bin Dong
Jing Zhou
Le Lu
Ling Zhang
Li Zhang
OODMedIm
148
25
0
01 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution
  Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision TransformerComputer Vision and Pattern Recognition (CVPR), 2023
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
315
68
0
30 Mar 2023
MobileInst: Video Instance Segmentation on the Mobile
MobileInst: Video Instance Segmentation on the MobileAAAI Conference on Artificial Intelligence (AAAI), 2023
Renhong Zhang
Tianheng Cheng
Shusheng Yang
Hao Jiang
Shuai Zhang
...
Xin Li
Xiaowen Ying
Dashan Gao
Wenyu Liu
Xinggang Wang
181
10
0
30 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
DDP: Diffusion Model for Dense Visual PredictionIEEE International Conference on Computer Vision (ICCV), 2023
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffMVLM
259
194
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image EditorComputer Vision and Pattern Recognition (CVPR), 2023
Vidit Goel
E. Peruzzo
Lezhi Li
Dejia Xu
Xingqian Xu
Andrii Zadaianchuk
Trevor Darrell
Zinan Lin
Humphrey Shi
DiffM
204
17
0
30 Mar 2023
Complementary Random Masking for RGB-Thermal Semantic Segmentation
Complementary Random Masking for RGB-Thermal Semantic SegmentationIEEE International Conference on Robotics and Automation (ICRA), 2023
Ukcheol Shin
Kyunghyun Lee
In So Kweon
Jean Oh
124
37
0
30 Mar 2023
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation
FreeSeg: Unified, Universal and Open-Vocabulary Image SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Jie Qin
Jie Wu
Pengxiang Yan
Ming Li
Ren Yuxi
...
Yitong Wang
Rui Wang
Shilei Wen
X. Pan
Xingang Wang
SSegVLM
243
120
0
30 Mar 2023
Masked and Adaptive Transformer for Exemplar Based Image Translation
Masked and Adaptive Transformer for Exemplar Based Image TranslationComputer Vision and Pattern Recognition (CVPR), 2023
Changlong Jiang
Fei Gao
Biao Ma
Yuhao Lin
N. Wang
Gang Xu
161
21
0
30 Mar 2023
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval
If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval
Finlay G. C. Hudson
W. Smith
ViT
316
2
0
30 Mar 2023
Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed
  Video
Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed VideoComputer Vision and Pattern Recognition (CVPR), 2023
Wenzheng Zeng
Yang Xiao
Sicheng Wei
Jinfang Gan
Xintao Zhang
Z. Cao
Zhiwen Fang
Qiufeng Wang
CVBM
170
12
0
28 Mar 2023
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic
  Scene Graph Generation
HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Zijian Zhou
Miaojing Shi
Holger Caesar
217
27
0
28 Mar 2023
Mask-Free Video Instance Segmentation
Mask-Free Video Instance SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Lei Ke
Martin Danelljan
Henghui Ding
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
251
29
0
28 Mar 2023
OpenInst: A Simple Query-Based Method for Open-World Instance
  Segmentation
OpenInst: A Simple Query-Based Method for Open-World Instance SegmentationPattern Recognition (Pattern Recogn.), 2023
Cheng Wang
Guoli Wang
Qian Zhang
Pengning Guo
Wenyu Liu
Xinggang Wang
ISegVLM
157
8
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based
  Real-time Mobile Vision Applications
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision ApplicationsIEEE International Conference on Computer Vision (ICCV), 2023
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
309
164
0
27 Mar 2023
You Only Segment Once: Towards Real-Time Panoptic Segmentation
You Only Segment Once: Towards Real-Time Panoptic SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Jie Hu
Linyan Huang
Tianhe Ren
Shengchuan Zhang
Rongrong Ji
Liujuan Cao
SSeg
182
72
0
26 Mar 2023
Affordance Grounding from Demonstration Video to Target Image
Affordance Grounding from Demonstration Video to Target ImageComputer Vision and Pattern Recognition (CVPR), 2023
Joya Chen
Difei Gao
Kevin Qinghong Lin
Mike Zheng Shou
154
43
0
26 Mar 2023
BoxVIS: Video Instance Segmentation with Box Annotations
BoxVIS: Video Instance Segmentation with Box Annotations
Minghan Li
Lei Zhang
ISegVOS
256
1
0
26 Mar 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded
  Instances on Challenging Videos
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging VideosComputer Vision and Pattern Recognition (CVPR), 2023
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
213
12
0
25 Mar 2023
OPDMulti: Openable Part Detection for Multiple Objects
OPDMulti: Openable Part Detection for Multiple ObjectsInternational Conference on 3D Vision (3DV), 2023
Xiaohao Sun
Hanxiao Jiang
Manolis Savva
Angel X. Chang
AI4CE
228
33
0
24 Mar 2023
Query-Dependent Video Representation for Moment Retrieval and Highlight
  Detection
Query-Dependent Video Representation for Moment Retrieval and Highlight DetectionComputer Vision and Pattern Recognition (CVPR), 2023
WonJun Moon
Sangeek Hyun
S. Park
Dongchan Park
Jae-Pil Heo
ViT
208
178
0
24 Mar 2023
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative
  Local-Flow Global-Parsing Learning
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing LearningComputer Vision and Pattern Recognition (CVPR), 2023
Zhenyu Xie
Zaiyu Huang
Xin Dong
Fuwei Zhao
Haoye Dong
Xijin Zhang
Feida Zhu
Xiaodan Liang
3DH
175
143
0
24 Mar 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient
  Vision Transformers
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision TransformersComputer Vision and Pattern Recognition (CVPR), 2023
Cong Wei
Brendan Duke
R. Jiang
P. Aarabi
Graham W. Taylor
Florian Shkurti
ViT
173
21
0
24 Mar 2023
Category Query Learning for Human-Object Interaction Classification
Category Query Learning for Human-Object Interaction ClassificationComputer Vision and Pattern Recognition (CVPR), 2023
Chi Xie
Fangao Zeng
Yue Hu
Shuang Liang
Yichen Wei
VLM
156
41
0
24 Mar 2023
Position-Guided Point Cloud Panoptic Segmentation Transformer
Position-Guided Point Cloud Panoptic Segmentation TransformerInternational Journal of Computer Vision (IJCV), 2023
Zeqi Xiao
Wenwei Zhang
Tai Wang
Chen Change Loy
Dahua Lin
Jiangmiao Pang
ViT3DPC
114
24
0
23 Mar 2023
Zero-guidance Segmentation Using Zero Segment Labels
Zero-guidance Segmentation Using Zero Segment LabelsIEEE International Conference on Computer Vision (ICCV), 2023
Pitchaporn Rewatbowornwong
Nattanat Chatthee
Ekapol Chuangsuwanich
Supasorn Suwajanakorn
VLM
127
17
0
23 Mar 2023
Tube-Link: A Flexible Cross Tube Framework for Universal Video
  Segmentation
Tube-Link: A Flexible Cross Tube Framework for Universal Video SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Xiangtai Li
Haobo Yuan
Wenwei Zhang
Guangliang Cheng
Jiangmiao Pang
Chen Change Loy
ViTVOS
163
28
0
22 Mar 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic
  Segmentation Using Diffusion Models
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Weijia Wu
Yuzhong Zhao
Mike Zheng Shou
Hong Zhou
Chunhua Shen
455
178
0
21 Mar 2023
BoxSnake: Polygonal Instance Segmentation with Box Supervision
BoxSnake: Polygonal Instance Segmentation with Box SupervisionIEEE International Conference on Computer Vision (ICCV), 2023
Rui Yang
Lin Song
Yixiao Ge
Xiu Li
ISeg
266
32
0
21 Mar 2023
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images
Active Coarse-to-Fine Segmentation of Moveable Parts from Real ImagesEuropean Conference on Computer Vision (ECCV), 2023
Ruiqi Wang
A. Patil
Fenggen Yu
Hao Zhang
217
5
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
EVA-02: A Visual Representation for Neon GenesisImage and Vision Computing (IVC), 2023
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLMViTCLIP
352
390
0
20 Mar 2023
Open-vocabulary Panoptic Segmentation with Embedding Modulation
Open-vocabulary Panoptic Segmentation with Embedding ModulationIEEE International Conference on Computer Vision (ICCV), 2023
Xi Chen
Shuang Li
Ser-Nam Lim
Antonio Torralba
Hengshuang Zhao
VLM
155
38
0
20 Mar 2023
Generative Semantic Segmentation
Generative Semantic SegmentationComputer Vision and Pattern Recognition (CVPR), 2023
Jia-Qing Chen
Jiachen Lu
Xiatian Zhu
Li Zhang
GANISegVLM
167
57
0
20 Mar 2023
Reliability in Semantic Segmentation: Are We on the Right Track?
Reliability in Semantic Segmentation: Are We on the Right Track?Computer Vision and Pattern Recognition (CVPR), 2023
Pau de Jorge
Riccardo Volpi
Juil Sock
Grégory Rogez
UQCV
125
26
0
20 Mar 2023
Neural Refinement for Absolute Pose Regression with Feature Synthesis
Neural Refinement for Absolute Pose Regression with Feature SynthesisComputer Vision and Pattern Recognition (CVPR), 2023
Shuai Chen
Brandon Smart
Xinghui Li
Jiawang Bian
Kejie Li
Zirui Wang
V. Prisacariu
235
26
0
17 Mar 2023
LERF: Language Embedded Radiance Fields
LERF: Language Embedded Radiance FieldsIEEE International Conference on Computer Vision (ICCV), 2023
Justin Kerr
Chung Min Kim
Ken Goldberg
Angjoo Kanazawa
Matthew Tancik
247
523
0
16 Mar 2023
MATIS: Masked-Attention Transformers for Surgical Instrument
  Segmentation
MATIS: Masked-Attention Transformers for Surgical Instrument SegmentationIEEE International Symposium on Biomedical Imaging (ISBI), 2023
Nicolás Ayobi
Alejandra Pérez-Rondón
Santiago Rodríguez
Pablo Arbelaez
MedIm
265
34
0
16 Mar 2023
Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers
Unifying Top-down and Bottom-up Scanpath Prediction Using TransformersComputer Vision and Pattern Recognition (CVPR), 2023
Zhibo Yang
Sounak Mondal
Seoyoung Ahn
Ruoyu Xue
G. Zelinsky
Minh Hoai
Dimitris Samaras
210
22
0
16 Mar 2023
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
Global Knowledge Calibration for Fast Open-Vocabulary SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Kunyang Han
Yong-Jin Liu
Jun Hao Liew
Henghui Ding
Yunchao Wei
...
Yitong Wang
Yansong Tang
Yujiu Yang
Jiashi Feng
Yao-Min Zhao
VLM
205
47
0
16 Mar 2023
RSFNet: A White-Box Image Retouching Approach using Region-Specific
  Color Filters
RSFNet: A White-Box Image Retouching Approach using Region-Specific Color FiltersIEEE International Conference on Computer Vision (ICCV), 2023
Wenqi Ouyang
Yi Dong
Xiaoyang Kang
Peiran Ren
Xin Xu
Xuansong Xie
225
15
0
15 Mar 2023
High-level Feature Guided Decoding for Semantic Segmentation
High-level Feature Guided Decoding for Semantic Segmentation
Ye Huang
Di Kang
Shenghua Gao
Wen Li
Lixin Duan
278
4
0
15 Mar 2023
Previous
123...282930...323334
Next