Masked-attention Mask Transformer for Universal Image Segmentation


2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,359 papers shown
RoadFormer+: Delivering RGB-X Scene Parsing through Scale-Aware Information Decoupling and Advanced Heterogeneous Feature Fusion
Jianxin Huang
Jiahang Li
Ning Jia
Yuxiang Sun
Chengju Liu
Qijun Chen
Rui Fan
ViT
46
8
0
31 Jul 2024
CAMAv2: A Vision-Centric Approach for Static Map Element Annotation
Shiyuan Chen
Jiaxin Zhang
Ruohong Mei
Yingfeng Cai
Haoran Yin
Tao Chen
Wei Sui
Cong Yang
31
0
0
31 Jul 2024
NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding
Hongjia Zhai
Gan Huang
Qirui Hu
Guanglin Li
Hujun Bao
Guofeng Zhang
3DGS
38
12
0
30 Jul 2024
Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets
Muhammad Abdullah Jamal
Omid Mohareri
31
1
0
29 Jul 2024
MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllability and Generalizability
Buyu Liu
Kai Wang
Yansong Liu
Jun Bao
Tingting Han
Jun Yu
DiffM
24
3
0
28 Jul 2024
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding
Zhen Chen
Zongmin Zhang
Wenwu Guo
Xingjian Luo
Long Bai
Jinlin Wu
Hongliang Ren
Hongbin Liu
41
5
0
28 Jul 2024
Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach
Penghui Wen
Kun Hu
Dong Yuan
Zhiyuan Ning
ChangYang Li
Zhiyong Wang
31
0
0
27 Jul 2024
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
Zhijian Liu
Zhuoyang Zhang
Samir Khaki
Shang Yang
Haotian Tang
Chenfeng Xu
Kurt Keutzer
Song Han
SSeg
44
1
0
26 Jul 2024
A Survey on Cell Nuclei Instance Segmentation and Classification: Leveraging Context and Attention
João D. Nunes
D. Montezuma
Domingos Oliveira
Tania Pereira
Jaime S. Cardoso
47
1
0
26 Jul 2024
Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation
Jingjun Yi
Qi Bi
Hao Zheng
Haolan Zhan
Wei Ji
Yawen Huang
Yuexiang Li
Yefeng Zheng
34
8
0
26 Jul 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
28
5
0
26 Jul 2024
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
Shuting He
Henghui Ding
44
10
0
25 Jul 2024
DINOv2 Rocks Geological Image Analysis: Classification, Segmentation, and Interpretability
Florent Brondolo
Samuel Beaussant
AI4CE
24
0
0
25 Jul 2024
CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Xiao Liu
Peng Gao
Tao Yu
Fei-Yue Wang
Ruyue Yuan
MedIm
ViT
23
14
0
25 Jul 2024
TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework
Guanfeng Tang
Zhiyuan Wu
Jiahang Li
Ping Zhong
Xieyuanli Chen
Huiming Liu
Rui Fan
33
0
0
25 Jul 2024
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
28
3
0
24 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
31
1
0
23 Jul 2024
DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene
Xi Shi
Lingli Chen
Peng Wei
Xi Wu
Tian Jiang
Yonggang Luo
Lecheng Xie
3DGS
30
4
0
23 Jul 2024
Strike a Balance in Continual Panoptic Segmentation
Jinpeng Chen
Runmin Cong
Yuxuan Luo
H. Ip
Sam Kwong
38
4
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
58
1
0
23 Jul 2024
Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond
Silvio Galesso
Philipp Schroppel
Hssan Driss
Thomas Brox
23
2
0
22 Jul 2024
RoadPainter: Points Are Ideal Navigators for Topology transformER
Zhongxing Ma
Shuang Liang
Yongkun Wen
Weixin Lu
Guowei Wan
ViT
3DPC
31
5
0
22 Jul 2024
Advancing Chart Question Answering with Robust Chart Component Recognition
Hanwen Zheng
Sijia Wang
Chris Thomas
Lifu Huang
30
1
0
19 Jul 2024
Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation
Zhengyuan Xie
Haiquan Lu
Jia-Wen Xiao
Enguang Wang
Le Zhang
Xialei Liu
CLL
24
2
0
19 Jul 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
54
4
0
18 Jul 2024
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
Sehwan Choi
Jungho Kim
Hongjae Shin
Jungwook Choi
3DPC
44
7
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
38
2
0
18 Jul 2024
LIDIA: Precise Liver Tumor Diagnosis on Multi-Phase Contrast-Enhanced CT via Iterative Fusion and Asymmetric Contrastive Learning
Wei Huang
Wei Liu
Xiaoming Zhang
Xiaoli Yin
Xu Han
...
Yu Shi
Le Lu
Ling Zhang
Lei Zhang
Ke Yan
19
0
0
18 Jul 2024
Tree semantic segmentation from aerial image time series
Venkatesh Ramesh
Arthur Ouaknine
David Rolnick
26
0
0
18 Jul 2024
GroupMamba: Efficient Group-Based Visual State Space Model
Abdelrahman M. Shaker
Syed Talal Wasim
Salman Khan
Juergen Gall
Fahad Shahbaz Khan
Mamba
51
0
0
18 Jul 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
52
2
0
18 Jul 2024
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation
Hyun Seok Seong
WonJun Moon
Subeen Lee
Jae-Pil Heo
40
0
0
17 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
73
0
0
17 Jul 2024
Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation
Juncheng Ma
Peiwen Sun
Yaoting Wang
Di Hu
VOS
46
7
0
16 Jul 2024
OAM-TCD: A globally diverse dataset of high-resolution tree cover maps
Josh Veitch-Michaelis
Andrew Cottam
Daniella Schweizer
Eben N. Broadbent
David Dao
Ce Zhang
Angélica María Almeyda Zambrano
Simeon Max
31
1
0
16 Jul 2024
SGIFormer: Semantic-guided and Geometric-enhanced Interleaving Transformer for 3D Instance Segmentation
Lei Yao
Yi Wang
Moyun Liu
Lap-Pui Chau
31
0
0
16 Jul 2024
Cross-Phase Mutual Learning Framework for Pulmonary Embolism Identification on Non-Contrast CT Scans
Bizhe Bai
Yan-Jie Zhou
Yujian Hu
Tony C. W. Mok
Yi-lang Xiang
Le Lu
Hongkun Zhang
Minfeng Xu
26
0
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
29
3
0
16 Jul 2024
SegSTRONG-C: Segmenting Surgical Tools Robustly On Non-adversarial Generated Corruptions -- An EndoVis'24 Challenge
Hao Ding
Tuxun Lu
Yuqian Zhang
Ruixing Liang
Hongchao Shu
...
Bo Wang
Marcos Fernández-Rodríguez
Estevao Lima
João L. Vilaça
Mathias Unberath
55
4
0
16 Jul 2024
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
Zijian Zhou
Zheng Zhu
Holger Caesar
Miaojing Shi
VLM
27
2
0
15 Jul 2024
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes
Yaoting Wang
Peiwen Sun
Dongzhan Zhou
Guangyao Li
Honggang Zhang
Di Hu
VOS
38
5
0
15 Jul 2024
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Yaoting Wang
Peiwen Sun
Yuanchao Li
Honggang Zhang
Di Hu
38
5
0
15 Jul 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
40
0
0
15 Jul 2024
PolyRoom: Room-aware Transformer for Floorplan Reconstruction
Yuzhou Liu
Lingjie Zhu
Xiaodong Ma
Hanqiao Ye
Xiang Gao
Xianwei Zheng
Shuhan Shen
23
1
0
15 Jul 2024
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation
Anqi Zhang
Guangyu Gao
CLL
VLM
33
4
0
13 Jul 2024
A Fair Ranking and New Model for Panoptic Scene Graph Generation
Julian Lorenz
Alexander Pest
Daniel Kienzle
K. Ludwig
Rainer Lienhart
41
1
0
12 Jul 2024
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation
Hanrong Shi
Lin Li
Jun Xiao
Yueting Zhuang
Long Chen
27
2
0
12 Jul 2024
Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation
Wei Cong
Yang Cong
Yuyang Liu
Gan Sun
VLM
CLL
34
2
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
42
3
0
12 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
30
5
0
12 Jul 2024