ResearchTrend.AI
Masked-attention Mask Transformer for Universal Image Segmentation (arXiv:2112.01527)

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,359 papers shown
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
Tong Shao
Zhuotao Tian
Hang Zhao
Jingyong Su
VLM
34
14
0
11 Jul 2024
Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets
Qin Lei
Jiang Zhong
Qizhu Dai
DiffM
37
2
0
11 Jul 2024
ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation
Ruijie Zhu
Chuxin Wang
Ziyang Song
Li Liu
Tianzhu Zhang
Yongdong Zhang
MDE
37
6
0
11 Jul 2024
Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object Search
Kirill Paramonov
Jia-Xing Zhong
Umberto Michieli
J. Moon
Mete Ozay
42
2
0
10 Jul 2024
Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation
Hao Fang
Peng Wu
Yawei Li
Xinxin Zhang
Xiankai Lu
VLM
27
6
0
10 Jul 2024
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Y. Wang
Huchuan Lu
Ming Yang
VOS
49
4
0
10 Jul 2024
Visual-Geometry GP-based Navigable Space for Autonomous Navigation
Mahmoud Ali
Durgkant Pushp
Zheng Chen
Lantao Liu
39
0
0
09 Jul 2024
General and Task-Oriented Video Segmentation
Mu Chen
Liulei Li
Wenguan Wang
Ruijie Quan
Yi Yang
VOS
48
4
0
09 Jul 2024
Anatomy-guided Pathology Segmentation
A. Jaus
C. Seibold
Simon Reiß
Lukas Heine
Anton Schily
Moon Kim
F. Bahnsen
Ken Herrmann
Rainer Stiefelhagen
Jens Kleesiek
MedIm
26
2
0
08 Jul 2024
CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation
Yuejiao Su
Yi Wang
Lap-Pui Chau
60
1
0
08 Jul 2024
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Jianwen Jiang
Gaojie Lin
Zhengkun Rong
Chao Liang
Yongming Zhu
Jiaqi Yang
Tianyun Zhong
3DH
80
8
0
08 Jul 2024
CPM: Class-conditional Prompting Machine for Audio-visual Segmentation
Yuanhong Chen
Chong Wang
Yuyuan Liu
Hu Wang
Gustavo Carneiro
40
2
0
07 Jul 2024
SAM Fewshot Finetuning for Anatomical Segmentation in Medical Images
Weiyi Xie
Nathalie Willems
Shubham Patil
Yang Li
Mayank Kumar
54
13
0
05 Jul 2024
For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives
Lia Morra
A. Santangelo
Pietro Basci
Luca Piano
Fabio Garcea
Fabrizio Lamberti
Massimo Leone
41
1
0
03 Jul 2024
Context-Aware Video Instance Segmentation
Seunghun Lee
Jiwan Seo
Kiljoon Han
Minwoo Choi
S. Im
VOS
27
0
0
03 Jul 2024
AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction
Mustafa Khan
H. Fazlali
Dhruv Sharma
Tongtong Cao
Dongfeng Bai
Y. Ren
Bingbing Liu
3DGS
33
17
0
02 Jul 2024
Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts
Pasquale De Marinis
Nicola Fanelli
Raffaele Scaringi
Emanuele Colonna
Giuseppe Fiameni
G. Vessio
Giovanna Castellano
MLLM
VLM
24
2
0
02 Jul 2024
A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling
Minghao Zhou
Hong Wang
Yefeng Zheng
Deyu Meng
24
1
0
02 Jul 2024
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
31
0
0
01 Jul 2024
AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction
Dubing Chen
Wencheng Han
Jin Fang
Jianbing Shen
27
0
0
01 Jul 2024
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction
Xuan Yu
Yili Liu
Chenrui Han
Sitong Mao
Shunbo Zhou
R. Xiong
Yiyi Liao
Yue Wang
ISeg
44
2
0
01 Jul 2024
Robot Instance Segmentation with Few Annotations for Grasping
Moshe Kimhi
David Vainshtein
Chaim Baskin
Dotan Di Castro
51
2
0
01 Jul 2024
Toward a Diffusion-Based Generalist for Dense Vision Tasks
Yue Fan
Yongqin Xian
Xiaohua Zhai
Alexander Kolesnikov
Muhammad Ferjad Naeem
Bernt Schiele
Federico Tombari
VLM
MDE
DiffM
40
1
0
29 Jun 2024
Segment Anything without Supervision
Xudong Wang
Jingfeng Yang
Trevor Darrell
VLM
35
10
0
28 Jun 2024
Fine-tuning of Geospatial Foundation Models for Aboveground Biomass Estimation
Michal Muszynski
Levente Klein
Ademir Ferreira da Silva
Anjani Prasad Atluri
Carlos Gomes
...
Shraddha Singh
Steve Meliksetian
Campbell Watson
Daiki Kimura
Harini Srinivasan
33
3
0
28 Jun 2024
PM-VIS+: High-Performance Video Instance Segmentation without Video Annotation
Zhangjing Yang
Dun Liu
Xin Wang
Zhe Li
Barathwaj S. Anandan
Yi Wu
VLM
VOS
36
0
0
28 Jun 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang
Xiangtai Li
Hao Fei
Haobo Yuan
Shengqiong Wu
Shunping Ji
Chen Change Loy
Shuicheng Yan
LRM
MLLM
VLM
47
48
0
27 Jun 2024
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Haobo Yuan
Xiangtai Li
Lu Qi
Tao Zhang
Ming Yang
Shuicheng Yan
Chen Change Loy
VLM
32
10
0
27 Jun 2024
VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
J. C. V. Gemert
VLM
42
0
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
58
22
0
26 Jun 2024
Depth-Guided Semi-Supervised Instance Segmentation
Xin Chen
Jie Hu
Xiawu Zheng
Jianghang Lin
Liujuan Cao
Rongrong Ji
ISeg
3DV
32
1
0
25 Jun 2024
GMT: Guided Mask Transformer for Leaf Instance Segmentation
Feng Chen
Sotirios A. Tsaftaris
M. Giuffrida
20
1
0
24 Jun 2024
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Henghui Ding
Chang Liu
Yunchao Wei
Nikhila Ravi
Shuting He
...
Bo-Lu Zhao
Jing Liu
Feiyu Pan
Hao Fang
Xiankai Lu
48
8
0
24 Jun 2024
LOGCAN++: Adaptive Local-global class-aware network for semantic segmentation of remote sensing imagery
Xiaowen Ma
Rongrong Lian
Zhenkai Wu
Hongbo Guo
Mengting Ma
Sensen Wu
Zhenhong Du
Siyang Song
Wei Zhang
39
4
0
24 Jun 2024
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation
Thomas Stegmüller
Tim Lebailly
Nikola Dukic
Behzad Bozorgtabar
Tinne Tuytelaars
Jean-Philippe Thiran
VLM
31
1
0
23 Jun 2024
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
Oluwatosin O. Alabi
K. Toe
Zijian Zhou
C. Budd
Nicholas Raison
Miaojing Shi
Tom Kamiel Magda Vercauteren
ISeg
62
1
0
23 Jun 2024
Rethinking Remote Sensing Change Detection With A Mask View
Xiaowen Ma
Zhenkai Wu
Rongrong Lian
Wei Zhang
Siyang Song
29
3
0
21 Jun 2024
TraceNet: Segment one thing efficiently
Mingyuan Wu
Zichuan Liu
Haozhen Zheng
Hongpeng Guo
Bo Chen
Xin Lu
Klara Nahrstedt
31
0
0
21 Jun 2024
Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation
Guoyu Yang
Yuan Wang
Daming Shi
SSeg
36
1
0
18 Jun 2024
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
Jiho Choi
Seonho Lee
Seungho Lee
Minhyun Lee
Hyunjung Shim
OCL
35
0
0
17 Jun 2024
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation
Bingfeng Zhang
Siyue Yu
Yunchao Wei
Yao Zhao
Jimin Xiao
VLM
33
8
0
17 Jun 2024
PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Libo Wang
Dongxu Li
Sijun Dong
Xiaoliang Meng
Xiaokang Zhang
Danfeng Hong
24
5
0
16 Jun 2024
MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception
M. M. Rahman
Ryoma Yataka
Sorachi Kato
P. Wang
Peizhao Li
Adriano Cardace
P. Boufounos
26
4
0
15 Jun 2024
Panoptic-FlashOcc: An Efficient Baseline to Marry Semantic Occupancy with Panoptic via Instance Center
Zichen Yu
Changyong Shu
Qianpu Sun
Junjie Linghu
Xiaobao Wei
Jiangyong Yu
Zongdai Liu
Dawei Yang
Hui Li
Yan Chen
31
5
0
15 Jun 2024
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
27
0
0
14 Jun 2024
Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors
Chaeyeon Han
Pavan Seshadri
Yiwei Ding
Noah Posner
B. Koo
Animesh Agrawal
Alexander Lerch
S. Guhathakurta
19
2
0
14 Jun 2024
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers
Narges Norouzi
Svetlana Orlova
Daan de Geus
Gijs Dubbelman
ViT
FedML
46
3
0
14 Jun 2024
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
Xiangheng Shan
Dongyue Wu
Guilin Zhu
Yuanjie Shao
Nong Sang
Changxin Gao
VLM
29
15
0
14 Jun 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
59
323
0
13 Jun 2024
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
Roman Bachmann
Oğuzhan Fatih Kar
David Mizrahi
Ali Garjani
Mingfei Gao
David Griffiths
Jiaming Hu
Afshin Dehghan
Amir Zamir
MoE
VLM
MLLM
36
14
0
13 Jun 2024