Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.01527
Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,359 papers shown
Title
Scale-Invariant Monocular Depth Estimation via SSI Depth
S. M. H. Miangoleh
Mahesh Kumar Krishna Reddy
Yağız Aksoy
MDE
23
5
0
13 Jun 2024
RMem: Restricted Memory Banks Improve Video Object Segmentation
Junbao Zhou
Ziqi Pang
Yu-xiong Wang
VOS
55
7
0
12 Jun 2024
Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn
Christian Rupprecht
40
9
0
12 Jun 2024
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
Zhensong Xu
Jiangtao Yao
Chengjing Wu
Ting Liu
Luoqi Liu
18
1
0
12 Jun 2024
ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Anurag Ghosh
R. Tamburo
Shen Zheng
Juan R. Alvarez-Padilla
Hailiang Zhu
Michael Cardei
Nicholas Dunn
Christoph Mertz
Srinivasa G. Narasimhan
39
1
0
11 Jun 2024
CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation
Zhongzhen Huang
Yankai Jiang
Rongzhao Zhang
Shaoting Zhang
Xiaofan Zhang
MedIm
62
4
0
11 Jun 2024
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset
Shijie Lian
Ziyi Zhang
Hua Li
Wenjie Li
Laurence Tianruo Yang
Sam Kwong
Runmin Cong
VLM
18
12
0
10 Jun 2024
Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
Zhiyuan Cheng
Cheng Han
James Liang
Qifan Wang
Xiangyu Zhang
Dongfang Liu
AAML
32
4
0
09 Jun 2024
F-LMM: Grounding Frozen Large Multimodal Models
Size Wu
Sheng Jin
Wenwei Zhang
Lumin Xu
Wentao Liu
Wei Li
Chen Change Loy
MLLM
73
12
0
09 Jun 2024
ProMotion: Prototypes As Motion Learners
Yawen Lu
Dongfang Liu
Qifan Wang
Cheng Han
Yiming Cui
Zhiwen Cao
Xueling Zhang
Yingjie Victor Chen
Heng Fan
DiffM
38
2
0
07 Jun 2024
Semantic Segmentation on VSPW Dataset through Masked Video Consistency
Chen Liang
Qiang Guo
Chongkai Yu
Chengjing Wu
Ting Liu
Luoqi Liu
37
1
0
07 Jun 2024
3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation
Feiyu Pan
Hao Fang
Xiankai Lu
32
3
0
07 Jun 2024
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Yang Sui
Yanyu Li
Anil Kag
Yerlan Idelbayev
Junli Cao
Ju Hu
Dhritiman Sagar
Bo Yuan
Sergey Tulyakov
Jian Ren
MQ
39
18
0
06 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
29
22
0
06 Jun 2024
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu
Jifei Che
Han Li
Chengjing Wu
Ting Liu
Luoqi Liu
36
0
0
06 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Lu Yang
Pu Cao
Liulei Li
Huadong Ma
31
1
0
06 Jun 2024
Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy
Yunho Kim
Jeong Hyun Lee
Choongin Lee
Juhyeok Mun
D. Youm
Jeongsoo Park
Jemin Hwangbo
26
1
0
05 Jun 2024
P2PFormer: A Primitive-to-polygon Method for Regular Building Contour Extraction from Remote Sensing Images
Tao Zhang
Shiqing Wei
Yikang Zhou
M. Luo
Wenling You
Shunping Ji
19
1
0
05 Jun 2024
FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping
Yuzhou Ji
He Zhu
Junshu Tang
Wuyi Liu
Zhizhong Zhang
Yuan Xie
Xin Tan
31
8
0
04 Jun 2024
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
Mohamed El Amine Boudjoghra
Angela Dai
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
F. Khan
VLM
ISeg
73
6
0
04 Jun 2024
Segmentation-Free Guidance for Text-to-Image Diffusion Models
K. Azarian
Debasmit Das
Qiqi Hou
Fatih Porikli
VLM
46
0
0
03 Jun 2024
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Thanh-Dat Truong
Utsav Prabhu
Dongyi Wang
Bhiksha Raj
Susan Gauch
J. Subbiah
Khoa Luu
46
2
0
03 Jun 2024
MP-PolarMask: A Faster and Finer Instance Segmentation for Concave Images
Ke-Lei Wang
Pin-Hsuan Chou
Young-Ching Chou
Chia-Jen Liu
Cheng-Kuan Lin
Yu-Chee Tseng
29
0
0
03 Jun 2024
On the Nonlinearity of Layer Normalization
Yunhao Ni
Yuxin Guo
Junlong Jia
Lei Huang
39
4
0
03 Jun 2024
Object Aware Egocentric Online Action Detection
Joungbin An
Yunsu Park
Hyolim Kang
Seon Joo Kim
EgoV
29
0
0
03 Jun 2024
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Yunheng Li
Zhongyu Li
Quansheng Zeng
Qibin Hou
Ming-Ming Cheng
VLM
40
8
0
02 Jun 2024
Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation
Xinyue Chen
Miaojing Shi
31
0
0
01 Jun 2024
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Biao Wu
Diankai Zhang
Sihan Gao
Cheng-yong Zheng
Shaoli Liu
Ning Wang
27
0
0
01 Jun 2024
Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Tiancheng Shen
Jun Hao Liew
Long Mai
Lu Qi
Jiashi Feng
Jiaya Jia
DiffM
30
1
0
31 May 2024
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee
S. Hwang
Suha Kwak
19
2
0
31 May 2024
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
Selim Kuzucu
Kemal Oksuz
Jonathan Sadeghi
P. Dokania
31
4
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
73
6
0
30 May 2024
View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields
Haodi He
Colton Stearns
Adam W. Harley
Leonidas J. Guibas
3DV
27
2
0
30 May 2024
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation
Niclas Vodisch
Kürsat Petek
Markus Kappeler
Abhinav Valada
Wolfram Burgard
VLM
32
4
0
29 May 2024
BAISeg: Boundary Assisted Weakly Supervised Instance Segmentation
Tengbo Wang
Yu Bai
ISeg
42
0
0
27 May 2024
Memorize What Matters: Emergent Scene Decomposition from Multitraverse
Yiming Li
Zehong Wang
Yue Wang
Zhiding Yu
Zan Gojcic
Marco Pavone
Chen Feng
Jose M. Alvarez
3DGS
50
1
0
27 May 2024
HDC: Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation
Zhuoyan Luo
Yinghao Wu
Yong-Jin Liu
Yicheng Xiao
Xiao-Ping Zhang
Yujiu Yang
30
0
0
24 May 2024
U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation
Bingyu Li
Da Zhang
Zhiyuan Zhao
Junyu Gao
Xuelong Li
28
5
0
24 May 2024
Synergistic Global-space Camera and Human Reconstruction from Videos
Yizhou Zhao
Tuanfeng Y. Wang
Bhiksha Raj
Min Xu
Jimei Yang
Chun-Hao Paul Huang
3DGS
3DH
38
1
0
23 May 2024
Efficient Robot Learning for Perception and Mapping
Niclas Vodisch
SSL
32
0
0
23 May 2024
RoGS: Large Scale Road Surface Reconstruction based on 2D Gaussian Splatting
Zhiheng Feng
Wenhua Wu
Hesheng Wang
3DGS
40
0
0
23 May 2024
Tuning-free Universally-Supervised Semantic Segmentation
Xiaobo Yang
Xiaojin Gong
VLM
42
1
0
23 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Katherine Xu
Lingzhi Zhang
Jianbo Shi
41
12
0
23 May 2024
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
Yuhang Yang
Wei Zhai
Chengfeng Wang
Chengjun Yu
Yang Cao
Zheng-jun Zha
40
5
0
22 May 2024
Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
Dingwen Zhang
Hao Li
Diqi He
Nian Liu
Lechao Cheng
Jingdong Wang
Junwei Han
VLM
35
0
0
22 May 2024
Influence of Water Droplet Contamination for Transparency Segmentation
Volker Knauthe
Paul Weitz
Thomas Pollabauer
Tristan Wirth
Arne Rak
Arjan Kuijper
Dieter W. Fellner
28
1
0
21 May 2024
DLAFormer: An End-to-End Transformer For Document Layout Analysis
Jiawei Wang
Kai Hu
Qiang Huo
3DV
ViT
25
3
0
20 May 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
40
21
0
19 May 2024
HARIS: Human-Like Attention for Reference Image Segmentation
Mengxi Zhang
Heqing Lian
Yiming Liu
Jie Chen
VLM
21
0
0
17 May 2024
NeRO: Neural Road Surface Reconstruction
Ruibo Wang
Song Zhang
Ping Huang
Donghai Zhang
Haoyu Chen
3DV
27
1
0
17 May 2024
Previous
1
2
3
...
9
10
11
...
26
27
28
Next