Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,661 papers shown
Title
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Xing Mei
Guanbin Li
232
2
0
08 Apr 2024
NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization
Peng Tu
Xun Zhou
Mingming Wang
Xiaojun Yang
Bo Peng
Ping Chen
Xiu Su
Yawen Huang
Yefeng Zheng
Chang Xu
217
3
0
07 Apr 2024
Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer
Hyeongjin Nam
Daniel Sungho Jung
Gyeongsik Moon
Kyoung Mu Lee
3DH
197
19
0
07 Apr 2024
Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation
Danpei Zhao
Bo Yuan
Ziqiang Chen
Tian Li
Zhuoran Liu
Wentao Li
Yue Gao
305
15
0
06 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
240
3
0
06 Apr 2024
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector
Junbo Li
Keyan Chen
Gengju Tian
Lu Li
Z. Shi
219
2
0
05 Apr 2024
Deep Learning for Satellite Image Time Series Analysis: A Review
Lynn Miller
Charlotte Pelletier
Geoffrey I. Webb
266
47
0
05 Apr 2024
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Shuting He
Henghui Ding
VOS
246
63
0
04 Apr 2024
HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion
Jiahang Li
Peng Yun
Qijun Chen
Rui Fan
229
12
0
04 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
European Conference on Computer Vision (ECCV), 2024
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
326
2
0
04 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Computer Vision and Pattern Recognition (CVPR), 2024
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
247
6
0
02 Apr 2024
What is Point Supervision Worth in Video Instance Segmentation?
Shuaiyi Huang
De-An Huang
Zhiding Yu
Shiyi Lan
Subhashree Radhakrishnan
Jose M. Alvarez
Abhinav Shrivastava
A. Anandkumar
VOS
316
6
0
01 Apr 2024
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation
Beomyoung Kim
Donghyeon Kim
Sung Ju Hwang
516
0
0
01 Apr 2024
MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction
Xiaolu Liu
Song Wang
Wentong Li
Ruizi Yang
Junbo Chen
Jianke Zhu
191
44
0
01 Apr 2024
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
DongDong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
230
22
0
31 Mar 2024
SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs
Yang Miao
Francis Engelmann
Olga Vysotska
Federico Tombari
Marc Pollefeys
Daniel Barath
3DPC
357
17
0
30 Mar 2024
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sang-Kee Jo
Fei Pan
In-Jae Yu
Kyungsu Kim
476
4
0
30 Mar 2024
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
Yuan Wang
Rui Sun
Naisong Luo
Yuwen Pan
Tianzhu Zhang
VLM
162
24
0
30 Mar 2024
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
Beomyoung Kim
Joonsang Yu
Sung Ju Hwang
VLM
CLL
243
26
0
29 Mar 2024
Mixed-precision Supernet Training from Vision Foundation Models using Low Rank Adapter
Yuiko Sakuma
Masakazu Yoshimura
Junji Otsuka
Atsushi Irie
Takeshi Ohashi
MQ
260
0
0
29 Mar 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
Amirhossein Kazerouni
Ilker Hacihaliloglu
Dorit Merhof
258
14
0
28 Mar 2024
Efficient 3D Instance Mapping and Localization with Neural Fields
George Tang
Krishna Murthy Jatavallabhula
Antonio Torralba
ISeg
281
5
0
28 Mar 2024
GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond
Chongjie Ye
Y. Nie
Jiahao Chang
Yuantao Chen
Yihao Zhi
Xiaoguang Han
3DGS
659
28
0
28 Mar 2024
Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
Xiaoyang Lyu
Chirui Chang
Peng Dai
Yang-tian Sun
Xiaojuan Qi
3DGS
308
12
0
28 Mar 2024
WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion
Khiem Vuong
N. D. Reddy
Shen Zheng
S. Narasimhan
185
3
0
27 Mar 2024
Benchmarking Object Detectors with COCO: A New Path Forward
Shweta Singh
Aayan Yadav
Jitesh Jain
Humphrey Shi
Justin Johnson
Karan Desai
169
20
0
27 Mar 2024
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding
Zhiheng Cheng
Qingyue Wei
Hongru Zhu
Yan Wang
Liangqiong Qu
Wei Shao
Yuyin Zhou
MedIm
174
57
0
27 Mar 2024
EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation
Chenhongyi Yang
Anastasia Tkach
Shreyas Hampali
Linguang Zhang
Elliot J. Crowley
Cem Keskin
193
0
0
26 Mar 2024
The Need for Speed: Pruning Transformers with One Recipe
Samir Khaki
Konstantinos N. Plataniotis
318
14
0
26 Mar 2024
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang
Zehui Chen
Miguel Espinosa
Linus Ericsson
Zhenyu Wang
Jiaming Liu
Elliot J. Crowley
Mamba
288
160
0
26 Mar 2024
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation
Jiahao Chen
Yipeng Qin
Lingjie Liu
Jiangbo Lu
Guanbin Li
167
17
0
26 Mar 2024
Activity-Biometrics: Person Identification from Daily Activities
Shehreen Azad
Yogesh S Rawat
125
8
0
26 Mar 2024
Clustering Propagation for Universal Medical Image Segmentation
Yuhang Ding
Liulei Li
Wenguan Wang
Yi Yang
160
21
0
25 Mar 2024
V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster
Si Liu
Zihan Ding
Jiahui Fu
Hongyu Li
Siheng Chen
Shifeng Zhang
Xu Zhou
214
9
0
25 Mar 2024
Towards Large-Scale Training of Pathology Foundation Models
kaiko.ai
N. Aben
Edwin D. de Jong
Ioannis Gatopoulos
Nicolas Kanzig
Mikhail Karasikov
Axel Lagré
Roman Moser
J. Doorn
Fei Tang
MedIm
AI4CE
163
19
0
24 Mar 2024
3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge
Siwei Yang
Xianhang Li
Jieru Mei
Jieneng Chen
Cihang Xie
Yuyin Zhou
MedIm
164
10
0
23 Mar 2024
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting
Jun Guo
Xiaojian Ma
Yue Fan
Huaping Liu
Qing Li
3DGS
258
55
0
22 Mar 2024
Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion
Computer Vision and Pattern Recognition (CVPR), 2024
S. Casarin
C. Ugwu
Sergio Escalera
Oswald Lanz
233
0
0
22 Mar 2024
Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans
AAAI Conference on Artificial Intelligence (AAAI), 2024
Heng Guo
Jianfeng Zhang
Jiaxing Huang
Tony C. W. Mok
Dazhou Guo
Ke Yan
Le Lu
Dakai Jin
Minfeng Xu
MedIm
151
10
0
22 Mar 2024
BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Jiahao Lu
Jiacheng Deng
Tianzhu Zhang
290
10
0
22 Mar 2024
Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation
IEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2024
Wenlve Zhou
Zhiheng Zhou
Tianlei Wang
Delu Zeng
190
0
0
22 Mar 2024
DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Zeeshan Hayder
Xuming He
ViT
263
17
0
21 Mar 2024
ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition
Tianhao Wu
Chuanxia Zheng
Tat-Jen Cham
Qianyi Wu
3DV
310
3
0
21 Mar 2024
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model
Zheng Zhang
Yeyao Ma
Enming Zhang
Xiang Bai
VLM
MLLM
247
77
0
21 Mar 2024
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Pablo Marcos-Manchón
Roberto Alcover-Couso
Juan C. Sanmiguel
Jose M. Martínez
VLM
258
27
0
21 Mar 2024
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining
Di Wang
Jing Zhang
Minqiang Xu
Lin Liu
Dongsheng Wang
...
Chengxi Han
Haonan Guo
Bo Du
Dacheng Tao
Guang Dai
210
93
0
20 Mar 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Yibo Wang
Ruiyuan Gao
Kai Chen
Kaiqiang Zhou
Yingjie Cai
...
Zhenguo Li
Lihui Jiang
Dit-Yan Yeung
Qiang Xu
Kai Zhang
DiffM
358
39
0
20 Mar 2024
Rotary Position Embedding for Vision Transformer
Byeongho Heo
Song Park
Dongyoon Han
Sangdoo Yun
367
119
0
20 Mar 2024
Better Call SAL: Towards Learning to Segment Anything in Lidar
Aljovsa Ovsep
Tim Meinhardt
Francesco Ferroni
Neehar Peri
Deva Ramanan
Laura Leal-Taixé
VLM
212
35
0
19 Mar 2024
When Do We Not Need Larger Vision Models?
Baifeng Shi
Ziyang Wu
Maolin Mao
Xin Wang
Trevor Darrell
VLM
LRM
357
68
0
19 Mar 2024
Previous
1
2
3
...
17
18
19
...
32
33
34
Next