Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2112.01527
Cited By
v1
v2
v3 (latest)
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,661 papers shown
Title
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
243
3
0
28 Nov 2024
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior
Li-Yuan Tsao
Hao-Wei Chen
Hao-Wei Chung
Deqing Sun
Chun-Yi Lee
Kelvin Chan
Ming-Hsuan Yang
DiffM
201
7
0
27 Nov 2024
Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions
Jingdong Zhang
Hanrong Ye
Xin Li
Wenping Wang
Dan Xu
326
2
0
27 Nov 2024
Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation
Sudarshan Rajagopalan
Vishal M. Patel
145
0
0
26 Nov 2024
HyperSeg: Towards Universal Visual Segmentation with Large Language Model
Cong Wei
Yujie Zhong
Haoxian Tan
Yong Liu
Zheng Zhao
Jie Hu
Yujiu Yang
VOS
MLLM
VLM
LRM
254
17
0
26 Nov 2024
Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps
Xue Xia
Randall Balestriero
Tao Zhang
L. Hurni
VOS
AI4TS
176
0
0
26 Nov 2024
VideoOrion: Tokenizing Object Dynamics in Videos
Yicheng Feng
Yijiang Li
Wanpeng Zhang
Sipeng Zheng
Zongqing Lu
Sipeng Zheng
Zongqing Lu
362
7
0
25 Nov 2024
A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation
M. Valiuddin
R. V. Sloun
C.G.A. Viviers
Peter H. N. de With
Fons van der Sommen
UQCV
990
1
0
25 Nov 2024
SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models
Harsh Goel
Sai Shankar Narasimhan
Oguzhan Akcin
Sandeep Chinchali
DiffM
325
2
0
25 Nov 2024
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs
Chen Xin
T. Motz
Andreas Hartel
Enkelejda Kasneci
310
1
0
23 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Jingyu Sun
Elliot J. Crowley
VLM
253
3
0
22 Nov 2024
DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines
BigData Congress [Services Society] (BSS), 2024
Mizanur Rahman Jewel
Mohamed Elmahallawy
S. Madria
Samuel Frimpong
222
5
0
20 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Ziyi Wang
Yijiao Wang
Xumin Yu
Jie Zhou
Jiwen Lu
189
3
0
20 Nov 2024
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Xuechao Zou
Shun Zhang
Kai Li
Shiying Wang
Junliang Xing
Lei Jin
Congyan Lang
Pin Tao
230
4
0
20 Nov 2024
Chanel-Orderer: A Channel-Ordering Predictor for Tri-Channel Natural Images
Shen Li
Lei Jiang
Wei Wang
Hongwei Hu
Liang Li
279
0
0
20 Nov 2024
MGNiceNet: Unified Monocular Geometric Scene Understanding
Asian Conference on Computer Vision (ACCV), 2024
Markus Schön
Michael Buchholz
Klaus C. J. Dietmayer
3DPC
524
0
0
18 Nov 2024
MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data
Chika Maduabuchi
Ericmoore Jossou
Matteo Bucci
344
0
0
12 Nov 2024
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps
Xue Xia
Daiwei Zhang
Wenxuan Song
Wei Huang
L. Hurni
AI4TS
VLM
145
7
0
11 Nov 2024
Watermark Anything with Localized Messages
International Conference on Learning Representations (ICLR), 2024
Tom Sander
Pierre Fernandez
Alain Durmus
Teddy Furon
Matthijs Douze
VLM
390
31
0
11 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
Neural Information Processing Systems (NeurIPS), 2024
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
248
9
0
08 Nov 2024
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Neural Information Processing Systems (NeurIPS), 2024
Zhitong Gao
Bingnan Li
Mathieu Salzmann
Xuming He
OOD
VLM
337
5
0
06 Nov 2024
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Jinchao Ge
Bowen Zhang
Akide Liu
Minh Hieu Phan
Qi Chen
Yangyang Shu
Yang Zhao
VLM
CLL
204
0
0
05 Nov 2024
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective
Neural Information Processing Systems (NeurIPS), 2024
Qishuai Wen
Chun-Guang Li
ViT
443
0
0
05 Nov 2024
GenXD: Generating Any 3D and 4D Scenes
International Conference on Learning Representations (ICLR), 2024
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Zhiyong Yang
Jianfeng Wang
G. Lee
Lijuan Wang
VGen
354
39
0
04 Nov 2024
Event-guided Low-light Video Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhen Yao
Mooi Choo Choo Chuah
187
12
0
01 Nov 2024
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing
IEEE Access (IEEE Access), 2024
Naufal Suryanto
Andro Aprila Adiputra
Ahmada Yusril Kadiptya
Thi-Thu-Huong Le
Derry Pratama
Yongsu Kim
Howon Kim
DiffM
265
4
0
01 Nov 2024
ZIM: Zero-Shot Image Matting for Anything
Beomyoung Kim
Chanyong Shin
Joonhyun Jeong
Hyungsik Jung
Se Yun Lee
Sewhan Chun
Dong-Hyun Hwang
Joonsang Yu
VLM
278
7
0
01 Nov 2024
OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction
Neural Information Processing Systems (NeurIPS), 2024
Hongbo Zhao
Lue Fan
Yuntao Chen
Haochen Wang
Yiran Yang
Xiaojuan Jin
Yixin Zhang
Gaofeng Meng
Rundong Wang
221
9
0
30 Oct 2024
Unlocking Comics: The AI4VA Dataset for Visual Understanding
Peter Grönquist
Deblina Bhattacharjee
Bahar Aydemir
Baran Ozaydin
Tong Zhang
Mathieu Salzmann
Sabine Süsstrunk
115
1
0
27 Oct 2024
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
Neural Information Processing Systems (NeurIPS), 2024
Rajat Modi
Vibhav Vineet
Yogesh S Rawat
258
2
0
25 Oct 2024
SegLLM: Multi-round Reasoning Segmentation
XuDong Wang
Shaolun Zhang
Shufan Li
Konstantinos Kallidromitis
Kehan Li
Yusuke Kato
Kazuki Kozuka
Trevor Darrell
VLM
LRM
239
11
0
24 Oct 2024
Is Smoothness the Key to Robustness? A Comparison of Attention and Convolution Models Using a Novel Metric
Baiyuan Chen
MLT
260
0
0
23 Oct 2024
PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting
IEEE Transactions on Image Processing (TIP), 2024
Yu Wang
Xiaobao Wei
Ming Lu
Guoliang Kang
3DGS
245
10
0
23 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
201
1
0
23 Oct 2024
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model
Neural Information Processing Systems (NeurIPS), 2024
Jingjing Jiang
Xianghong Li
Tao Xiang
Jifeng Dai
ISeg
200
7
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
332
11
0
22 Oct 2024
Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation
Ruting Chi
Zhiyi Huang
Yuexing Han
ISeg
221
0
0
21 Oct 2024
Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment
International Conference on Learning Representations (ICLR), 2024
Yankai Jiang
Wenhui Lei
Xiaofan Zhang
Shanghang Zhang
MedIm
358
5
0
21 Oct 2024
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Computer Vision and Pattern Recognition (CVPR), 2024
Ziming Huang
Xurui Li
Haotian Liu
Feng Xue
Yuzhe Wang
Yu Zhou
314
5
0
18 Oct 2024
DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering
Neural Information Processing Systems (NeurIPS), 2024
Jiahao Lu
Jiacheng Deng
Ruijie Zhu
Yanzhe Liang
Wenfei Yang
Tianzhu Zhang
Xu Zhou
3DGS
302
22
0
17 Oct 2024
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
140
6
0
17 Oct 2024
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Changcheng Xiao
Qiong Cao
Yujie Zhong
Xiang Zhang
Tao Wang
Canqun Yang
L. Lan
170
3
0
17 Oct 2024
Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation
International Conference on Pattern Recognition (ICPR), 2024
Wenbo Xu
Yanan Wu
Haoran Jiang
Yang Wang
Qiang Wu
Jian Zhang
CLL
VLM
174
1
0
16 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Neural Information Processing Systems (NeurIPS), 2024
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM
CLIP
357
5
0
16 Oct 2024
Order-aware Interactive Segmentation
International Conference on Learning Representations (ICLR), 2024
Bin Wang
Anwesa Choudhuri
Meng Zheng
Zhongpai Gao
Benjamin Planche
Andong Deng
Qin Liu
Terrence Chen
Ulas Bagci
Ziyan Wu
VLM
892
2
0
16 Oct 2024
OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation
Dongjun Hwang
Yejin Kim
Junsuk Choe
Seong Joon Oh
Junsuk Choe
VLM
664
0
0
15 Oct 2024
AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure
Yu Zhang
Kefeng Zheng
Fei Liu
Qingfu Zhang
Zhenkun Wang
203
8
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
207
1
0
14 Oct 2024
MagicEraser: Erasing Any Objects via Semantics-Aware Control
European Conference on Computer Vision (ECCV), 2024
Fan Li
Zixiao Zhang
Yi Huang
Jianzhuang Liu
Renjing Pei
Bin Shao
Songcen Xu
DiffM
186
12
0
14 Oct 2024
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation
Zhiyun Song
Yue Zhao
Xiaomin Li
Manman Fei
Xiangyu Zhao
...
Chung-Hsing Yeh
Qian Wang
Guoyan Zheng
Songtao Ai
Lichi Zhang
252
2
0
14 Oct 2024
Previous
1
2
3
...
10
11
12
...
32
33
34
Next