Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown

Title
On Moving Object Segmentation from Monocular Video with Transformers Christian Homeyer Christoph Schnörr 243 3 0 28 Nov 2024
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior Li-Yuan Tsao Hao-Wei Chen Hao-Wei Chung Deqing Sun Chun-Yi Lee Kelvin Chan Ming-Hsuan Yang DiffM 201 7 0 27 Nov 2024
Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions Jingdong Zhang Hanrong Ye Xin Li Wenping Wang Dan Xu 326 2 0 27 Nov 2024
Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation Sudarshan Rajagopalan Vishal M. Patel 145 0 0 26 Nov 2024
HyperSeg: Towards Universal Visual Segmentation with Large Language Model Cong Wei Yujie Zhong Haoxian Tan Yong Liu Zheng Zhao Jie Hu Yujiu Yang VOS MLLM VLM LRM 254 17 0 26 Nov 2024
Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps Xue Xia Randall Balestriero Tao Zhang L. Hurni VOS AI4TS 176 0 0 26 Nov 2024
VideoOrion: Tokenizing Object Dynamics in Videos Yicheng Feng Yijiang Li Wanpeng Zhang Sipeng Zheng Zongqing Lu 362 7 0 25 Nov 2024
A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation M. Valiuddin R. V. Sloun C.G.A. Viviers Peter H. N. de With Fons van der Sommen UQCV 990 1 0 25 Nov 2024
SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models Harsh Goel Sai Shankar Narasimhan Oguzhan Akcin Sandeep Chinchali DiffM 325 2 0 25 Nov 2024
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs Chen Xin T. Motz Andreas Hartel Enkelejda Kasneci 310 1 0 23 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks Miguel Espinosa Chenhongyi Yang Linus Ericsson Jingyu Sun Elliot J. Crowley VLM 253 3 0 22 Nov 2024
DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines BigData Congress [Services Society] (BSS), 2024 Mizanur Rahman Jewel Mohamed Elmahallawy S. Madria Samuel Frimpong 222 5 0 20 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Neural Information Processing Systems (NeurIPS), 2024 Ziyi Wang Yijiao Wang Xumin Yu Jie Zhou Jiwen Lu 189 3 0 20 Nov 2024
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024 Xuechao Zou Shun Zhang Kai Li Shiying Wang Junliang Xing Lei Jin Congyan Lang Pin Tao 230 4 0 20 Nov 2024
Chanel-Orderer: A Channel-Ordering Predictor for Tri-Channel Natural Images Shen Li Lei Jiang Wei Wang Hongwei Hu Liang Li 279 0 0 20 Nov 2024
MGNiceNet: Unified Monocular Geometric Scene Understanding Asian Conference on Computer Vision (ACCV), 2024 Markus Schön Michael Buchholz Klaus C. J. Dietmayer 3DPC 524 0 0 18 Nov 2024
MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data Chika Maduabuchi Ericmoore Jossou Matteo Bucci 344 0 0 12 Nov 2024
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps Xue Xia Daiwei Zhang Wenxuan Song Wei Huang L. Hurni AI4TS VLM 145 7 0 11 Nov 2024
Watermark Anything with Localized Messages International Conference on Learning Representations (ICLR), 2024 Tom Sander Pierre Fernandez Alain Durmus Teddy Furon Matthijs Douze VLM 390 31 0 11 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations Neural Information Processing Systems (NeurIPS), 2024 Sjoerd van Steenkiste Daniel Zoran Yi Yang Yulia Rubanova Rishabh Kabra ... Thomas Keck João Carreira Alexey Dosovitskiy Mehdi S. M. Sajjadi Thomas Kipf 248 9 0 08 Nov 2024
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts Neural Information Processing Systems (NeurIPS), 2024 Zhitong Gao Bingnan Li Mathieu Salzmann Xuming He OOD VLM 337 5 0 06 Nov 2024
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation Jinchao Ge Bowen Zhang Akide Liu Minh Hieu Phan Qi Chen Yangyang Shu Yang Zhao VLM CLL 204 0 0 05 Nov 2024
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective Neural Information Processing Systems (NeurIPS), 2024 Qishuai Wen Chun-Guang Li ViT 443 0 0 05 Nov 2024
GenXD: Generating Any 3D and 4D Scenes International Conference on Learning Representations (ICLR), 2024 Yuyang Zhao Chung-Ching Lin Kevin Qinghong Lin Zhiwen Yan Linjie Li Zhiyong Yang Jianfeng Wang G. Lee Lijuan Wang VGen 354 39 0 04 Nov 2024
Event-guided Low-light Video Semantic Segmentation IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024 Zhen Yao Mooi Choo Choo Chuah 187 12 0 01 Nov 2024
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing IEEE Access (IEEE Access), 2024 Naufal Suryanto Andro Aprila Adiputra Ahmada Yusril Kadiptya Thi-Thu-Huong Le Derry Pratama Yongsu Kim Howon Kim DiffM 265 4 0 01 Nov 2024
ZIM: Zero-Shot Image Matting for Anything Beomyoung Kim Chanyong Shin Joonhyun Jeong Hyungsik Jung Se Yun Lee Sewhan Chun Dong-Hyun Hwang Joonsang Yu VLM 278 7 0 01 Nov 2024
OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction Neural Information Processing Systems (NeurIPS), 2024 Hongbo Zhao Lue Fan Yuntao Chen Haochen Wang Yiran Yang Xiaojuan Jin Yixin Zhang Gaofeng Meng Rundong Wang 221 9 0 30 Oct 2024
Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist Deblina Bhattacharjee Bahar Aydemir Baran Ozaydin Tong Zhang Mathieu Salzmann Sabine Süsstrunk 115 1 0 27 Oct 2024
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes Neural Information Processing Systems (NeurIPS), 2024 Rajat Modi Vibhav Vineet Yogesh S Rawat 258 2 0 25 Oct 2024
SegLLM: Multi-round Reasoning Segmentation XuDong Wang Shaolun Zhang Shufan Li Konstantinos Kallidromitis Kehan Li Yusuke Kato Kazuki Kozuka Trevor Darrell VLM LRM 239 11 0 24 Oct 2024
Is Smoothness the Key to Robustness? A Comparison of Attention and Convolution Models Using a Novel Metric Baiyuan Chen MLT 260 0 0 23 Oct 2024
PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting IEEE Transactions on Image Processing (TIP), 2024 Yu Wang Xiaobao Wei Ming Lu Guoliang Kang 3DGS 245 10 0 23 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context Maximilian Augustin Syed Shakib Sarwar Mostafa Elhoushi Sai Qian Zhang Yuecheng Li B. D. Salvo 201 1 0 23 Oct 2024
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model Neural Information Processing Systems (NeurIPS), 2024 Jingjing Jiang Xianghong Li Tao Xiang Jifeng Dai ISeg 200 7 0 22 Oct 2024
Frontiers in Intelligent Colonoscopy Ge-Peng Ji Jingyi Liu Peng Xu Nick Barnes Fahad Shahbaz Khan Salman Khan Deng-Ping Fan 332 11 0 22 Oct 2024
Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation Ruting Chi Zhiyi Huang Yuexing Han ISeg 221 0 0 21 Oct 2024
Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment International Conference on Learning Representations (ICLR), 2024 Yankai Jiang Wenhui Lei Xiaofan Zhang Shanghang Zhang MedIm 358 5 0 21 Oct 2024
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios Computer Vision and Pattern Recognition (CVPR), 2024 Ziming Huang Xurui Li Haotian Liu Feng Xue Yuzhe Wang Yu Zhou 314 5 0 18 Oct 2024
DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering Neural Information Processing Systems (NeurIPS), 2024 Jiahao Lu Jiacheng Deng Ruijie Zhu Yanzhe Liang Wenfei Yang Tianzhu Zhang Xu Zhou 3DGS 302 22 0 17 Oct 2024
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 Shrishti Saha Shetu Emanuël A. P. Habets Andreas Brendel 140 6 0 17 Oct 2024
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation Changcheng Xiao Qiong Cao Yujie Zhong Xiang Zhang Tao Wang Canqun Yang L. Lan 170 3 0 17 Oct 2024
Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation International Conference on Pattern Recognition (ICPR), 2024 Wenbo Xu Yanan Wu Haoran Jiang Yang Wang Qiang Wu Jian Zhang CLL VLM 174 1 0 16 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration Neural Information Processing Systems (NeurIPS), 2024 Yiwei Guo Shaobin Zhuang Kunchang Li Yu Qiao Yali Wang VLM CLIP 357 5 0 16 Oct 2024
Order-aware Interactive Segmentation International Conference on Learning Representations (ICLR), 2024 Bin Wang Anwesa Choudhuri Meng Zheng Zhongpai Gao Benjamin Planche Andong Deng Qin Liu Terrence Chen Ulas Bagci Ziyan Wu VLM 892 2 0 16 Oct 2024
OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation Dongjun Hwang Yejin Kim Junsuk Choe Seong Joon Oh VLM 664 0 0 15 Oct 2024
AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure Yu Zhang Kefeng Zheng Fei Liu Qingfu Zhang Zhenkun Wang 203 8 0 14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition He Guo Yulong Wang Zixuan Ye Jifeng Dai Yuwen Xiong ViT 207 1 0 14 Oct 2024
MagicEraser: Erasing Any Objects via Semantics-Aware Control European Conference on Computer Vision (ECCV), 2024 Fan Li Zixiao Zhang Yi Huang Jianzhuang Liu Renjing Pei Bin Shao Songcen Xu DiffM 186 12 0 14 Oct 2024
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation Zhiyun Song Yue Zhao Xiaomin Li Manman Fei Xiangyu Zhao ... Chung-Hsing Yeh Qian Wang Guoyan Zheng Songtao Ai Lichi Zhang 252 2 0 14 Oct 2024