v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown

Title
LoCUS: Learning Multiscale 3D-consistent Features from Posed ImagesIEEE International Conference on Computer Vision (ICCV), 2023 Dominik A. Kloepfer Dylan Campbell João F. Henriques 3DPC 3DV 160 1 0 02 Oct 2023
ViPlanner: Visual Semantic Imperative Learning for Local NavigationIEEE International Conference on Robotics and Automation (ICRA), 2023 Pascal Roth Julian Nubert Fan Yang Mayank Mittal Marco Hutter 280 57 0 02 Oct 2023
Completing Visual Objects via Bridging Generation and SegmentationInternational Conference on Machine Learning (ICML), 2023 Xiang Li Yinpeng Chen Chung-Ching Lin Hao Chen Kai Hu Rita Singh Bhiksha Raj Lijuan Wang Zicheng Liu DiffM 308 3 0 01 Oct 2023
PharmacoNet: Accelerating Large-Scale Virtual Screening by Deep Pharmacophore ModelingChemical Science (Chem. Sci.), 2023 Seonghwan Seo Woo Youn Kim 243 5 0 01 Oct 2023
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations Alina Elena Baia Valentina Poggioni Andrea Cavallaro AAML 192 1 0 30 Sep 2023
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision GeneralistsInternational Conference on Learning Representations (ICLR), 2023 Yulu Gan Sungwoo Park Alexander Schubert Anthony Philippakis Ahmed Alaa VLM 244 29 0 30 Sep 2023
Advances in Kidney Biopsy Lesion Assessment through Dense Instance Segmentation Zhan Xiong Junling He Pieter Valkema Tri Q. Nguyen M. Naesens J. Kers F. Verbeek MedIm 88 0 0 29 Sep 2023
Investigating Shift Equivalence of Convolutional Neural Networks in Industrial Defect SegmentationIEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2023 Yunsheng Tian Jieliang Luo Yichen Li Zhengtao Zhang Hui Li 168 7 0 29 Sep 2023
Superpixel Transformers for Efficient Semantic SegmentationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023 Xiao Han Jieru Mei Lu Zhang Hang Yan Yongkai Wu Liang-Chieh Chen Henrik Kretzschmar ViT 142 16 0 28 Sep 2023
Radar Instance Transformer: Reliable Moving Instance Segmentation in Sparse Radar Point CloudsIEEE Transactions on robotics (TRO), 2023 Matthias Zeller Vardeep S. Sandhu Benedikt Mersch D. Hristopulos Michael Heidingsfeld Cyrill Stachniss 299 17 0 28 Sep 2023
Two-Step Active Learning for Instance Segmentation with Uncertainty and Diversity Sampling Ke Yu Yuanmin Tang Giulia DeSalvo Suraj Kothawade Abdullah Rashwan S. Tavakkol Kayhan Batmanghelich Xiaoqi Yin ISeg 184 0 0 28 Sep 2023
Mask4Former: Mask Transformer for 4D Panoptic SegmentationIEEE International Conference on Robotics and Automation (ICRA), 2023 Kadir Yilmaz Jonas Schult Alexey Nekrasov Bastian Leibe ISeg 3DPC 275 22 0 28 Sep 2023
The Robust Semantic Segmentation UNCV2023 Challenge Results Xuanlong Yu Yi Zuo Zitao Wang Xiaowen Zhang Jiaxuan Zhao ... Angela Yao Wenlong Chen Ivor J. A. Simpson Neill D. F. Campbell Gianni Franchi UQCV 240 7 0 27 Sep 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Ao Wang Hui Chen Zijia Lin Sicheng Zhao Jiawei Han Guiguang Ding ViT 253 8 0 27 Sep 2023
DECO: Dense Estimation of 3D Human-Scene Contact In The WildIEEE International Conference on Computer Vision (ICCV), 2023 Shashank Tripathi Agniv Chatterjee Jean-Claude Passy Hongwei Yi Dimitrios Tzionas Michael J. Black 3DH 169 35 0 26 Sep 2023
MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection Kemal Oksuz Selim Kuzucu Tom Joy P. Dokania MoE 466 13 0 26 Sep 2023
Volumetric Semantically Consistent 3D Panoptic MappingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023 Yang Miao Iro Armeni Marc Pollefeys Dániel Baráth 3DPC 185 10 0 26 Sep 2023
Dynamic Scene Graph Representation for Surgical Video Felix Holm Ghazal Ghazaei Tobias Czempiel Ege Özsoy Stefan Saur Nassir Navab MedIm 190 27 0 25 Sep 2023
Assessment of a new GeoAI foundation model for flood inundation mapping Wenwen Li Hyunho Lee Sizhe Wang Chia-Yu Hsu S. Arundel AI4CE 155 24 0 25 Sep 2023
3D Indoor Instance Segmentation in an Open-WorldNeural Information Processing Systems (NeurIPS), 2023 Mohamed El Amine Boudjoghra Salwa K. Al Khatib Jean Lahoud Hisham Cholakkal Rao Muhammad Anwer Salman Khan Fahad Khan 3DV ISeg 148 8 0 25 Sep 2023
Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2023 Quang H. Nguyen T. Vu Anh Tran Kim Dan Nguyen DiffM 473 131 0 25 Sep 2023
A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and Weeds Competition K. Nguyen T. Phung Hoang-Giang Cao 107 8 0 24 Sep 2023
LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and ReasoningIEEE International Conference on Computer Vision (ICCV), 2023 Liulei Li Wenguan Wang Yi Yang NAI VLM 311 45 0 24 Sep 2023
I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR DiagnosesIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023 Trong-Thang Pham Jacob Brecheisen Anh Nguyen Hien Nguyen Ngan Le 165 16 0 24 Sep 2023
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric RepresentationIEEE International Conference on Computer Vision (ICCV), 2023 Ke Fan Jingshi Lei Xuelin Qian Miaopeng Yu Tianjun Xiao Tong He Zheng Zhang Yanwei Fu VOS 111 8 0 23 Sep 2023
ClusterFormer: Clustering As A Universal Visual Learner James Liang Yiming Cui Qifan Wang Tong Geng Wenguan Wang Dongfang Liu VLM 357 19 0 22 Sep 2023
NTO3D: Neural Target Object 3D Reconstruction with Segment AnythingComputer Vision and Pattern Recognition (CVPR), 2023 Xi Wei Renrui Zhang Jiarui Wu Jiaming Liu Ming Lu Yandong Guo Shanghang Zhang 231 9 0 22 Sep 2023
Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and SamplingComputer Vision and Pattern Recognition (CVPR), 2023 Leon Sick Dominik Engel Pedro Hermosilla Timo Ropinski 200 17 0 21 Sep 2023
TCOVIS: Temporally Consistent Online Video Instance SegmentationIEEE International Conference on Computer Vision (ICCV), 2023 Junlong Li Ting Yu Yongming Rao Jie Zhou Jiwen Lu 158 20 0 21 Sep 2023
A Vision-Centric Approach for Static Map Element AnnotationIEEE International Conference on Robotics and Automation (ICRA), 2023 Jiaxin Zhang Shiyuan Chen Haoran Yin Ruohong Mei Xuan Liu Cong Yang Qian Zhang Wei Sui 3DV 235 7 0 21 Sep 2023
Multi-grained Temporal Prototype Learning for Few-shot Video Object SegmentationIEEE International Conference on Computer Vision (ICCV), 2023 Nian Liu Kepan Nan Wangbo Zhao Yuanwei Liu Xiwen Yao Salman Khan Hisham Cholakkal Rao Muhammad Anwer Junwei Han Fahad Shahbaz Khan VOS 217 11 0 20 Sep 2023
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene ParsingIEEE Transactions on Intelligent Vehicles (TIV), 2023 Jiahang Li Yikang Zhang Peng Yun Guangliang Zhou Qijun Chen Rui Fan ViT OffRL 344 39 0 19 Sep 2023
PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban ScenesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Xiao Fu Shangzhan Zhang Tianrun Chen Yichong Lu Xiaowei Zhou Andreas Geiger Yiyi Liao 3DPC 486 12 0 19 Sep 2023
Drawing the Same Bounding Box Twice? Coping Noisy Annotations in Object Detection with Repeated Labels David Tschirschwitz C. Benz Morris Florek Henrik Norderhus Benno Stein Volker Rodehorst 155 1 0 18 Sep 2023
Discovering Sounding Objects by Audio Queries for Audio Visual SegmentationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023 Shaofei Huang Han Li Yuqing Wang Hongji Zhu Jiao Dai Jizhong Han Wenge Rong Si Liu VOS 125 30 0 18 Sep 2023
Uncertainty-aware 3D Object-Level Mapping with Deep Shape PriorsIEEE International Conference on Robotics and Automation (ICRA), 2023 Ziwei Liao Jun Yang Jingxing Qian Angela P. Schoellig Steven L. Waslander 148 7 0 17 Sep 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic SegmentationBritish Machine Vision Conference (BMVC), 2023 Zhaochong An Guolei Sun Zongwei Wu Hao Tang Luc Van Gool VOS 190 6 0 14 Sep 2023
NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches Chi-en Amy Tai Matthew Keller Saeejith Nair Yuhao Chen Yifan Wu ... Krish Parmar Pengcheng Xi Heather H. Keller Sharon I Kirkpatrick Alexander Wong 137 7 0 14 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition Zhiqiang Hu Tao Yu 166 5 0 13 Sep 2023
MPI-Flow: Learning Realistic Optical Flow with Multiplane ImagesIEEE International Conference on Computer Vision (ICCV), 2023 Yingping Liang Jiaming Liu Debing Zhang Ying Fu 140 8 0 13 Sep 2023
ASPED: An Audio Dataset for Detecting PedestriansIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Pavan Seshadri Chaeyeon Han B. Koo Noah Posner S. Guhathakurta Alexander Lerch 100 3 0 12 Sep 2023
IBAFormer: Intra-batch Attention Transformer for Domain Generalized Semantic Segmentation Qiyu Sun Huilin Chen Meng Zheng Ziyan Wu Michael Felsberg Yang Tang 256 6 0 12 Sep 2023
Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields Teppei Suzuki AI4CE 296 10 0 12 Sep 2023
Panoptic Vision-Language Feature FieldsIEEE Robotics and Automation Letters (RA-L), 2023 Haoran Chen Kenneth Blomqvist Francesco Milano Roland Siegwart VLM 218 16 0 11 Sep 2023
Toward a Deeper Understanding: RetNet Viewed through ConvolutionPattern Recognition (Pattern Recogn.), 2023 Chenghao Li Chaoning Zhang ViT 195 14 0 11 Sep 2023
PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural roboticsIEEE Robotics and Automation Letters (RA-L), 2023 Claus Smitt Michael Halstead Patrick Zimmer Thomas Labe Esra Guclu C. Stachniss Chris McCool 150 28 0 11 Sep 2023
Mask2Anomaly: Mask Transformer for Universal Open-set SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Shyam Nandan Rai Fabio Cermelli Barbara Caputo Carlo Masone ISeg ViT 273 15 0 08 Sep 2023
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual GroundingEuropean Conference on Computer Vision (ECCV), 2023 Ozan Unal Daniel Gehrig Suman Saha Luc Van Gool 213 27 0 08 Sep 2023
Video Task Decathlon: Unifying Image and Video Tasks in Autonomous DrivingIEEE International Conference on Computer Vision (ICCV), 2023 Thomas E. Huang Yifan Liu Luc Van Gool Fisher Yu 292 9 0 08 Sep 2023
Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving ScenesIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023 Youssef Shoeb Robin Shing Moon Chan Gesina Schwalbe Azarm Nowzard Fatma Guney Hanno Gottschalk 176 8 0 08 Sep 2023