ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
Title
LoCUS: Learning Multiscale 3D-consistent Features from Posed Images
LoCUS: Learning Multiscale 3D-consistent Features from Posed ImagesIEEE International Conference on Computer Vision (ICCV), 2023
Dominik A. Kloepfer
Dylan Campbell
João F. Henriques
3DPC3DV
160
1
0
02 Oct 2023
ViPlanner: Visual Semantic Imperative Learning for Local Navigation
ViPlanner: Visual Semantic Imperative Learning for Local NavigationIEEE International Conference on Robotics and Automation (ICRA), 2023
Pascal Roth
Julian Nubert
Fan Yang
Mayank Mittal
Marco Hutter
280
57
0
02 Oct 2023
Completing Visual Objects via Bridging Generation and Segmentation
Completing Visual Objects via Bridging Generation and SegmentationInternational Conference on Machine Learning (ICML), 2023
Xiang Li
Yinpeng Chen
Chung-Ching Lin
Hao Chen
Kai Hu
Rita Singh
Bhiksha Raj
Lijuan Wang
Zicheng Liu
DiffM
308
3
0
01 Oct 2023
PharmacoNet: Accelerating Large-Scale Virtual Screening by Deep
  Pharmacophore Modeling
PharmacoNet: Accelerating Large-Scale Virtual Screening by Deep Pharmacophore ModelingChemical Science (Chem. Sci.), 2023
Seonghwan Seo
Woo Youn Kim
243
5
0
01 Oct 2023
Black-box Attacks on Image Activity Prediction and its Natural Language
  Explanations
Black-box Attacks on Image Activity Prediction and its Natural Language Explanations
Alina Elena Baia
Valentina Poggioni
Andrea Cavallaro
AAML
192
1
0
30 Sep 2023
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision
  Generalists
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision GeneralistsInternational Conference on Learning Representations (ICLR), 2023
Yulu Gan
Sungwoo Park
Alexander Schubert
Anthony Philippakis
Ahmed Alaa
VLM
244
29
0
30 Sep 2023
Advances in Kidney Biopsy Lesion Assessment through Dense Instance
  Segmentation
Advances in Kidney Biopsy Lesion Assessment through Dense Instance Segmentation
Zhan Xiong
Junling He
Pieter Valkema
Tri Q. Nguyen
M. Naesens
J. Kers
F. Verbeek
MedIm
88
0
0
29 Sep 2023
Investigating Shift Equivalence of Convolutional Neural Networks in
  Industrial Defect Segmentation
Investigating Shift Equivalence of Convolutional Neural Networks in Industrial Defect SegmentationIEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2023
Yunsheng Tian
Jieliang Luo
Yichen Li
Zhengtao Zhang
Hui Li
168
7
0
29 Sep 2023
Superpixel Transformers for Efficient Semantic Segmentation
Superpixel Transformers for Efficient Semantic SegmentationIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Xiao Han
Jieru Mei
Lu Zhang
Hang Yan
Yongkai Wu
Liang-Chieh Chen
Henrik Kretzschmar
ViT
142
16
0
28 Sep 2023
Radar Instance Transformer: Reliable Moving Instance Segmentation in
  Sparse Radar Point Clouds
Radar Instance Transformer: Reliable Moving Instance Segmentation in Sparse Radar Point CloudsIEEE Transactions on robotics (TRO), 2023
Matthias Zeller
Vardeep S. Sandhu
Benedikt Mersch
D. Hristopulos
Michael Heidingsfeld
Cyrill Stachniss
299
17
0
28 Sep 2023
Two-Step Active Learning for Instance Segmentation with Uncertainty and
  Diversity Sampling
Two-Step Active Learning for Instance Segmentation with Uncertainty and Diversity Sampling
Ke Yu
Yuanmin Tang
Giulia DeSalvo
Suraj Kothawade
Abdullah Rashwan
S. Tavakkol
Kayhan Batmanghelich
Xiaoqi Yin
ISeg
184
0
0
28 Sep 2023
Mask4Former: Mask Transformer for 4D Panoptic Segmentation
Mask4Former: Mask Transformer for 4D Panoptic SegmentationIEEE International Conference on Robotics and Automation (ICRA), 2023
Kadir Yilmaz
Jonas Schult
Alexey Nekrasov
Bastian Leibe
ISeg3DPC
275
22
0
28 Sep 2023
The Robust Semantic Segmentation UNCV2023 Challenge Results
The Robust Semantic Segmentation UNCV2023 Challenge Results
Xuanlong Yu
Yi Zuo
Zitao Wang
Xiaowen Zhang
Jiaxuan Zhao
...
Angela Yao
Wenlong Chen
Ivor J. A. Simpson
Neill D. F. Campbell
Gianni Franchi
UQCV
240
7
0
27 Sep 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Ao Wang
Hui Chen
Zijia Lin
Sicheng Zhao
Jiawei Han
Guiguang Ding
ViT
253
8
0
27 Sep 2023
DECO: Dense Estimation of 3D Human-Scene Contact In The Wild
DECO: Dense Estimation of 3D Human-Scene Contact In The WildIEEE International Conference on Computer Vision (ICCV), 2023
Shashank Tripathi
Agniv Chatterjee
Jean-Claude Passy
Hongwei Yi
Dimitrios Tzionas
Michael J. Black
3DH
169
35
0
26 Sep 2023
MoCaE: Mixture of Calibrated Experts Significantly Improves Object
  Detection
MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection
Kemal Oksuz
Selim Kuzucu
Tom Joy
P. Dokania
MoE
466
13
0
26 Sep 2023
Volumetric Semantically Consistent 3D Panoptic Mapping
Volumetric Semantically Consistent 3D Panoptic MappingIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Yang Miao
Iro Armeni
Marc Pollefeys
Dániel Baráth
3DPC
185
10
0
26 Sep 2023
Dynamic Scene Graph Representation for Surgical Video
Dynamic Scene Graph Representation for Surgical Video
Felix Holm
Ghazal Ghazaei
Tobias Czempiel
Ege Özsoy
Stefan Saur
Nassir Navab
MedIm
190
27
0
25 Sep 2023
Assessment of a new GeoAI foundation model for flood inundation mapping
Assessment of a new GeoAI foundation model for flood inundation mapping
Wenwen Li
Hyunho Lee
Sizhe Wang
Chia-Yu Hsu
S. Arundel
AI4CE
155
24
0
25 Sep 2023
3D Indoor Instance Segmentation in an Open-World
3D Indoor Instance Segmentation in an Open-WorldNeural Information Processing Systems (NeurIPS), 2023
Mohamed El Amine Boudjoghra
Salwa K. Al Khatib
Jean Lahoud
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Fahad Khan
3DVISeg
148
8
0
25 Sep 2023
Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for
  Pixel-Level Semantic Segmentation
Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2023
Quang H. Nguyen
T. Vu
Anh Tran
Kim Dan Nguyen
DiffM
473
131
0
25 Sep 2023
A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and
  Weeds Competition
A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and Weeds Competition
K. Nguyen
T. Phung
Hoang-Giang Cao
107
8
0
24 Sep 2023
LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and
  Reasoning
LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and ReasoningIEEE International Conference on Computer Vision (ICCV), 2023
Liulei Li
Wenguan Wang
Yi Yang
NAIVLM
311
45
0
24 Sep 2023
I-AI: A Controllable & Interpretable AI System for Decoding
  Radiologists' Intense Focus for Accurate CXR Diagnoses
I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR DiagnosesIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Trong-Thang Pham
Jacob Brecheisen
Anh Nguyen
Hien Nguyen
Ngan Le
165
16
0
24 Sep 2023
Rethinking Amodal Video Segmentation from Learning Supervised Signals
  with Object-centric Representation
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric RepresentationIEEE International Conference on Computer Vision (ICCV), 2023
Ke Fan
Jingshi Lei
Xuelin Qian
Miaopeng Yu
Tianjun Xiao
Tong He
Zheng Zhang
Yanwei Fu
VOS
111
8
0
23 Sep 2023
ClusterFormer: Clustering As A Universal Visual Learner
ClusterFormer: Clustering As A Universal Visual Learner
James Liang
Yiming Cui
Qifan Wang
Tong Geng
Wenguan Wang
Dongfang Liu
VLM
357
19
0
22 Sep 2023
NTO3D: Neural Target Object 3D Reconstruction with Segment Anything
NTO3D: Neural Target Object 3D Reconstruction with Segment AnythingComputer Vision and Pattern Recognition (CVPR), 2023
Xi Wei
Renrui Zhang
Jiarui Wu
Jiaming Liu
Ming Lu
Yandong Guo
Shanghang Zhang
231
9
0
22 Sep 2023
Unsupervised Semantic Segmentation Through Depth-Guided Feature
  Correlation and Sampling
Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and SamplingComputer Vision and Pattern Recognition (CVPR), 2023
Leon Sick
Dominik Engel
Pedro Hermosilla
Timo Ropinski
200
17
0
21 Sep 2023
TCOVIS: Temporally Consistent Online Video Instance Segmentation
TCOVIS: Temporally Consistent Online Video Instance SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Junlong Li
Ting Yu
Yongming Rao
Jie Zhou
Jiwen Lu
158
20
0
21 Sep 2023
A Vision-Centric Approach for Static Map Element Annotation
A Vision-Centric Approach for Static Map Element AnnotationIEEE International Conference on Robotics and Automation (ICRA), 2023
Jiaxin Zhang
Shiyuan Chen
Haoran Yin
Ruohong Mei
Xuan Liu
Cong Yang
Qian Zhang
Wei Sui
3DV
235
7
0
21 Sep 2023
Multi-grained Temporal Prototype Learning for Few-shot Video Object
  Segmentation
Multi-grained Temporal Prototype Learning for Few-shot Video Object SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Nian Liu
Kepan Nan
Wangbo Zhao
Yuanwei Liu
Xiwen Yao
Salman Khan
Hisham Cholakkal
Rao Muhammad Anwer
Junwei Han
Fahad Shahbaz Khan
VOS
217
11
0
20 Sep 2023
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene
  Parsing
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene ParsingIEEE Transactions on Intelligent Vehicles (TIV), 2023
Jiahang Li
Yikang Zhang
Peng Yun
Guangliang Zhou
Qijun Chen
Rui Fan
ViTOffRL
344
39
0
19 Sep 2023
PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes
PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban ScenesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xiao Fu
Shangzhan Zhang
Tianrun Chen
Yichong Lu
Xiaowei Zhou
Andreas Geiger
Yiyi Liao
3DPC
486
12
0
19 Sep 2023
Drawing the Same Bounding Box Twice? Coping Noisy Annotations in Object
  Detection with Repeated Labels
Drawing the Same Bounding Box Twice? Coping Noisy Annotations in Object Detection with Repeated Labels
David Tschirschwitz
C. Benz
Morris Florek
Henrik Norderhus
Benno Stein
Volker Rodehorst
155
1
0
18 Sep 2023
Discovering Sounding Objects by Audio Queries for Audio Visual
  Segmentation
Discovering Sounding Objects by Audio Queries for Audio Visual SegmentationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Shaofei Huang
Han Li
Yuqing Wang
Hongji Zhu
Jiao Dai
Jizhong Han
Wenge Rong
Si Liu
VOS
125
30
0
18 Sep 2023
Uncertainty-aware 3D Object-Level Mapping with Deep Shape Priors
Uncertainty-aware 3D Object-Level Mapping with Deep Shape PriorsIEEE International Conference on Robotics and Automation (ICRA), 2023
Ziwei Liao
Jun Yang
Jingxing Qian
Angela P. Schoellig
Steven L. Waslander
148
7
0
17 Sep 2023
Temporal-aware Hierarchical Mask Classification for Video Semantic
  Segmentation
Temporal-aware Hierarchical Mask Classification for Video Semantic SegmentationBritish Machine Vision Conference (BMVC), 2023
Zhaochong An
Guolei Sun
Zongwei Wu
Hao Tang
Luc Van Gool
VOS
190
6
0
14 Sep 2023
NutritionVerse: Empirical Study of Various Dietary Intake Estimation
  Approaches
NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches
Chi-en Amy Tai
Matthew Keller
Saeejith Nair
Yuhao Chen
Yifan Wu
...
Krish Parmar
Pengcheng Xi
Heather H. Keller
Sharon I Kirkpatrick
Alexander Wong
137
7
0
14 Sep 2023
Dynamic Spectrum Mixer for Visual Recognition
Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu
Tao Yu
166
5
0
13 Sep 2023
MPI-Flow: Learning Realistic Optical Flow with Multiplane Images
MPI-Flow: Learning Realistic Optical Flow with Multiplane ImagesIEEE International Conference on Computer Vision (ICCV), 2023
Yingping Liang
Jiaming Liu
Debing Zhang
Ying Fu
140
8
0
13 Sep 2023
ASPED: An Audio Dataset for Detecting Pedestrians
ASPED: An Audio Dataset for Detecting PedestriansIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Pavan Seshadri
Chaeyeon Han
B. Koo
Noah Posner
S. Guhathakurta
Alexander Lerch
100
3
0
12 Sep 2023
IBAFormer: Intra-batch Attention Transformer for Domain Generalized
  Semantic Segmentation
IBAFormer: Intra-batch Attention Transformer for Domain Generalized Semantic Segmentation
Qiyu Sun
Huilin Chen
Meng Zheng
Ziyan Wu
Michael Felsberg
Yang Tang
256
6
0
12 Sep 2023
Federated Learning for Large-Scale Scene Modeling with Neural Radiance
  Fields
Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields
Teppei Suzuki
AI4CE
296
10
0
12 Sep 2023
Panoptic Vision-Language Feature Fields
Panoptic Vision-Language Feature FieldsIEEE Robotics and Automation Letters (RA-L), 2023
Haoran Chen
Kenneth Blomqvist
Francesco Milano
Roland Siegwart
VLM
218
16
0
11 Sep 2023
Toward a Deeper Understanding: RetNet Viewed through Convolution
Toward a Deeper Understanding: RetNet Viewed through ConvolutionPattern Recognition (Pattern Recogn.), 2023
Chenghao Li
Chaoning Zhang
ViT
195
14
0
11 Sep 2023
PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D
  representations for agricultural robotics
PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural roboticsIEEE Robotics and Automation Letters (RA-L), 2023
Claus Smitt
Michael Halstead
Patrick Zimmer
Thomas Labe
Esra Guclu
C. Stachniss
Chris McCool
150
28
0
11 Sep 2023
Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation
Mask2Anomaly: Mask Transformer for Universal Open-set SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Shyam Nandan Rai
Fabio Cermelli
Barbara Caputo
Carlo Masone
ISegViT
273
15
0
08 Sep 2023
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual GroundingEuropean Conference on Computer Vision (ECCV), 2023
Ozan Unal
Daniel Gehrig
Suman Saha
Luc Van Gool
213
27
0
08 Sep 2023
Video Task Decathlon: Unifying Image and Video Tasks in Autonomous
  Driving
Video Task Decathlon: Unifying Image and Video Tasks in Autonomous DrivingIEEE International Conference on Computer Vision (ICCV), 2023
Thomas E. Huang
Yifan Liu
Luc Van Gool
Fisher Yu
292
9
0
08 Sep 2023
Have We Ever Encountered This Before? Retrieving Out-of-Distribution
  Road Obstacles from Driving Scenes
Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving ScenesIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Youssef Shoeb
Robin Shing Moon Chan
Gesina Schwalbe
Azarm Nowzard
Fatma Guney
Hanno Gottschalk
176
8
0
08 Sep 2023
Previous
123...232425...323334
Next