Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
243
3
0
28 Nov 2024
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior
Li-Yuan Tsao
Hao-Wei Chen
Hao-Wei Chung
Deqing Sun
Chun-Yi Lee
Kelvin Chan
Ming-Hsuan Yang
DiffM
201
7
0
27 Nov 2024
Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions
Jingdong Zhang
Hanrong Ye
Xin Li
Wenping Wang
Dan Xu
326
2
0
27 Nov 2024
Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation
Sudarshan Rajagopalan
Vishal M. Patel
145
0
0
26 Nov 2024
HyperSeg: Towards Universal Visual Segmentation with Large Language Model
Cong Wei
Yujie Zhong
Haoxian Tan
Yong Liu
Zheng Zhao
Jie Hu
Yujiu Yang
VOS, MLLM, VLM, LRM
254
17
0
26 Nov 2024
Self-supervised Video Instance Segmentation Can Boost Geographic Entity Alignment in Historical Maps
Xue Xia
Randall Balestriero
Tao Zhang
L. Hurni
VOS, AI4TS
176
0
0
26 Nov 2024
VideoOrion: Tokenizing Object Dynamics in Videos
Yicheng Feng
Yijiang Li
Wanpeng Zhang
Sipeng Zheng
Zongqing Lu
362
7
0
25 Nov 2024
A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation
M. Valiuddin
R. V. Sloun
C.G.A. Viviers
Peter H. N. de With
Fons van der Sommen
UQCV
990
1
0
25 Nov 2024
SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models
Harsh Goel
Sai Shankar Narasimhan
Oguzhan Akcin
Sandeep Chinchali
DiffM
325
2
0
25 Nov 2024
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs
Chen Xin
T. Motz
Andreas Hartel
Enkelejda Kasneci
310
1
0
23 Nov 2024
There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks
Miguel Espinosa
Chenhongyi Yang
Linus Ericsson
Jingyu Sun
Elliot J. Crowley
VLM
253
3
0
22 Nov 2024
DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines
BigData Congress [Services Society] (BSS), 2024
Mizanur Rahman Jewel
Mohamed Elmahallawy
S. Madria
Samuel Frimpong
222
5
0
20 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Ziyi Wang
Yijiao Wang
Xumin Yu
Jie Zhou
Jiwen Lu
189
3
0
20 Nov 2024
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2024
Xuechao Zou
Shun Zhang
Kai Li
Shiying Wang
Junliang Xing
Lei Jin
Congyan Lang
Pin Tao
230
4
0
20 Nov 2024
Chanel-Orderer: A Channel-Ordering Predictor for Tri-Channel Natural Images
Shen Li
Lei Jiang
Wei Wang
Hongwei Hu
Liang Li
279
0
0
20 Nov 2024
MGNiceNet: Unified Monocular Geometric Scene Understanding
Asian Conference on Computer Vision (ACCV), 2024
Markus Schön
Michael Buchholz
Klaus C. J. Dietmayer
3DPC
524
0
0
18 Nov 2024
MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data
Chika Maduabuchi
Ericmoore Jossou
Matteo Bucci
344
0
0
12 Nov 2024
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps
Xue Xia
Daiwei Zhang
Wenxuan Song
Wei Huang
L. Hurni
AI4TS, VLM
145
7
0
11 Nov 2024
Watermark Anything with Localized Messages
International Conference on Learning Representations (ICLR), 2024
Tom Sander
Pierre Fernandez
Alain Durmus
Teddy Furon
Matthijs Douze
VLM
390
31
0
11 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
Neural Information Processing Systems (NeurIPS), 2024
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
248
9
0
08 Nov 2024
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
Neural Information Processing Systems (NeurIPS), 2024
Zhitong Gao
Bingnan Li
Mathieu Salzmann
Xuming He
OOD, VLM
337
5
0
06 Nov 2024
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Jinchao Ge
Bowen Zhang
Akide Liu
Minh Hieu Phan
Qi Chen
Yangyang Shu
Yang Zhao
VLM, CLL
204
0
0
05 Nov 2024
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective
Neural Information Processing Systems (NeurIPS), 2024
Qishuai Wen
Chun-Guang Li
ViT
443
0
0
05 Nov 2024
GenXD: Generating Any 3D and 4D Scenes
International Conference on Learning Representations (ICLR), 2024
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Zhiyong Yang
Jianfeng Wang
G. Lee
Lijuan Wang
VGen
354
39
0
04 Nov 2024
Event-guided Low-light Video Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhen Yao
Mooi Choo Choo Chuah
187
12
0
01 Nov 2024
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing
IEEE Access (IEEE Access), 2024
Naufal Suryanto
Andro Aprila Adiputra
Ahmada Yusril Kadiptya
Thi-Thu-Huong Le
Derry Pratama
Yongsu Kim
Howon Kim
DiffM
265
4
0
01 Nov 2024
ZIM: Zero-Shot Image Matting for Anything
Beomyoung Kim
Chanyong Shin
Joonhyun Jeong
Hyungsik Jung
Se Yun Lee
Sewhan Chun
Dong-Hyun Hwang
Joonsang Yu
VLM
278
7
0
01 Nov 2024
OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction
Neural Information Processing Systems (NeurIPS), 2024
Hongbo Zhao
Lue Fan
Yuntao Chen
Haochen Wang
Yiran Yang
Xiaojuan Jin
Yixin Zhang
Gaofeng Meng
Rundong Wang
221
9
0
30 Oct 2024
Unlocking Comics: The AI4VA Dataset for Visual Understanding
Peter Grönquist
Deblina Bhattacharjee
Bahar Aydemir
Baran Ozaydin
Tong Zhang
Mathieu Salzmann
Sabine Süsstrunk
115
1
0
27 Oct 2024
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
Neural Information Processing Systems (NeurIPS), 2024
Rajat Modi
Vibhav Vineet
Yogesh S Rawat
258
2
0
25 Oct 2024
SegLLM: Multi-round Reasoning Segmentation
XuDong Wang
Shaolun Zhang
Shufan Li
Konstantinos Kallidromitis
Kehan Li
Yusuke Kato
Kazuki Kozuka
Trevor Darrell
VLM, LRM
239
11
0
24 Oct 2024
Is Smoothness the Key to Robustness? A Comparison of Attention and Convolution Models Using a Novel Metric
Baiyuan Chen
MLT
260
0
0
23 Oct 2024
PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting
IEEE Transactions on Image Processing (TIP), 2024
Yu Wang
Xiaobao Wei
Ming Lu
Guoliang Kang
3DGS
245
10
0
23 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
201
1
0
23 Oct 2024
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model
Neural Information Processing Systems (NeurIPS), 2024
Jingjing Jiang
Xianghong Li
Tao Xiang
Jifeng Dai
ISeg
200
7
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
332
11
0
22 Oct 2024
Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation
Ruting Chi
Zhiyi Huang
Yuexing Han
ISeg
221
0
0
21 Oct 2024
Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment
International Conference on Learning Representations (ICLR), 2024
Yankai Jiang
Wenhui Lei
Xiaofan Zhang
Shanghang Zhang
MedIm
358
5
0
21 Oct 2024
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Computer Vision and Pattern Recognition (CVPR), 2024
Ziming Huang
Xurui Li
Haotian Liu
Feng Xue
Yuzhe Wang
Yu Zhou
314
5
0
18 Oct 2024
DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering
Neural Information Processing Systems (NeurIPS), 2024
Jiahao Lu
Jiacheng Deng
Ruijie Zhu
Yanzhe Liang
Wenfei Yang
Tianzhu Zhang
Xu Zhou
3DGS
302
22
0
17 Oct 2024
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Shrishti Saha Shetu
Emanuël A. P. Habets
Andreas Brendel
140
6
0
17 Oct 2024
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Changcheng Xiao
Qiong Cao
Yujie Zhong
Xiang Zhang
Tao Wang
Canqun Yang
L. Lan
170
3
0
17 Oct 2024
Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation
International Conference on Pattern Recognition (ICPR), 2024
Wenbo Xu
Yanan Wu
Haoran Jiang
Yang Wang
Qiang Wu
Jian Zhang
CLL, VLM
174
1
0
16 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Neural Information Processing Systems (NeurIPS), 2024
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM, CLIP
357
5
0
16 Oct 2024
Order-aware Interactive Segmentation
International Conference on Learning Representations (ICLR), 2024
Bin Wang
Anwesa Choudhuri
Meng Zheng
Zhongpai Gao
Benjamin Planche
Andong Deng
Qin Liu
Terrence Chen
Ulas Bagci
Ziyan Wu
VLM
892
2
0
16 Oct 2024
OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation
Dongjun Hwang
Yejin Kim
Junsuk Choe
Seong Joon Oh
VLM
664
0
0
15 Oct 2024
AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure
Yu Zhang
Kefeng Zheng
Fei Liu
Qingfu Zhang
Zhenkun Wang
203
8
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
207
1
0
14 Oct 2024
MagicEraser: Erasing Any Objects via Semantics-Aware Control
European Conference on Computer Vision (ECCV), 2024
Fan Li
Zixiao Zhang
Yi Huang
Jianzhuang Liu
Renjing Pei
Bin Shao
Songcen Xu
DiffM
186
12
0
14 Oct 2024
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation
Zhiyun Song
Yue Zhao
Xiaomin Li
Manman Fei
Xiangyu Zhao
...
Chung-Hsing Yeh
Qian Wang
Guoyan Zheng
Songtao Ai
Lichi Zhang
252
2
0
14 Oct 2024