ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXivPDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,359 papers shown
Title
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation
Daniel A. P. Oliveira
D. Matos
VGen
12
0
0
15 May 2025
MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment
MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of an Urban Environment
Barak Pinkovich
Boaz Matalon
Ehud Rivlin
H. Rotstein
16
0
0
13 May 2025
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
39
0
0
13 May 2025
Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding
Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding
Chih-Chung Hsu
I-Hsuan Wu
Wen-Hai Tseng
Ching-Heng Cheng
Ming-Hsuan Wu
Jin-Hui Jiang
Yu-Jou Hsiao
18
0
0
11 May 2025
UnfoldIR: Rethinking Deep Unfolding Network in Illumination Degradation Image Restoration
UnfoldIR: Rethinking Deep Unfolding Network in Illumination Degradation Image Restoration
Chunming He
Rihan Zhang
Fengyang Xiao
Chengyu Fang
Longxiang Tang
Y. Zhang
Sina Farsiu
21
0
0
10 May 2025
Joint Super-Resolution and Segmentation for 1-m Impervious Surface Area Mapping in China's Yangtze River Economic Belt
Joint Super-Resolution and Segmentation for 1-m Impervious Surface Area Mapping in China's Yangtze River Economic Belt
Jie Deng
Danfeng Hong
Chenyu Li
Naoto Yokoya
38
0
0
08 May 2025
Visual Affordances: Enabling Robots to Understand Object Functionality
Visual Affordances: Enabling Robots to Understand Object Functionality
Tommaso Apicella
Alessio Xompero
Andrea Cavallaro
39
0
0
08 May 2025
Split Matching for Inductive Zero-shot Semantic Segmentation
Split Matching for Inductive Zero-shot Semantic Segmentation
Jialei Chen
Xu Zheng
Dongyue Li
Chong Yi
Seigo Ito
D. Paudel
Luc Van Gool
Hiroshi Murase
Daisuke Deguchi
VLM
50
0
0
08 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Y. Chen
Zhuotao Tian
VLM
38
0
0
07 May 2025
Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?
Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?
Shashank Agnihotri
David Schader
Nico Sharei
Mehmet Ege Kaçar
M. Keuper
36
1
0
07 May 2025
Predicting Road Surface Anomalies by Visual Tracking of a Preceding Vehicle
Predicting Road Surface Anomalies by Visual Tracking of a Preceding Vehicle
Petr Jahoda
Jan Cech
31
0
0
07 May 2025
3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation
3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation
Andrew Caunes
T. Chateau
Vincent Frémont
3DPC
26
0
0
06 May 2025
Panoramic Out-of-Distribution Segmentation
Panoramic Out-of-Distribution Segmentation
Mengfei Duan
Kailun Yang
Y. Zhang
Yihong Cao
Fei Teng
Kai Luo
Jiaming Zhang
Zhiyong Li
Shutao Li
52
0
0
06 May 2025
Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models
Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models
Yankai Jiang
Peng Zhang
D. Yang
Yuan Tian
Hai Lin
X. Wang
MedIm
78
0
0
05 May 2025
Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images
Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR Images
Siddharth Kothari
Srinivasan Murali
Sankalp Kothari
Ujjwal Verma
Jaya Sreevalsan-Nair
42
0
0
03 May 2025
VSC: Visual Search Compositional Text-to-Image Diffusion Model
VSC: Visual Search Compositional Text-to-Image Diffusion Model
Do Huu Dat
Nam Hyeonu
Po Yuan Mao
Tae-Hyun Oh
DiffM
CoGe
57
0
0
02 May 2025
Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing
Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing
Fahong Zhang
Yilei Shi
Xiao Xiang Zhu
38
0
0
02 May 2025
Mcity Data Engine: Iterative Model Improvement Through Open-Vocabulary Data Selection
Mcity Data Engine: Iterative Model Improvement Through Open-Vocabulary Data Selection
Daniel Bogdoll
Rajanikant Ananta
Abeyankar Giridharan
Isabel Moore
Gregory Stevens
Henry X. Liu
VLM
51
0
0
30 Apr 2025
BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation
BARIS: Boundary-Aware Refinement with Environmental Degradation Priors for Robust Underwater Instance Segmentation
Pin-Chi Pan
Soo-Chang Pei
54
0
0
28 Apr 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping
PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping
Feng Chen
Ilias Stogiannidis
Andrew Wood
Danilo Bueno
Dominic Williams
...
Stephen A. Rolfe
Tracy Lawson
Tony Pridmore
M. Giuffrida
Sotirios A. Tsaftaris
62
0
0
28 Apr 2025
Open-set Anomaly Segmentation in Complex Scenarios
Open-set Anomaly Segmentation in Complex Scenarios
Song Xia
Yi Yu
Henghui Ding
Wenhan Yang
S. Liu
Alex C. Kot
Xudong Jiang
DiffM
50
0
0
28 Apr 2025
Learning Streaming Video Representation via Multitask Training
Learning Streaming Video Representation via Multitask Training
Yibin Yan
Jilan Xu
Shangzhe Di
Yikun Liu
Yudi Shi
Qirui Chen
Zeqian Li
Yifei Huang
Weidi Xie
CLL
82
0
0
28 Apr 2025
CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis
CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis
Alexander Baumann
Leonardo Ayala
S.
Jan Sellner
Alexander Studier-Fischer
Berkin Özdemir
Lena Maier-Hein
Slobodan Ilic
51
0
0
27 Apr 2025
What is the Added Value of UDA in the VFM Era?
What is the Added Value of UDA in the VFM Era?
B. B. Englert
Tommie Kerssies
Gijs Dubbelman
37
0
0
25 Apr 2025
DreamO: A Unified Framework for Image Customization
DreamO: A Unified Framework for Image Customization
Chong Mou
Yanze Wu
Wenxu Wu
Zinan Guo
Pengze Zhang
...
Shaojin Wu
S. Zhao
Jian Andrew Zhang
Qian He
Xinglong Wu
44
0
0
23 Apr 2025
Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks
Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks
Murat Bilgehan Ertan
Ronak Sahu
Phuong Ha Nguyen
Kaleel Mahmood
Marten van Dijk
27
0
0
23 Apr 2025
EmoSEM: Segment and Explain Emotion Stimuli in Visual Art
EmoSEM: Segment and Explain Emotion Stimuli in Visual Art
Jing Zhang
Dan Guo
Zhangbin Li
Meng Wang
31
0
0
20 Apr 2025
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Haiwen Huang
Anpei Chen
Volodymyr Havrylov
Andreas Geiger
Dan Zhang
27
1
0
18 Apr 2025
Fighting Fires from Space: Leveraging Vision Transformers for Enhanced Wildfire Detection and Characterization
Fighting Fires from Space: Leveraging Vision Transformers for Enhanced Wildfire Detection and Characterization
Aman Agarwal
James Gearon
Raksha Rank
Etienne Chenevert
26
0
0
18 Apr 2025
Occlusion-Ordered Semantic Instance Segmentation
Occlusion-Ordered Semantic Instance Segmentation
Soroosh Baselizadeh
Cheuk-To Yu
O. Veksler
Yuri Boykov
ISeg
3DV
54
0
0
18 Apr 2025
Multiscale Tensor Summation Factorization as a New Neural Network Layer (MTS Layer) for Multidimensional Data Processing
Multiscale Tensor Summation Factorization as a New Neural Network Layer (MTS Layer) for Multidimensional Data Processing
Mehmet Yamaç
Muhammad Numan Yousaf
S. Kiranyaz
M. Gabbouj
24
1
0
17 Apr 2025
Towards Learning to Complete Anything in Lidar
Towards Learning to Complete Anything in Lidar
Ayca Takmaz
Cristiano Saltori
Neehar Peri
Tim Meinhardt
Riccardo de Lutio
Laura Leal-Taixé
Aljosa Osep
3DV
VLM
46
0
0
16 Apr 2025
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos
J. Xu
Y. Huang
Baoqi Pei
Junlin Hou
Qingqiu Li
Guo Chen
Y. Zhang
Rui Feng
Weidi Xie
DiffM
46
0
0
16 Apr 2025
A Complex-valued SAR Foundation Model Based on Physically Inspired Representation Learning
A Complex-valued SAR Foundation Model Based on Physically Inspired Representation Learning
M. D. Wang
Hanbo Bi
Yingchao Feng
Linlin Xin
Shuo Gong
Tianqi Wang
Zhiyuan Yan
Peijin Wang
Wenhui Diao
Xian Sun
29
0
0
16 Apr 2025
A comprehensive review of remote sensing in wetland classification and mapping
A comprehensive review of remote sensing in wetland classification and mapping
Shuai Yuan
Xiangan Liang
Tianwu Lin
Shuang Chen
Rui Liu
Jie Wang
H. M. Zhang
Peng Gong
29
0
0
15 Apr 2025
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
Yasser Benigmim
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Raoul de Charette
VLM
32
0
0
14 Apr 2025
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Tao Zhang
X. Li
Zilong Huang
Y. Li
Weixian Lei
XueQing Deng
Shihao Chen
S. Ji
Jiashi Feng
MLLM
LRM
56
1
0
14 Apr 2025
TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting
TextSplat: Text-Guided Semantic Fusion for Generalizable Gaussian Splatting
Zhicong Wu
Hongbin Xu
Gang Xu
Ping Nie
Zhixin Yan
Jinkai Zheng
Liangqiong Qu
Ming Li
Liqiang Nie
3DGS
29
0
0
13 Apr 2025
Uncertainty Guided Refinement for Fine-Grained Salient Object Detection
Uncertainty Guided Refinement for Fine-Grained Salient Object Detection
Yao Yuan
Pan Gao
Qun Dai
Jie Qin
Wei Xiang
30
0
0
13 Apr 2025
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model
Kaiyu Li
Zepeng Xin
Li Pang
Chao Pang
Yupeng Deng
Jing Yao
Guisong Xia
Deyu Meng
Zhi Wang
Xiangyong Cao
VLM
LRM
37
0
0
13 Apr 2025
Hypergraph Vision Transformers: Images are More than Nodes, More than Edges
Hypergraph Vision Transformers: Images are More than Nodes, More than Edges
Joshua Fixelle
ViT
27
0
0
11 Apr 2025
FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents
FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents
Xin Tan
Yuzhou Ji
He Zhu
Yuan Xie
3DGS
34
0
0
11 Apr 2025
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Tommaso Galliena
Tommaso Apicella
Stefano Rosa
Pietro Morerio
Alessio Del Bue
Lorenzo Natale
32
0
0
11 Apr 2025
ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings
ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings
Astitva Srivastava
Harrison Jesse Smith
Thu Nguyen-Phuoc
Yuting Ye
31
0
0
10 Apr 2025
DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction
DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction
Xu Zhao
Pengju Zhang
Bo Liu
Yihong Wu
41
0
0
10 Apr 2025
GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes
GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes
S. Back
J. Lee
Kangmin Kim
Heeseon Rho
Geonhyup Lee
...
S. Lee
Sangjun Noh
Youngjin Lee
Taeyeop Lee
K. Lee
3DV
41
0
0
09 Apr 2025
Domain Generalization through Attenuation of Domain-Specific Information
Domain Generalization through Attenuation of Domain-Specific Information
Reiji Saito
Kazuhiro Hotta
24
0
0
09 Apr 2025
Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging
Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging
Siyuan Dai
Kai Ye
Guodong Liu
Haoteng Tang
Liang Zhan
MedIm
24
0
0
09 Apr 2025
Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation
Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation
Enming Zhang
Z. Li
Yanru Wu
J. Wang
Yang Tan
Ruizhe Zhao
Guan Wang
Yang Li
ViT
31
0
0
08 Apr 2025
1234...262728
Next