ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
A. Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXivPDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,359 papers shown
Title
big.LITTLE Vision Transformer for Efficient Visual Recognition
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
50
0
0
14 Oct 2024
MagicEraser: Erasing Any Objects via Semantics-Aware Control
MagicEraser: Erasing Any Objects via Semantics-Aware Control
Fan Li
Zixiao Zhang
Yi Huang
Jianzhuang Liu
Renjing Pei
Bin Shao
Songcen Xu
DiffM
38
6
0
14 Oct 2024
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for
  Resource-Efficient 3D MRI Segmentation
REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation
Zhiyun Song
Y. Zhao
Xiaomin Li
Manman Fei
Xiangyu Zhao
...
Chung-Hsing Yeh
Qian Wang
Guoyan Zheng
Songtao Ai
Lichi Zhang
27
1
0
14 Oct 2024
When Attention Sink Emerges in Language Models: An Empirical View
When Attention Sink Emerges in Language Models: An Empirical View
Xiangming Gu
Tianyu Pang
Chao Du
Qian Liu
Fengzhuo Zhang
Cunxiao Du
Ye Wang
Min-Bin Lin
RALM
29
0
0
14 Oct 2024
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Jianqi Chen
Panwen Hu
Xiaojun Chang
Z. Shi
Michael C. Kampffmeyer
Xiaodan Liang
46
5
0
14 Oct 2024
AFlow: Automating Agentic Workflow Generation
AFlow: Automating Agentic Workflow Generation
Jiayi Zhang
Jinyu Xiang
Zhaoyang Yu
Fengwei Teng
Xionghui Chen
...
Jinlin Wang
Bingnan Zheng
Bang Liu
Yuyu Luo
Chenglin Wu
AIFin
AI4CE
23
30
0
14 Oct 2024
UnSeg: One Universal Unlearnable Example Generator is Enough against All
  Image Segmentation
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation
Ye Sun
Hao Zhang
Tiehua Zhang
Xingjun Ma
Yu-Gang Jiang
VLM
32
3
0
13 Oct 2024
DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
Siyi Jiao
Wenzheng Zeng
Changxin Gao
Nong Sang
28
1
0
13 Oct 2024
Context-Aware Full Body Anonymization using Text-to-Image Diffusion
  Models
Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models
Pascal Zwick
Kevin Roesch
Marvin Klemp
Oliver Bringmann
DiffM
17
1
0
11 Oct 2024
UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction
  from Underwater Multi-view Monocular Images
UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction from Underwater Multi-view Monocular Images
Zeyu Chen
Jingyi Tang
Gu Wang
Shengquan Li
Xinghui Li
Xiangyang Ji
Xiu Li
23
0
0
10 Oct 2024
Shift and matching queries for video semantic segmentation
Shift and matching queries for video semantic segmentation
Tsubasa Mizuno
Toru Tamaki
25
0
0
10 Oct 2024
Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for
  Efficient Banana Plantation Segmentation in UAV Imagery
Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for Efficient Banana Plantation Segmentation in UAV Imagery
Ang He
Ximei Wu
Xing Xu
Jing Chen
Xiaobin Guo
Sheng Xu
15
0
0
09 Oct 2024
OrionNav: Online Planning for Robot Autonomy with Context-Aware LLM and
  Open-Vocabulary Semantic Scene Graphs
OrionNav: Online Planning for Robot Autonomy with Context-Aware LLM and Open-Vocabulary Semantic Scene Graphs
Venkata Naren Devarakonda
Raktim Gautam Goswami
Ali Umut Kaypak
Naman Patel
Rooholla Khorrambakht
P. Krishnamurthy
Farshad Khorrami
LM&Ro
30
3
0
08 Oct 2024
A Simple Image Segmentation Framework via In-Context Examples
A Simple Image Segmentation Framework via In-Context Examples
Yang Liu
Chenchen Jing
Hengtao Li
Muzhi Zhu
Hao Chen
Xinlong Wang
Chunhua Shen
33
6
0
07 Oct 2024
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for
  3D Scene Understanding
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding
Shenghao Li
27
1
0
06 Oct 2024
ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and
  Object Tracking in Dynamic Construction Environments
ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments
Lorenzo Terenzi
Julian Nubert
Pol Eyschen
Pascal Roth
Simin Fei
E. Jelavic
Marco Hutter
26
0
0
05 Oct 2024
ControlAR: Controllable Image Generation with Autoregressive Models
ControlAR: Controllable Image Generation with Autoregressive Models
Zongming Li
Tianheng Cheng
Shoufa Chen
Peize Sun
Haocheng Shen
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
DiffM
132
14
0
03 Oct 2024
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving
Zhangshuo Qi
Junyi Ma
Jingyi Xu
Zijie Zhou
Luqi Cheng
Guangming Xiong
32
3
0
01 Oct 2024
AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation
AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation
Boyu Han
Qianqian Xu
Zhiyong Yang
Shilong Bao
Peisong Wen
Yangbangyan Jiang
Qingming Huang
26
2
0
30 Sep 2024
Match Stereo Videos via Bidirectional Alignment
Match Stereo Videos via Bidirectional Alignment
Junpeng Jing
Ye Mao
Anlan Qiu
K. Mikolajczyk
VGen
24
2
0
30 Sep 2024
Segmenting Wood Rot using Computer Vision Models
Segmenting Wood Rot using Computer Vision Models
Roland Kammerbauer
Thomas H. Schmitt
Tobias Bocklet
21
1
0
30 Sep 2024
Universal Medical Image Representation Learning with Compositional
  Decoders
Universal Medical Image Representation Learning with Compositional Decoders
Kaini Wang
Ling Yang
Siping Zhou
Guangquan Zhou
Wentao Zhang
Bin Cui
Shuo Li
SSL
MedIm
31
0
0
30 Sep 2024
Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation
Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation
Raphael Hagmanns
Peter Mortimer
Miguel Granero
T. Luettel
J. Petereit
21
0
0
27 Sep 2024
EfficientCrackNet: A Lightweight Model for Crack Segmentation
EfficientCrackNet: A Lightweight Model for Crack Segmentation
Abid Hasan Zim
Aquib Iqbal
Zaid Al-Huda
Asad U. Malik
Minoru Kuribayash
21
1
0
26 Sep 2024
AgMTR: Agent Mining Transformer for Few-shot Segmentation in Remote
  Sensing
AgMTR: Agent Mining Transformer for Few-shot Segmentation in Remote Sensing
Hanbo Bi
Yingchao Feng
Yongqiang Mao
Jianning Pei
Wenhui Diao
Hongqi Wang
Xian Sun
ViT
21
4
0
26 Sep 2024
VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection
VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection
Liangyu Zhong
Joachim Sicking
Fabian Hüger
Hanno Gottschalk
VLM
28
0
0
25 Sep 2024
First Place Solution to the ECCV 2024 BRAVO Challenge: Evaluating
  Robustness of Vision Foundation Models for Semantic Segmentation
First Place Solution to the ECCV 2024 BRAVO Challenge: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation
Tommie Kerssies
Daan de Geus
Gijs Dubbelman
59
2
0
25 Sep 2024
EventHDR: from Event to High-Speed HDR Videos and Beyond
EventHDR: from Event to High-Speed HDR Videos and Beyond
Yunhao Zou
Ying Fu
Tsuyoshi Takatani
Yinqiang Zheng
39
4
0
25 Sep 2024
Adapting Segment Anything Model for Unseen Object Instance Segmentation
Adapting Segment Anything Model for Unseen Object Instance Segmentation
Rui Cao
Chuanxin Song
Biqi Yang
Jiangliu Wang
Pheng-Ann Heng
Yun-Hui Liu
VLM
22
1
0
23 Sep 2024
The BRAVO Semantic Segmentation Challenge Results in UNCV2024
The BRAVO Semantic Segmentation Challenge Results in UNCV2024
Tuan-Hung Vu
Eduardo Valle
Andrei Bursuc
Tommie Kerssies
Daan de Geus
...
Michael J. Smith
F. Ferrie
Shamik Basu
Christos Sakaridis
Luc Van Gool
UQCV
VLM
28
3
0
23 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
52
10
0
23 Sep 2024
Combining Absolute and Semi-Generalized Relative Poses for Visual
  Localization
Combining Absolute and Semi-Generalized Relative Poses for Visual Localization
Vojtech Panek
Torsten Sattler
Zuzana Kukelova
29
0
0
21 Sep 2024
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive
  Technology
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology
Xin Jiang
Junwei Zheng
Ruiping Liu
Jiahang Li
Jiaming Zhang
Sven Matthiesen
Rainer Stiefelhagen
VLM
21
0
0
21 Sep 2024
MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors
MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors
Zhenhua Du
Binbin Xu
Haoyu Zhang
K. Huo
Shuaifeng Zhi
18
0
0
21 Sep 2024
A Bottom-Up Approach to Class-Agnostic Image Segmentation
A Bottom-Up Approach to Class-Agnostic Image Segmentation
Sebastian Dille
Ari Blondal
Sylvain Paris
Yağız Aksoy
11
0
0
20 Sep 2024
RingMo-Aerial: An Aerial Remote Sensing Foundation Model With A Affine Transformation Contrastive Learning
RingMo-Aerial: An Aerial Remote Sensing Foundation Model With A Affine Transformation Contrastive Learning
Wenhui Diao
Haichen Yu
Kaiyue Kang
Tong Ling
Di Liu
...
Hanbo Bi
Libo Ren
Xuexue Li
Yongqiang Mao
Xian Sun
29
1
0
20 Sep 2024
Towards Robust Automation of Surgical Systems via Digital Twin-based
  Scene Representations from Foundation Models
Towards Robust Automation of Surgical Systems via Digital Twin-based Scene Representations from Foundation Models
Hao Ding
Lalithkumar Seenivasan
Hongchao Shu
Grayson Byrd
Han Zhang
Pu Xiao
Juan Antonio Barragan
Russell H. Taylor
Peter Kazanzides
Mathias Unberath
32
5
0
19 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
83
0
0
19 Sep 2024
Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic
  Segmentation in Autonomous Driving
Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic Segmentation in Autonomous Driving
Yuting Hong
Hui Xiao
Huazheng Hao
Xiaojie Qiu
Baochen Yao
Chengbin Peng
29
0
0
19 Sep 2024
How to predict on-road air pollution based on street view images and
  machine learning: a quantitative analysis of the optimal strategy
How to predict on-road air pollution based on street view images and machine learning: a quantitative analysis of the optimal strategy
Hui Zhong
Di Chen
Pengqin Wang
Wenrui Wang
Shaojie Shen
Yonghong Liu
Meixin Zhu
25
0
0
19 Sep 2024
Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving
Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving
Sándor Kunsági-Máté
Levente Peto
Lehel Seres
Tamás Matuszka
3DPC
17
2
0
19 Sep 2024
Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments
Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments
Gang Chen
Zhaoying Wang
Wei Dong
Javier Alonso-Mora
86
0
0
18 Sep 2024
Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation using Rein to
  Fine-tune Vision Foundation Models
Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation using Rein to Fine-tune Vision Foundation Models
Pengzhou Cai
Xueyuan Zhang
Libin Lan
Ze Zhao
23
0
0
18 Sep 2024
Robot Manipulation in Salient Vision through Referring Image
  Segmentation and Geometric Constraints
Robot Manipulation in Salient Vision through Referring Image Segmentation and Geometric Constraints
Chen Jiang
Allie Luo
Martin Jägersand
15
0
0
17 Sep 2024
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road
  Topology Problem
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem
M. E. Kalfaoglu
H. Öztürk
Ozsel Kilinc
A. Temi̇zel
3DPC
32
2
0
17 Sep 2024
Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot
  Segmentation
Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation
Hanbo Bi
Yingchao Feng
Wenhui Diao
Peijin Wang
Yongqiang Mao
Kun Fu
Hongqi Wang
Xian Sun
VLM
32
3
0
16 Sep 2024
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Qilong Zhangli
Di Liu
Abhishek Aich
Dimitris Metaxas
S. Schulter
31
0
0
15 Sep 2024
Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation
Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation
Hugo Porta
Emanuele Dalsasso
Diego Marcos
D. Tuia
93
0
0
14 Sep 2024
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Wenhao Xu
Changwei Wang
Xuxiang Feng
Rongtao Xu
Longzhao Huang
Zherui Zhang
Li Guo
Shibiao Xu
VLM
34
2
0
13 Sep 2024
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic
  Narrative Grounding
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Hongyu Li
Tianrui Hui
Zihan Ding
Jing Zhang
Bin Ma
Xiaoming Wei
Jizhong Han
Si Liu
DiffM
40
1
0
12 Sep 2024
Previous
123...567...262728
Next