ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.01527
  4. Cited By
Masked-attention Mask Transformer for Universal Image Segmentation
v1v2v3 (latest)

Masked-attention Mask Transformer for Universal Image Segmentation

2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
    ISeg
ArXiv (abs)PDFHTML

Papers citing "Masked-attention Mask Transformer for Universal Image Segmentation"

50 / 1,661 papers shown
Title
Guided Distillation for Semi-Supervised Instance Segmentation
Guided Distillation for Semi-Supervised Instance SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Tariq Berrada
Camille Couprie
Alahari Karteek
Jakob Verbeek
198
19
0
03 Aug 2023
ReIDTrack: Multi-Object Track and Segmentation Without Motion
ReIDTrack: Multi-Object Track and Segmentation Without Motion
Kaer Huang
Bingchuan Sun
F. Chen
Tao Zhang
Jun Xie
Jian Li
Christopher W Twombly
Zhepeng Wang
VOT
210
3
0
03 Aug 2023
Dynamic Token Pruning in Plain Vision Transformers for Semantic
  Segmentation
Dynamic Token Pruning in Plain Vision Transformers for Semantic SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Quan Tang
Bowen Zhang
Jiajun Liu
Fagui Liu
Yifan Liu
ViT
288
52
0
02 Aug 2023
Synthetic Instance Segmentation from Semantic Image Segmentation Masks
Synthetic Instance Segmentation from Semantic Image Segmentation MasksKnowledge-Based Systems (KBS), 2023
Yuchen Shen
Dong Zhang
Yuhui Zheng
Zechao Li
L. Fu
Qiaolin Ye
ISeg
160
0
0
02 Aug 2023
LISA: Reasoning Segmentation via Large Language Model
LISA: Reasoning Segmentation via Large Language ModelComputer Vision and Pattern Recognition (CVPR), 2023
Xin Lai
Zhuotao Tian
Yukang Chen
Yanwei Li
Yuhui Yuan
Shu Liu
Jiaya Jia
LM&RoVLMMLLMLRM
449
704
0
01 Aug 2023
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning
  in End-to-End Autonomous Driving
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous DrivingIEEE International Conference on Computer Vision (ICCV), 2023
Xiaosong Jia
Yulu Gao
Li Chen
Junchi Yan
Patrick Langechuan Liu
Hongyang Li
260
113
0
01 Aug 2023
Partitioned Saliency Ranking with Dense Pyramid Transformers
Partitioned Saliency Ranking with Dense Pyramid TransformersACM Multimedia (ACM MM), 2023
Chengxiao Sun
Yan Xu
Jialun Pei
Haopeng Fang
He Tang
ViT
152
6
0
01 Aug 2023
Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics
Audio-Visual Segmentation by Exploring Cross-Modal Mutual SemanticsACM Multimedia (ACM MM), 2023
Chen Liu
Peike Li
Xingqun Qi
Hu Zhang
Lincheng Li
Dadong Wang
Xin Yu
VOS
211
43
0
31 Jul 2023
Transferable Attack for Semantic Segmentation
Transferable Attack for Semantic Segmentation
Mengqi He
Jing Zhang
Zhaoyuan Yang
Mingyi He
Nick Barnes
Yuchao Dai
193
2
0
31 Jul 2023
Towards Deeply Unified Depth-aware Panoptic Segmentation with
  Bi-directional Guidance Learning
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance LearningIEEE International Conference on Computer Vision (ICCV), 2023
Ju He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Jinpeng Lan
Bin Luo
Yifeng Geng
Xuansong Xie
MDE
214
11
0
27 Jul 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision
  Review
When Multi-Task Learning Meets Partial Supervision: A Computer Vision ReviewProceedings of the IEEE (Proc. IEEE), 2023
Maxime Fontana
Michael W. Spratling
Miaojing Shi
224
9
0
25 Jul 2023
Unmasking Anomalies in Road-Scene Segmentation
Unmasking Anomalies in Road-Scene SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Shyam Nandan Rai
Fabio Cermelli
Dario Fontanel
Carlo Masone
Barbara Caputo
ISeg
185
56
0
25 Jul 2023
CTVIS: Consistent Training for Online Video Instance Segmentation
CTVIS: Consistent Training for Online Video Instance SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Kaining Ying
Qing Zhong
Wei Mao
Zhenhua Wang
Hao Chen
Lin Yuanbo Wu
Yifan Liu
Chengxiang Fan
Yunzhi Zhuge
Chunhua Shen
307
61
0
24 Jul 2023
GEM: Boost Simple Network for Glass Surface Segmentation via Vision
  Foundation Models
GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation ModelsIEEE transactions on multimedia (IEEE TMM), 2023
Jing Hao
Xinyu Li
Liang Gao
Shumin Han
VLMDiffM
246
4
0
22 Jul 2023
An Intelligent Remote Sensing Image Quality Inspection System
An Intelligent Remote Sensing Image Quality Inspection SystemIET Image Processing (IIP), 2023
Yi Yu
Tao Wang
Kang Ran
Changjiang Li
Hao Wu
183
1
0
22 Jul 2023
Cascade-DETR: Delving into High-Quality Universal Object Detection
Cascade-DETR: Delving into High-Quality Universal Object DetectionIEEE International Conference on Computer Vision (ICCV), 2023
Mingqiao Ye
Lei Ke
Siyuan Li
Yu-Wing Tai
Chi-Keung Tang
Martin Danelljan
Feng Yu
255
51
0
20 Jul 2023
Gradient-Semantic Compensation for Incremental Semantic Segmentation
Gradient-Semantic Compensation for Incremental Semantic SegmentationIEEE transactions on multimedia (IEEE TMM), 2023
Wei Cong
Yang Cong
Jiahua Dong
Gan Sun
Henghui Ding
CLLVLM
293
21
0
20 Jul 2023
Impact of Disentanglement on Pruning Neural Networks
Impact of Disentanglement on Pruning Neural Networks
Carl Shneider
Peyman Rostami
Anis Kacem
Nilotpal Sinha
Abd El Rahman Shabayek
Djamila Aouada
165
0
0
19 Jul 2023
RepViT: Revisiting Mobile CNN From ViT Perspective
RepViT: Revisiting Mobile CNN From ViT PerspectiveComputer Vision and Pattern Recognition (CVPR), 2023
Ao Wang
Hui Chen
Zijia Lin
Hengjun Pu
Guiguang Ding
358
418
0
18 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present,
  and Future
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and FutureIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Chaoyang Zhu
Long Chen
ObjDVLM
459
64
0
18 Jul 2023
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation
Pair then Relation: Pair-Net for Panoptic Scene Graph GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jinghao Wang
Zhengyu Wen
Xiangtai Li
Zujin Guo
Jingkang Yang
Ziwei Liu
189
25
0
17 Jul 2023
Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient
  Network
Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient NetworkInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
K. Yan
Xiaoli Yin
Yingda Xia
Fakai Wang
Shu Wang
...
Xiaoyu Bai
Jingren Zhou
Ling Zhang
Le Lu
Yu Shi
MedIm
254
9
0
17 Jul 2023
Unified Open-Vocabulary Dense Visual Prediction
Unified Open-Vocabulary Dense Visual PredictionIEEE transactions on multimedia (IEEE TMM), 2023
Hengcan Shi
Munawar Hayat
Jianfei Cai
ObjDVLM
171
53
0
17 Jul 2023
On Point Affiliation in Feature Upsampling
On Point Affiliation in Feature Upsampling
Wenze Liu
Hao Lu
Yuliang Liu
Zhiguo Cao
3DPC
122
2
0
17 Jul 2023
CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance
  Segmentation
CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance SegmentationIEEE Transactions on Image Processing (IEEE TIP), 2023
Jialun Pei
Tao Jiang
He Tang
Nian Liu
Yueming Jin
Deng-Ping Fan
Pheng-Ann Heng
ISeg
218
19
0
16 Jul 2023
Multi-Object Discovery by Low-Dimensional Object Motion
Multi-Object Discovery by Low-Dimensional Object MotionIEEE International Conference on Computer Vision (ICCV), 2023
Sadra Safadoust
Fatma Guney
OCL
248
15
0
16 Jul 2023
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation
Mennatullah Siam
R. Karim
Henghui Zhao
Richard P. Wildes
VOS
219
3
0
15 Jul 2023
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel
  Segmentation
RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel SegmentationNeural Information Processing Systems (NeurIPS), 2023
MD Wahiduzzaman Khan
Hong Sheng
Hu Zhang
Heming Du
Sen Wang
...
Jack Phu
A. Agar
Zichen Huang
M. Golzan
Xin Yu
141
9
0
13 Jul 2023
WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks
  for Autonomous Driving on Water Surfaces
WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces
Shanliang Yao
Runwei Guan
Zhaodong Wu
Yi Ni
Zile Huang
...
H. Seo
Ka Lok Man
Jieming Ma
Xiaohui Zhu
Yutao Yue
243
63
0
13 Jul 2023
Learning Hierarchical Interactive Multi-Object Search for Mobile
  Manipulation
Learning Hierarchical Interactive Multi-Object Search for Mobile ManipulationIEEE Robotics and Automation Letters (RA-L), 2023
F. Schmalstieg
Daniel Honerkamp
Tim Welschehold
Abhinav Valada
385
25
0
12 Jul 2023
Machine Learning for Autonomous Vehicle's Trajectory Prediction: A
  comprehensive survey, Challenges, and Future Research Directions
Machine Learning for Autonomous Vehicle's Trajectory Prediction: A comprehensive survey, Challenges, and Future Research DirectionsVehicular Communications (Veh. Commun.), 2023
Vibha Bharilya
Neetesh Kumar
243
96
0
12 Jul 2023
Objaverse-XL: A Universe of 10M+ 3D Objects
Objaverse-XL: A Universe of 10M+ 3D ObjectsNeural Information Processing Systems (NeurIPS), 2023
Matt Deitke
Ruoshi Liu
Matthew Wallingford
Huong Ngo
Oscar Michel
...
Carl Vondrick
Georgia Gkioxari
Kiana Ehsani
Ludwig Schmidt
Ali Farhadi
248
626
0
11 Jul 2023
Test-Time Training on Video Streams
Test-Time Training on Video Streams
Renhao Wang
Yu Sun
Yossi Gandelsman
Xinlei Chen
Alexei A. Efros
Alexei A. Efros
Xiaolong Wang
TTAViT3DGS
503
29
0
11 Jul 2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Semantic-SAM: Segment and Recognize Anything at Any Granularity
Feng Li
Hao Zhang
Pei Sun
Xueyan Zou
Siyi Liu
Jianwei Yang
Chun-yue Li
Lei Zhang
Jianfeng Gao
VLM
229
220
0
10 Jul 2023
Cluster-Induced Mask Transformers for Effective Opportunistic Gastric
  Cancer Screening on Non-contrast CT Scans
Cluster-Induced Mask Transformers for Effective Opportunistic Gastric Cancer Screening on Non-contrast CT ScansInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
Ming Yuan
Yingda Xia
Xin Chen
Jiawen Yao
Junling Wang
...
Bin Dong
Le Lu
Li Zhang
Zaiyi Liu
Ling Zhang
130
3
0
10 Jul 2023
TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task
  Foundation Model Learning
TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning
Z. Zhang
Xue Pan
64
0
0
07 Jul 2023
PSDR-Room: Single Photo to Scene using Differentiable Rendering
PSDR-Room: Single Photo to Scene using Differentiable RenderingACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2023
Kai Yan
Fujun Luan
MiloŠ HaŠAn
Thibault Groueix
Valentin Deschaintre
Shuang Zhao
181
24
0
06 Jul 2023
Towards accurate instance segmentation in large-scale LiDAR point clouds
Towards accurate instance segmentation in large-scale LiDAR point cloudsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (ISPRS Annals), 2023
Binbin Xiang
T. Peters
Theodora Kontogianni
Frawa Vetterli
Stefano Puliti
R. Astrup
Konrad Schindler
3DPCISeg
209
21
0
06 Jul 2023
A Critical Look at the Current Usage of Foundation Model for Dense
  Recognition Task
A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task
Shiqi Yang
Atsushi Hashimoto
Yoshitaka Ushiku
DiffMVLM
238
1
0
06 Jul 2023
MaskBEV: Joint Object Detection and Footprint Completion for Bird's-eye
  View 3D Point Clouds
MaskBEV: Joint Object Detection and Footprint Completion for Bird's-eye View 3D Point CloudsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
William Guimont-Martin
Jean-Michel Fortin
François Pomerleau
Philippe Giguère
3DPC
184
2
0
04 Jul 2023
ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour
ChildPlay: A New Benchmark for Understanding Children's Gaze BehaviourIEEE International Conference on Computer Vision (ICCV), 2023
Samy Tafasca
Anshul Gupta
J. Odobez
153
32
0
04 Jul 2023
EffSeg: Efficient Fine-Grained Instance Segmentation using
  Structure-Preserving Sparsity
EffSeg: Efficient Fine-Grained Instance Segmentation using Structure-Preserving Sparsity
Cédric Picron
Tinne Tuytelaars
ISeg
165
0
0
04 Jul 2023
AVSegFormer: Audio-Visual Segmentation with Transformer
AVSegFormer: Audio-Visual Segmentation with TransformerAAAI Conference on Artificial Intelligence (AAAI), 2023
Sheng Gao
Zhe Chen
Guo Chen
Wenhai Wang
Tong Lu
VOS
320
78
0
03 Jul 2023
Hierarchical Open-vocabulary Universal Image Segmentation
Hierarchical Open-vocabulary Universal Image SegmentationNeural Information Processing Systems (NeurIPS), 2023
Xudong Wang
Shufang Li
Konstantinos Kallidromitis
Yu Kato
Kazuki Kozuka
Trevor Darrell
VLMOCL
248
60
0
03 Jul 2023
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for
  Referring Video Object Segmentation
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object SegmentationPattern Recognition (Pattern Recogn.), 2023
Meng Lan
Fu Rong
Zuchao Li
Wei Yu
Guang Dai
VOS
327
11
0
02 Jul 2023
Learning Content-enhanced Mask Transformer for Domain Generalized
  Urban-Scene Segmentation
Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2023
Qi Bi
Shaodi You
Theo Gevers
ViT
569
67
0
01 Jul 2023
ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation
ReMaX: Relaxing for Better Training on Efficient Panoptic SegmentationNeural Information Processing Systems (NeurIPS), 2023
Shuyang Sun
Weijun Wang
Qihang Yu
Andrew G. Howard
Juil Sock
Liang-Chieh Chen
231
19
0
29 Jun 2023
RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation
  based on Visual Foundation Model
RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation ModelIEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Keyan Chen
Chenyang Liu
Hao Chen
Haotian Zhang
Wenyuan Li
Zhengxia Zou
Z. Shi
VLM
341
349
0
28 Jun 2023
Towards Open Vocabulary Learning: A Survey
Towards Open Vocabulary Learning: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Guohao Li
Dacheng Tao
ObjDVLM
366
213
0
28 Jun 2023
Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
Symphonize 3D Semantic Scene Completion with Contextual Instance QueriesComputer Vision and Pattern Recognition (CVPR), 2023
Hao Jiang
Tianheng Cheng
Naiyu Gao
Haoyang Zhang
Tianwei Lin
Wenyu Liu
Xinggang Wang
226
92
0
27 Jun 2023
Previous
123...252627...323334
Next