2112.01527
Masked-attention Mask Transformer for Universal Image Segmentation
2 December 2021
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
Papers citing
"Masked-attention Mask Transformer for Universal Image Segmentation"
50 / 1,661 papers shown
AFlow: Automating Agentic Workflow Generation
International Conference on Learning Representations (ICLR), 2024
Jiayi Zhang
Jinyu Xiang
Quan Shi
Xinbing Liang
Xionghui Chen
...
Jinlin Wang
Bingnan Zheng
Bang Liu
Yuyu Luo
Chenglin Wu
AIFin
AI4CE
335
6
0
14 Oct 2024
When Attention Sink Emerges in Language Models: An Empirical View
International Conference on Learning Representations (ICLR), 2024
Xiangming Gu
Tianyu Pang
Chao Du
Qian Liu
Fengzhuo Zhang
Cunxiao Du
Ye Wang
Min Lin
RALM
317
0
0
14 Oct 2024
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Jianqi Chen
Panwen Hu
Xiaojun Chang
Z. Shi
Michael C. Kampffmeyer
Xiaodan Liang
383
26
0
14 Oct 2024
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Ye Sun
Hao Zhang
Tiehua Zhang
Xingjun Ma
Yu-Gang Jiang
VLM
196
8
0
13 Oct 2024
DFIMat: Decoupled Flexible Interactive Matting in Multi-Person Scenarios
Asian Conference on Computer Vision (ACCV), 2024
Siyi Jiao
Wenzheng Zeng
Changxin Gao
Nong Sang
148
3
0
13 Oct 2024
Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models
Pascal Zwick
Kevin Roesch
Marvin Klemp
Oliver Bringmann
DiffM
177
1
0
11 Oct 2024
UW-SDF: Exploiting Hybrid Geometric Priors for Neural SDF Reconstruction from Underwater Multi-view Monocular Images
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Zeyu Chen
Jingyi Tang
Gu Wang
Shengquan Li
Xinghui Li
Xiangyang Ji
Xiu Li
109
1
0
10 Oct 2024
Shift and matching queries for video semantic segmentation
Tsubasa Mizuno
Toru Tamaki
194
0
0
10 Oct 2024
Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for Efficient Banana Plantation Segmentation in UAV Imagery
Frontiers in Plant Science (Front. Plant Sci.), 2024
Ang He
Ximei Wu
Xing Xu
Jing Chen
Xiaobin Guo
Sheng Xu
147
5
0
09 Oct 2024
OrionNav: Online Planning for Robot Autonomy with Context-Aware LLM and Open-Vocabulary Semantic Scene Graphs
Venkata Naren Devarakonda
Raktim Gautam Goswami
Ali Umut Kaypak
Naman Patel
Rooholla Khorrambakht
Prashanth Krishnamurthy
Farshad Khorrami
LM&Ro
313
10
0
08 Oct 2024
A Simple Image Segmentation Framework via In-Context Examples
Neural Information Processing Systems (NeurIPS), 2024
Yang Liu
Chenchen Jing
Hengtao Li
Huanyi Zheng
Hao Chen
Xinlong Wang
Chunhua Shen
137
12
0
07 Oct 2024
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding
Shenghao Li
123
1
0
06 Oct 2024
ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments
Lorenzo Terenzi
Julian Nubert
Pol Eyschen
Pascal Roth
Simin Fei
E. Jelavic
Marco Hutter
133
1
0
05 Oct 2024
ControlAR: Controllable Image Generation with Autoregressive Models
International Conference on Learning Representations (ICLR), 2024
Zongming Li
Tianheng Cheng
Shoufa Chen
Peize Sun
Haocheng Shen
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
DiffM
607
36
0
03 Oct 2024
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving
Zhangshuo Qi
Junyi Ma
Jingyi Xu
Zijie Zhou
Luqi Cheng
Guangming Xiong
331
5
0
01 Oct 2024
AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Boyu Han
Qianqian Xu
Zhiyong Yang
Shilong Bao
Peisong Wen
Yangbangyan Jiang
Qingming Huang
361
15
0
30 Sep 2024
Match Stereo Videos via Bidirectional Alignment
Junpeng Jing
Ye Mao
Anlan Qiu
K. Mikolajczyk
VGen
206
8
0
30 Sep 2024
Segmenting Wood Rot using Computer Vision Models
Jahrestagung der Gesellschaft für Informatik (GI Jahrestagung), 2024
Roland Kammerbauer
Thomas H. Schmitt
Tobias Bocklet
102
1
0
30 Sep 2024
Universal Medical Image Representation Learning with Compositional Decoders
Kaini Wang
Ling Yang
Siping Zhou
Guangquan Zhou
Wentao Zhang
Bin Cui
Shuo Li
SSL
MedIm
250
1
0
30 Sep 2024
Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Raphael Hagmanns
Peter Mortimer
Miguel Granero
T. Luettel
J. Petereit
295
7
0
27 Sep 2024
EfficientCrackNet: A Lightweight Model for Crack Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Abid Hasan Zim
Aquib Iqbal
Zaid Al-Huda
Asad U. Malik
Minoru Kuribayashi
183
7
0
26 Sep 2024
AgMTR: Agent Mining Transformer for Few-shot Segmentation in Remote Sensing
International Journal of Computer Vision (IJCV), 2024
Hanbo Bi
Yingchao Feng
Yongqiang Mao
Jianning Pei
Wenhui Diao
Hongqi Wang
Xian Sun
ViT
205
14
0
26 Sep 2024
VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection
Liangyu Zhong
Joachim Sicking
Fabian Hüger
Hanno Gottschalk
VLM
212
0
0
25 Sep 2024
First Place Solution to the ECCV 2024 BRAVO Challenge: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation
Tommie Kerssies
Daan de Geus
Gijs Dubbelman
246
3
0
25 Sep 2024
EventHDR: from Event to High-Speed HDR Videos and Beyond
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yunhao Zou
Ying Fu
Tsuyoshi Takatani
Yinqiang Zheng
201
16
0
25 Sep 2024
Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks
The Visual Computer (VC), 2024
Roberto Alcover-Couso
Juan C. Sanmiguel
Marcos Escudero-Viñolo
Jose M. Martínez
FedML
MoMe
149
3
0
24 Sep 2024
Adapting Segment Anything Model for Unseen Object Instance Segmentation
Rui Cao
Chuanxin Song
Biqi Yang
Jiangliu Wang
Pheng-Ann Heng
Yun-Hui Liu
VLM
260
4
0
23 Sep 2024
The BRAVO Semantic Segmentation Challenge Results in UNCV2024
Tuan-Hung Vu
Eduardo Valle
Andrei Bursuc
Tommie Kerssies
Daan de Geus
...
Michael J. Smith
F. Ferrie
Shamik Basu
Daniel Gehrig
Luc Van Gool
UQCV
VLM
315
5
0
23 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
International Conference on Learning Representations (ICLR), 2024
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
497
24
0
23 Sep 2024
Combining Absolute and Semi-Generalized Relative Poses for Visual Localization
Vojtech Panek
Torsten Sattler
Zuzana Kukelova
183
0
0
21 Sep 2024
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Xin Jiang
Junwei Zheng
Ruiping Liu
Jiahang Li
Jiaming Zhang
Sven Matthiesen
Rainer Stiefelhagen
VLM
151
2
0
21 Sep 2024
MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors
IEEE Robotics and Automation Letters (RA-L), 2024
Zhenhua Du
Binbin Xu
Haoyu Zhang
K. Huo
Shuaifeng Zhi
143
0
0
21 Sep 2024
A Bottom-Up Approach to Class-Agnostic Image Segmentation
Sebastian Dille
Ari Blondal
Sylvain Paris
Yağız Aksoy
135
0
0
20 Sep 2024
RingMo-Aerial: An Aerial Remote Sensing Foundation Model With Affine Transformation Contrastive Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Wenhui Diao
Haichen Yu
Kaiyue Kang
Tong Ling
Di Liu
...
Hanbo Bi
Libo Ren
Xuexue Li
Yongqiang Mao
Xian Sun
835
8
0
20 Sep 2024
Towards Robust Automation of Surgical Systems via Digital Twin-based Scene Representations from Foundation Models
Hao Ding
Lalithkumar Seenivasan
Hongchao Shu
Grayson Byrd
Han Zhang
Pu Xiao
Juan Antonio Barragan
Russell H. Taylor
Peter Kazanzides
Mathias Unberath
177
10
0
19 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
878
0
0
19 Sep 2024
Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic Segmentation in Autonomous Driving
Yuting Hong
Hui Xiao
Huazheng Hao
Xiaojie Qiu
Baochen Yao
Chengbin Peng
175
0
0
19 Sep 2024
How to predict on-road air pollution based on street view images and machine learning: a quantitative analysis of the optimal strategy
Hui Zhong
Di Chen
Pengqin Wang
Wenrui Wang
Shaojie Shen
Yonghong Liu
Meixin Zhu
75
0
0
19 Sep 2024
Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving
Sándor Kunsági-Máté
Levente Peto
Lehel Seres
Tamás Matuszka
3DPC
213
3
0
19 Sep 2024
Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments
IEEE Transactions on Robotics (IEEE Trans. Robot.), 2024
Gang Chen
Zhaoying Wang
Wei Dong
Javier Alonso-Mora
505
4
0
18 Sep 2024
Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation using Rein to Fine-tune Vision Foundation Models
Pengzhou Cai
Xueyuan Zhang
Libin Lan
Ze Zhao
286
0
0
18 Sep 2024
Robot Manipulation in Salient Vision through Referring Image Segmentation and Geometric Constraints
IEEE International Conference on Robotics and Automation (ICRA), 2024
Chen Jiang
Allie Luo
Martin Jägersand
229
4
0
17 Sep 2024
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem
M. E. Kalfaoglu
H. Öztürk
Ozsel Kilinc
A. Temizel
3DPC
243
3
0
17 Sep 2024
MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping
Image and Vision Computing (IVC), 2024
Amirreza Fateh
Mohammad Reza Mohammadi
Mohammad Reza Jahed Motlagh
ViT
768
11
0
17 Sep 2024
Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Hanbo Bi
Yingchao Feng
Wenhui Diao
Peijin Wang
Yongqiang Mao
Kun Fu
Hongqi Wang
Xian Sun
VLM
164
16
0
16 Sep 2024
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation
Qilong Zhangli
Di Liu
Abhishek Aich
Dimitris Metaxas
S. Schulter
183
1
0
15 Sep 2024
Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future Prospects
IEEE Access (IEEE Access), 2024
Awal Ahmed Fime
Saifuddin Mahmud
Arpita Das
Md. Sunzidul Islam
Hong-Hoon Kim
VGen
3DV
219
2
0
14 Sep 2024
Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Hugo Porta
Emanuele Dalsasso
Diego Marcos
D. Tuia
517
1
0
14 Sep 2024
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Wenhao Xu
Changwei Wang
Xuxiang Feng
Rongtao Xu
Longzhao Huang
Zherui Zhang
Li Guo
Shibiao Xu
VLM
299
7
0
13 Sep 2024
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
ACM Multimedia (MM), 2024
Hongyu Li
Tianrui Hui
Zihan Ding
Jing Zhang
Bin Ma
Xiaoming Wei
Jizhong Han
Si Liu
DiffM
193
4
0
12 Sep 2024