OneFormer: One Transformer to Rule Universal Image Segmentation

10 November 2022
Jitesh Jain
Jiacheng Li
M. Chiu
Ali Hassani
Nikita Orlov
Humphrey Shi
    ViT

Papers citing "OneFormer: One Transformer to Rule Universal Image Segmentation"

50 / 55 papers shown
ASHiTA: Automatic Scene-grounded HIerarchical Task Analysis
Yun Chang
Leonor Fermoselle
Duy Ta
Bernadette Bucher
Luca Carlone
Jiuguang Wang
30
0
0
09 Apr 2025
Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration
Zilong Huang
Jun-Jian He
Junyan Ye
Lihan Jiang
Weijia Li
Y. Chen
Ting Han
51
0
0
01 Apr 2025
Towards High-performance Spiking Transformers from ANN to SNN Conversion
Zihan Huang
Xinyu Shi
Zecheng Hao
Tong Bu
Jianhao Ding
Zhaofei Yu
Tiejun Huang
28
7
0
28 Feb 2025
Task-Driven Semantic Quantization and Imitation Learning for Goal-Oriented Communications
Yu-Chieh Chao
Yubei Chen
Weiwei Wang
Achintha Wijesinghe
Suchinthaka Wanninayaka
Songyang Zhang
Zhi Ding
DiffM
63
0
0
25 Feb 2025
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Shuai Yang
Jing Tan
Mengchen Zhang
Tong Wu
Y. Li
Gordon Wetzstein
Ziwei Liu
D. Lin
MDE
VGen
51
6
0
24 Feb 2025
Generalized Class Discovery in Instance Segmentation
Cuong Manh Hoang
Yeejin Lee
Byeongkeun Kang
ISeg
87
0
0
12 Feb 2025
ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal
Xiujin Zhu
Chee-Onn Chow
Joon Huang Chuah
Mamba
40
0
0
05 Nov 2024
Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation
Ruting Chi
Zhiyi Huang
Yuexing Han
ISeg
23
0
0
21 Oct 2024
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Jianqi Chen
Panwen Hu
Xiaojun Chang
Z. Shi
Michael C. Kampffmeyer
Xiaodan Liang
46
5
0
14 Oct 2024
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation
Ye Sun
Hao Zhang
Tiehua Zhang
Xingjun Ma
Yu-Gang Jiang
VLM
32
3
0
13 Oct 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
52
10
0
23 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
66
0
0
19 Sep 2024
Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Huy-Dung Nguyen
Anass Bairouk
Mirjana Maras
Wei Xiao
Tsun-Hsuan Wang
Patrick Chareyre
Ramin Hasani
Marc Blanchon
Daniela Rus
39
1
0
16 Sep 2024
A Simple and Generalist Approach for Panoptic Segmentation
Nedyalko Prisadnikov
Wouter Van Gansbeke
Danda Pani Paudel
Luc Van Gool
VLM
38
0
0
29 Aug 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
62
48
0
05 Aug 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
38
0
0
15 Jul 2024
CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation
Yuejiao Su
Yi Wang
Lap-Pui Chau
60
1
0
08 Jul 2024
A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling
Minghao Zhou
Hong Wang
Yefeng Zheng
Deyu Meng
24
1
0
02 Jul 2024
GMT: Guided Mask Transformer for Leaf Instance Segmentation
Feng Chen
Sotirios A. Tsaftaris
M. Giuffrida
13
1
0
24 Jun 2024
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
27
0
0
14 Jun 2024
WonderWorld: Interactive 3D Scene Generation from a Single Image
Hong-Xing Yu
Haoyi Duan
Charles Herrmann
William T. Freeman
Jiajun Wu
3DGS
VGen
41
37
0
13 Jun 2024
Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge
Nan Zhang
Xidan Zhang
Jianing Wei
Fangjun Wang
Zhiming Tan
MDE
22
0
0
06 Jun 2024
Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang
Ting Li
Jian Huang
DiffM
31
1
0
06 Jun 2024
RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling
Tianhang Wang
Fan Lu
Zehan Zheng
Zhijun Li
Changjun Jiang
53
2
0
27 May 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
56
0
0
23 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
59
4
0
04 Apr 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
25
5
0
06 Mar 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
20
2
0
23 Feb 2024
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLM
OCL
46
6
0
22 Feb 2024
You've Got to Feel It To Believe It: Multi-Modal Bayesian Inference for Semantic and Property Prediction
Parker Ewen
Hao Chen
Yuzhen Chen
Anran Li
Anup Bagali
Gitesh Gunjal
Ram Vasudevan
19
5
0
08 Feb 2024
Indoor and Outdoor 3D Scene Graph Generation via Language-Enabled Spatial Ontologies
Jared Strader
Nathan Hughes
William Chen
Alberto Speranzon
Luca Carlone
3DV
15
18
0
18 Dec 2023
CLiSA: A Hierarchical Hybrid Transformer Model using Orthogonal Cross Attention for Satellite Image Cloud Segmentation
Subhajit Paul
Ashutosh Gupta
11
2
0
29 Nov 2023
OpenForest: A data catalogue for machine learning in forest monitoring
Arthur Ouaknine
T. Kattenborn
Etienne Laliberté
David Rolnick
41
5
0
01 Nov 2023
Pseudo-Generalized Dynamic View Synthesis from a Video
Xiaoming Zhao
Alex Colburn
Fangchang Ma
Miguel Angel Bautista
J. Susskind
A. Schwing
30
11
0
12 Oct 2023
A Vision-Centric Approach for Static Map Element Annotation
Jiaxin Zhang
Shiyuan Chen
Haoran Yin
Ruohong Mei
Xuan Liu
Cong Yang
Qian Zhang
Wei Sui
3DV
24
3
0
21 Sep 2023
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing
Jiahang Li
Yikang Zhang
Peng Yun
Guangliang Zhou
Qijun Chen
Rui Fan
ViT
OffRL
11
26
0
19 Sep 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
29
7
0
14 Jun 2023
Matte Anything: Interactive Natural Image Matting with Segment Anything Models
J. Yao
Xinggang Wang
Lang Ye
Wenyu Liu
13
38
0
07 Jun 2023
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Xiuye Gu
Yin Cui
Jonathan Huang
Abdullah M. Rashwan
X. Yang
...
Golnaz Ghiasi
Weicheng Kuo
Huizhong Chen
Liang-Chieh Chen
David A. Ross
ISeg
24
26
0
02 Jun 2023
Explicit Visual Prompting for Universal Foreground Segmentations
Weihuang Liu
Xi Shen
Chi-Man Pun
Xiaodong Cun
VPVLM
VLM
22
14
0
29 May 2023
Interactive Segment Anything NeRF with Feature Imitation
Xiaokang Chen
Jiaxiang Tang
Diwen Wan
Jingbo Wang
Gang Zeng
29
22
0
25 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
32
89
0
14 May 2023
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models
Eric Zhang
Kai Wang
Xingqian Xu
Zhangyang Wang
Humphrey Shi
DiffM
42
172
0
30 Mar 2023
DDP: Diffusion Model for Dense Visual Prediction
Yuanfeng Ji
Zhe Chen
Enze Xie
Lanqing Hong
Xihui Liu
Zhaoqiang Liu
Tong Lu
Zhenguo Li
Ping Luo
DiffM
VLM
26
129
0
30 Mar 2023
Graph Transformer GANs for Graph-Constrained House Generation
H. Tang
Zhenyu Zhang
Humphrey Shi
Bo-wen Li
Lin Shao
N. Sebe
Radu Timofte
Luc Van Gool
29
19
0
14 Mar 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
27
17
0
03 Jan 2023
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
28
180
0
15 Nov 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng-Wei Zhang
Chao Zhang
Hanhua Hu
17
25
0
03 Oct 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
42
363
0
06 Jun 2022
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
Yixuan Wei
Han Hu
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Jianmin Bao
Dong Chen
B. Guo
CLIP
80
124
0
27 May 2022