arXiv: 2303.14651
You Only Segment Once: Towards Real-Time Panoptic Segmentation
26 March 2023
Jie Hu
Linyan Huang
Tianhe Ren
Shengchuan Zhang
Rongrong Ji
Liujuan Cao
SSeg
Papers citing "You Only Segment Once: Towards Real-Time Panoptic Segmentation"
38 / 38 papers shown
Your ViT is Secretly an Image Segmentation Model
Tommie Kerssies
Niccolò Cavagnero
Alexander Hermans
Narges Norouzi
Giuseppe Averta
Bastian Leibe
Gijs Dubbelman
Daan de Geus
ViT
VLM
59
1
0
24 Mar 2025
vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding
Ali Tourani
Saad Ejaz
Hriday Bavle
David Morilla-Cabello
J. López
Holger Voos
76
2
0
03 Mar 2025
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
Jiawen Zhu
Huayi Tang
Xin Chen
Xinying Wang
Dong Wang
Huchuan Lu
42
1
0
01 Mar 2025
Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation
Hongwei Niu
Linhuang Xie
Jianghang Lin
Shengchuan Zhang
67
0
0
16 Dec 2024
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
Sanghyeok Lee
Joonmyung Choi
Hyunwoo J. Kim
107
3
0
22 Nov 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
37
0
0
19 Sep 2024
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data
Ali Tourani
Saad Ejaz
Hriday Bavle
Jose Luis Sanchez-Lopez
Holger Voos
3DPC
15
0
0
10 Sep 2024
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Haobo Yuan
Xiangtai Li
Lu Qi
Tao Zhang
Ming Yang
Shuicheng Yan
Chen Change Loy
VLM
32
10
0
27 Jun 2024
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images
Rushikesh Zawar
Shaurya Dewan
Andrew F. Luo
Margaret M. Henderson
Michael J. Tarr
Leila Wehbe
VGen
CoGe
31
1
0
19 Jun 2024
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Yansong Qu
Shaohui Dai
Xinyang Li
Jianghang Lin
Liujuan Cao
Shengchuan Zhang
Rongrong Ji
30
19
0
27 May 2024
3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting
Xuri Ge
Songpei Xu
Fuhai Chen
Jie Wang
Guoxin Wang
Shan An
Joemon M. Jose
3DPC
20
10
0
26 Apr 2024
Efficient Transformer Encoders for Mask2Former-style models
Manyi Yao
Abhishek Aich
Yumin Suh
Amit Roy-Chowdhury
Christian Shelton
Manmohan Chandraker
36
0
0
23 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
51
0
0
23 Apr 2024
IrrNet: Advancing Irrigation Mapping with Incremental Patch Size Training on Remote Sensing Imagery
Oishee Bintey Hoque
S. Swarup
Abhijin Adiga
S. Nouwakpo
M. Marathe
25
1
0
17 Apr 2024
The revenge of BiSeNet: Efficient Multi-Task Image Segmentation
Gabriele Rosi
Claudia Cuttano
Niccolò Cavagnero
Giuseppe Averta
Fabio Cermelli
SSeg
49
1
0
15 Apr 2024
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
Elham Amin Mansour
Ozan Unal
Suman Saha
Benjamin Bejar
Luc Van Gool
32
1
0
04 Apr 2024
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
Niccolò Cavagnero
Gabriele Rosi
Claudia Cuttano
Francesca Pistilli
Marco Ciccone
Giuseppe Averta
Fabio Cermelli
30
9
0
29 Feb 2024
Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks
Tianhe Ren
Shilong Liu
Ailing Zeng
Jing Lin
Kunchang Li
...
Feng Li
Jie-jin Yang
Hongyang Li
Qing Jiang
Lei Zhang
VLM
35
358
0
25 Jan 2024
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM
Chong Zhou
Xiangtai Li
Chen Change Loy
Bo Dai
VLM
22
44
0
11 Dec 2023
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications
Junyi Ma
Xieyuanli Chen
Jiawei Huang
Jingyi Xu
Zhen Luo
Jintao Xu
Weihao Gu
Rui Ai
Hesheng Wang
20
22
0
29 Nov 2023
Towards Real Time Egocentric Segment Captioning for The Blind and Visually Impaired in RGB-D Theatre Images
Khadidja Delloul
S. Larabi
10
2
0
26 Aug 2023
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
Wenqi Shao
Mengzhao Chen
Zhaoyang Zhang
Peng-Tao Xu
Lirui Zhao
Zhiqiang Li
Kaipeng Zhang
Peng Gao
Yu Qiao
Ping Luo
MQ
10
173
0
25 Aug 2023
Pseudo-label Alignment for Semi-supervised Instance Segmentation
Jie Hu
Cheng Chen
Liujuan Cao
Shengchuan Zhang
Annan Shu
Guannan Jiang
Rongrong Ji
ISeg
23
12
0
10 Aug 2023
Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Haowei Wang
Jiji Tang
Jiayi Ji
Xiaoshuai Sun
Rongsheng Zhang
...
Minda Zhao
Lincheng Li
Zeng Zhao
Tangjie Lv
R. Ji
3DV
21
13
0
06 Aug 2023
Improving Human-Object Interaction Detection via Virtual Image Learning
Shuman Fang
Shuai Liu
Jie Li
Guannan Jiang
Xianming Lin
R. Ji
VLM
17
5
0
04 Aug 2023
ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation
Shuyang Sun
Weijun Wang
Qihang Yu
Andrew G. Howard
Philip H. S. Torr
Liang-Chieh Chen
19
15
0
29 Jun 2023
detrex: Benchmarking Detection Transformers
Tianhe Ren
Siyi Liu
Feng Li
Hao Zhang
Ailing Zeng
...
Zhaoyang Zeng
Xianbiao Qi
Yuhui Yuan
Jianwei Yang
Lei Zhang
17
13
0
12 Jun 2023
DiffRate: Differentiable Compression Rate for Efficient Vision Transformers
Mengzhao Chen
Wenqi Shao
Peng Xu
Mingbao Lin
Kaipeng Zhang
Fei Chao
Rongrong Ji
Yu Qiao
Ping Luo
ViT
32
42
0
29 May 2023
A Strong and Reproducible Object Detector with Only Public Datasets
Tianhe Ren
Jianwei Yang
Siyi Liu
Ailing Zeng
Feng Li
Hao Zhang
Hongyang Li
Zhaoyang Zeng
Lei Zhang
ObjD
17
11
0
25 Apr 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
32
112
0
19 Apr 2023
InterFormer: Real-time Interactive Image Segmentation
YouFu Huang
Hao Yang
Ke Sun
Shengchuan Zhang
Liujuan Cao
Guannan Jiang
Rongrong Ji
27
22
0
06 Apr 2023
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
Yiwei Ma
Xiaioqing Zhang
Xiaoshuai Sun
Jiayi Ji
Haowei Wang
Guannan Jiang
Weilin Zhuang
R. Ji
15
39
0
28 Mar 2023
SMMix: Self-Motivated Image Mixing for Vision Transformers
Mengzhao Chen
Mingbao Lin
Zhihang Lin
Yu-xin Zhang
Fei Chao
Rongrong Ji
31
10
0
26 Dec 2022
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Golnaz Ghiasi
Yin Cui
A. Srinivas
Rui Qian
Tsung-Yi Lin
E. D. Cubuk
Quoc V. Le
Barret Zoph
ISeg
223
962
0
13 Dec 2020
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
249
1,982
0
28 Jul 2020
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
948
20,214
0
17 Apr 2017
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
210
2,034
0
07 Jun 2016
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Vijay Badrinarayanan
Alex Kendall
R. Cipolla
SSeg
420
15,438
0
02 Nov 2015