Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2104.10956
Cited By
v1
v2
v3 (latest)
FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection
22 April 2021
Tai Wang
Xinge Zhu
Jiangmiao Pang
Dahua Lin
3DPC
Re-assign community
ArXiv (abs)
PDF
HTML
Github (5785★)
Papers citing
"FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection"
50 / 378 papers shown
Title
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
Jinqiao Wang
561
11
0
25 Nov 2024
Open Vocabulary Monocular 3D Object Detection
Jin Yao
Hao Gu
Xuweiyi Chen
Jiayun Wang
Zezhou Cheng
ObjD
VLM
398
9
0
25 Nov 2024
Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction
Miguel Antunes-García
L. Bergasa
Santiago Montiel-Marín
R. Barea
Fabio Sánchez-García
Ángel Llamazares
175
0
0
11 Nov 2024
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation
International Conference on 3D Vision (3DV), 2024
Weijie Ma
Jingwei Jiang
Yue Yang
Zhaoyu Chen
Hao Chen
240
3
0
09 Nov 2024
EMMA: End-to-End Multimodal Model for Autonomous Driving
Jyh-Jing Hwang
Runsheng Xu
Hubert Lin
Wei-Chih Hung
Jingwei Ji
...
Benjamin Sapp
Yin Zhou
James Guo
Dragomir Anguelov
Mingxing Tan
VLM
LM&Ro
336
104
0
30 Oct 2024
Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
Neural Information Processing Systems (NeurIPS), 2024
Gyusam Chang
Jiwon Lee
Donghyun Kim
Jinkyu Kim
Dongwook Lee
Daehyun Ji
Sujin Jang
Sangpil Kim
330
8
0
29 Oct 2024
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li
Wenzhao Zheng
Xiaonan Huang
Kurt Keutzer
371
0
0
17 Oct 2024
MambaBEV: An efficient 3D detection model with Mamba2
Zihan You
Hao Wang
Qichao Zhao
Jinxiang Wang
Jinxiang Wang
Mamba
213
4
0
16 Oct 2024
ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Detection
Jiwei Chen
Laiyan Ding
Chi Zhang
Feifei Li
197
1
0
14 Oct 2024
CAMOT: Camera Angle-aware Multi-Object Tracking
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Felix Limanta
Kuniaki Uto
Koichi Shinoda
VOT
350
11
0
26 Sep 2024
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework
Xiaoyu Li
Peidong Li
Lijun Zhao
Dedong Liu
Jinghan Gao
Xian Wu
Yitao Wu
Dixiao Cui
VOT
300
3
0
18 Sep 2024
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception
Lei He
Qiaoyi Wang
Honglin Sun
Qing Xu
Bolin Gao
Shengbo Eben Li
Jianqiang Wang
Keqiang Li
204
1
0
09 Sep 2024
GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jinqing Zhang
Yanan Zhang
Yunlong Qi
Z. Fu
Qingjie Liu
Yunhong Wang
105
9
0
03 Sep 2024
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
119
5
0
29 Aug 2024
Comparison of Model Predictive Control and Proximal Policy Optimization for a 1-DOF Helicopter System
International Conference on Industrial Informatics (INDIN), 2024
Georg Schäfer
Jakob Rehrl
Stefan Huber
Simon Hirlaender
346
3
0
28 Aug 2024
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments
Computer Vision and Pattern Recognition (CVPR), 2024
Haisheng Su
Feixiang Song
Cong Ma
Wei Wu
Junchi Yan
265
0
0
28 Aug 2024
HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction
IEEE Robotics and Automation Letters (RA-L), 2024
Xiao Zhao
Bo Chen
Mingyang Sun
Dingkang Yang
Youxing Wang
Xukun Zhang
Mingcheng Li
Dongliang Kou
Xiaoyi Wei
Lihua Zhang
239
13
0
17 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
280
9
0
12 Aug 2024
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
Juhan Cha
Minseok Joo
Jihwan Park
Sanghyeok Lee
In-Ho Kim
Hyunwoo J. Kim
340
2
0
27 Jul 2024
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
Jiasen Wang
Zhenglin Li
Ke Sun
Xianyuan Liu
Yang Zhou
158
1
0
24 Jul 2024
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
234
4
0
22 Jul 2024
Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection
Yiran Yang
Xu Gao
Tong Wang
Xin Hao
Yifeng Shi
Xiao Tan
Xiaoqing Ye
Jingdong Wang
3DPC
130
0
0
22 Jul 2024
OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation
Jian Sun
Yuqi Dai
Chi-Man Vong
Qing Xu
Shengbo Eben Li
Jianqiang Wang
Lei He
Keqiang Li
292
3
0
18 Jul 2024
Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving
Yuqi Dai
Jian Sun
Shengbo Eben Li
Qing Xu
Jianqiang Wang
Lei He
Keqiang Li
236
3
0
17 Jul 2024
Perception Helps Planning: Facilitating Multi-Stage Lane-Level Integration via Double-Edge Structures
Guoliang You
Xiaomeng Chu
YiFan Duan
Wenyu Zhang
Xingchen Li
Sha Zhang
Yao Li
Jianmin Ji
Yanyong Zhang
183
1
0
16 Jul 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
Chunliang Li
Wencheng Han
Junbo Yin
Sanyuan Zhao
Jianbing Shen
152
10
0
15 Jul 2024
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
Sanmin Kim
Youngseok Kim
Sihwan Hwang
H. Jeong
Dongsuk Kum
197
10
0
14 Jul 2024
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
Zheng Jiang
Jinqing Zhang
Yanan Zhang
Qingjie Liu
Zhenghui Hu
Baohui Wang
Yunhong Wang
186
4
0
14 Jul 2024
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception
Shaohong Wang
Lu Bin
Xinyu Xiao
Zhiyu Xiang
Hangguan Shan
Eryun Liu
ViT
271
7
0
13 Jul 2024
OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects
Akshay Krishnan
Abhijit Kundu
Kevis-Kokitsi Maninis
James Hays
Matthew Brown
143
19
0
11 Jul 2024
Occupancy as Set of Points
Yiang Shi
Tianheng Cheng
Qian Zhang
Wenyu Liu
Xinggang Wang
3DPC
234
24
0
04 Jul 2024
Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking
Mingzhe Guo
Zhipeng Zhang
Liping Jing
Yuan He
Ke Wang
Heng Fan
193
3
0
03 Jul 2024
BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection
Yang Song
Lin Wang
321
7
0
27 Jun 2024
MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection
Michelle Adeline
Junn Yong Loo
Vishnu Monn Baskaran
189
2
0
25 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
270
19
0
13 Jun 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
201
4
0
12 Jun 2024
UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection
Yuchao Wang
Peirui Cheng
Pengju Tian
Ziyang Yuan
Liangjin Zhao
Jing Tian
Wensheng Wang
Zhirui Wang
Xian Sun
172
5
0
07 Jun 2024
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Yuanhui Huang
Wenzhao Zheng
Yunpeng Zhang
Jie Zhou
Jiwen Lu
3DGS
271
87
0
27 May 2024
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving
Shaoyuan Xie
Lingdong Kong
Wenwei Zhang
Jiawei Ren
Liang Pan
Kai-xiang Chen
Ziwei Liu
AAML
254
26
0
27 May 2024
Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Xianpeng Liu
Ce Zheng
Ming Qian
Nan Xue
Chong Chen
Zhebin Zhang
Chen Li
Tianfu Wu
269
7
0
20 May 2024
oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2024
Abdul Hannan Khan
Syed Tahseen Raza Rizvi
Dheeraj Varma Chittari Macharavtu
Andreas Dengel
154
0
0
13 May 2024
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
Neural Information Processing Systems (NeurIPS), 2024
Xue-Qiu Jiang
Sheng Jin
Xiaoqin Zhang
Ling Shao
Shijian Lu
MDE
162
16
0
13 May 2024
ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers
Jinke Li
Xiao He
Chonghua Zhou
Xiaoqiang Cheng
Yang Wen
Dan Zhang
ViT
183
22
0
07 May 2024
Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System
Daniel Dworak
M. Komorkiewicz
P. Skruch
Jerzy Baranowski
143
1
0
25 Apr 2024
Object criticality for safer navigation
Andrea Ceccarelli
Leonardo Montecchi
143
2
0
25 Apr 2024
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko
Alexandre Boulch
Spyros Gidaris
Andrei Bursuc
Antonín Vobecký
Patrick Pérez
Renaud Marlet
3DPC
257
16
0
22 Apr 2024
MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues
Xiahan Chen
Mingjian Chen
Sanli Tang
Yi Niu
Jiang Zhu
147
4
0
08 Apr 2024
Better Monocular 3D Detectors with LiDAR from the Past
Yurong You
Cheng Perng Phoo
Carlos Diaz-Ruiz
Katie Z Luo
Wei-Lun Chao
Mark E. Campbell
B. Hariharan
Kilian Q. Weinberger
3DPC
233
1
0
08 Apr 2024
DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation
Duy-Tho Le
Hengcan Shi
Jianfei Cai
Hamid Rezatofighi
127
17
0
06 Apr 2024
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras
European Conference on Computer Vision (ECCV), 2024
Zhongyu Xia
ZhiWei Lin
Xinhao Wang
Yongtao Wang
Yun Xing
Shengxiang Qi
Nan Dong
Ming-Hsuan Yang
166
15
0
03 Apr 2024
Previous
1
2
3
4
5
6
7
8
Next