Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.11926
Cited By
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
21 March 2023
Shihao Wang
Yingfei Liu
Tiancai Wang
Ying Li
Xiangyu Zhang
3DPC
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection"
50 / 150 papers shown
Title
RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation
Zhiwen Zeng
Yunfei Yin
Zheng Yuan
Argho Dey
Xianjian Bao
16
0
0
10 May 2025
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion
Haoteng Li
Zhao Yang
Zezhong Qian
Gongpeng Zhao
Yuqi Huang
Jun-chen Yu
Huazheng Zhou
Longjun Liu
46
1
0
03 May 2025
Towards Latency-Aware 3D Streaming Perception for Autonomous Driving
Jiaqi Peng
Tai Wang
Jiangmiao Pang
Yuan Shen
33
0
0
27 Apr 2025
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
Haotian Dong
X. Wang
D. Lin
Yipeng Wu
Qin Chen
R. Liu
Kairui Yang
Ping Li
Qing-Wu Guo
VGen
42
0
0
25 Apr 2025
Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection
Linhua Kong
Dongxia Chang
Lian Liu
Zisen Kong
Pengyuan Li
Yao Zhao
17
0
0
23 Apr 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen
Huan Zheng
Jin Fang
Xingping Dong
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
24
0
0
17 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Tao Luo
Xin Zhan
Junbo Chen
3DPC
35
0
0
17 Apr 2025
A Modular Energy Aware Framework for Multicopter Modeling in Control and Planning Applications
Sebastian Gasche
Christian Kallies
Andreas Himmel
R. Findeisen
29
0
0
04 Apr 2025
Control Map Distribution using Map Query Bank for Online Map Generation
Ziming Liu
Leichen Wang
Ge Yang
Xinrun Li
Xingtao Hu
Hao Sun
Guangyu Gao
26
0
0
04 Apr 2025
MDP: Multidimensional Vision Model Pruning with Latency Constraint
Xinglong Sun
Barath Lakshmanan
Maying Shen
Shiyi Lan
Jingde Chen
Jose M. Alvarez
VLM
44
0
0
02 Apr 2025
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving
Fuhao Li
Huan Jin
Bin-Bin Gao
Liaoyuan Fan
Lihui Jiang
Long Zeng
63
0
0
28 Mar 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Kuang Wu
Chuan Yang
Zhanbin Li
53
0
0
27 Mar 2025
ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation
Haoyu Fu
Diankun Zhang
Zongchuang Zhao
Jianfeng Cui
Dingkang Liang
Chong Zhang
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
38
1
0
25 Mar 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
68
0
0
18 Mar 2025
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control
Nvidia
Hassan Abu Alhaija
Jose M. Alvarez
Maciej Bala
Tiffany Cai
...
Yuchong Ye
Xiaodong Yang
X. Yang
Xiaohui Zeng
Yu Zeng
VGen
90
1
0
18 Mar 2025
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
T. Monninger
Md Zafar Anwar
Stanislaw Antol
Steffen Staab
Sihao Ding
33
0
0
17 Mar 2025
SparseAlign: A Fully Sparse Framework for Cooperative Object Detection
Yunshuang Yuan
Yan Xia
Daniel Cremers
Monika Sester
55
0
0
17 Mar 2025
Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation
Kailin Li
Zhenxin Li
Shiyi Lan
Yuan Xie
Zhizhong Zhang
J. Liu
Zuxuan Wu
Zhiding Yu
Jose M.Alvarez
50
1
0
17 Mar 2025
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning
Lizhen Xu
Xiuxiu Bai
Xiaojun Jia
Jianwu Fang
Shanmin Pang
58
0
0
13 Mar 2025
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation
Yuwen Du
Anning Hu
Zichen Chao
Yifan Lu
Junhao Ge
Genjia Liu
Weitao Wu
Lanjun Wang
Siheng Chen
50
0
0
13 Mar 2025
CoCMT: Communication-Efficient Cross-Modal Transformer for Collaborative Perception
Rujia Wang
Xiangbo Gao
Hao Xiang
Runsheng Xu
Zhengzhong Tu
47
2
0
13 Mar 2025
HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking
Jing Yang
Sen Yang
Xiao Tan
Hanli Wang
45
1
0
13 Mar 2025
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
Xiaosong Jia
Junqi You
Zhiyuan Zhang
Junchi Yan
42
4
0
07 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM
VGen
61
1
0
05 Mar 2025
Glad: A Streaming Scene Generator for Autonomous Driving
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
X. Zhang
3DGS
VGen
41
1
0
26 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
L. Zhang
Philip H. S. Torr
66
4
0
24 Feb 2025
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Jianing Li
Ming Lu
Hao Wang
Chenyang Gu
Wenzhao Zheng
Li Du
S. Zhang
83
0
0
28 Jan 2025
CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Y. Li
Yang Yang
Zhen Lei
3DPC
46
2
0
11 Jan 2025
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Houqiang Li
Yanyong Zhang
90
0
0
17 Dec 2024
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Lizhen Xu
Shanmin Pang
Wenzhao Qiu
Zehao Wu
Xiuxiu Bai
K. Mei
Jianru Xue
69
1
0
03 Dec 2024
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Felix Fent
Gerhard Rigoll
72
0
0
29 Nov 2024
CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction
Lipeng Gu
Xuefeng Yan
Weiming Wang
Honghua Chen
Dingkun Zhu
Liangliang Nan
Mingqiang Wei
59
0
0
28 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPC
VLM
75
1
0
23 Nov 2024
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving
Shaoqing Xu
Fang Li
Shengyin Jiang
Ziying Song
Li Liu
Zhi-xin Yang
3DGS
SSL
77
0
0
19 Nov 2024
Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and Propagation
Nayeon Kim
Hongje Seong
Daehyun Ji
Sujin Jang
27
2
0
17 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
43
0
0
16 Nov 2024
EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving Planners
Niklas Hanselmann
Simon Doll
Marius Cordts
Hendrik P. A. Lensch
Andreas Geiger
39
0
0
12 Nov 2024
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection
Jisong Kim
Minjae Seong
Jun Won Choi
26
0
0
05 Nov 2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning
N. Durasov
Rafid Mahmood
Jiwoong Choi
Marc T. Law
James Lucas
Pascal Fua
Jose M. Alvarez
UQCV
EDL
3DPC
37
0
0
31 Oct 2024
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li
Wenzhao Zheng
Xiaonan Huang
Kurt Keutzer
41
1
0
17 Oct 2024
Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion
Minkyoung Cho
Yulong Cao
Jiachen Sun
Qingzhao Zhang
Marco Pavone
Jeong Joon Park
Heng Yang
Z. Morley Mao
15
0
0
16 Oct 2024
TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
Zhiwei Lin
Hongbo Jin
Yongtao Wang
Yufei Wei
Nan Dong
30
2
0
15 Oct 2024
UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles
Hui Ye
Rajshekhar Sunderraman
Shihao Ji
18
2
0
14 Oct 2024
Motion Forecasting in Continuous Driving
Nan Song
Bozhou Zhang
Xiatian Zhu
L. Zhang
18
1
0
08 Oct 2024
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection
Yang Cao
Yuanliang Jv
Dan Xu
3DGS
29
3
0
02 Oct 2024
MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction
Jingyu Song
Xudong Chen
Liupei Lu
Jie Li
Katherine A. Skinner
21
1
0
26 Sep 2024
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework
Xiaoyu Li
Peidong Li
Lijun Zhao
Dedong Liu
Jinghan Gao
Xian Wu
Yitao Wu
Dixiao Cui
VOT
28
0
0
18 Sep 2024
RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Jinrang Jia
Guangqi Yi
Yifeng Shi
21
0
0
18 Sep 2024
DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving
Haisheng Su
Wei Wu
Junchi Yan
18
0
0
15 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
20
9
0
14 Sep 2024
1
2
3
Next