2303.11926
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
21 March 2023
Shihao Wang
Yingfei Liu
Tiancai Wang
Ying Li
Xiangyu Zhang
3DPC
Papers citing
"Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection"
50 / 150 papers shown
Title
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction
Siyu Li
Kailun Yang
Hao Shi
Song Wang
You Yao
Zhiyong Li
31
2
0
13 Sep 2024
RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
Zhiwei Lin
Zhe Liu
Yongtao Wang
Le Zhang
Ce Zhu
23
4
0
08 Sep 2024
Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences
Rui Yu
Runkai Zhao
Cong Nie
Heng Wang
HuaiCheng Yan
Meng Wang
3DPC
21
2
0
06 Sep 2024
DiVE: DiT-based Video Generation with Enhanced Control
Junpeng Jiang
Gangyi Hong
Lijun Zhou
Enhui Ma
Hengtong Hu
...
Kaicheng Yu
Haiyang Sun
Kun Zhan
Peng Jia
Miao Zhang
VGen
DiffM
22
11
0
03 Sep 2024
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Dingyuan Zhang
Dingkang Liang
Zichang Tan
Xiaoqing Ye
Cheng Zhang
Jingdong Wang
Xiang Bai
ViT
36
2
0
01 Sep 2024
Enhancing Vectorized Map Perception with Historical Rasterized Maps
Xiaoyu Zhang
Guangwei Liu
Zihao Liu
Ningyi Xu
Yunhui Liu
Ji Zhao
24
7
0
01 Sep 2024
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
25
0
0
29 Aug 2024
AdaOcc: Adaptive-Resolution Occupancy Prediction
Chao-Yeh Chen
Ruoyu Wang
Yuliang Guo
Cheng Zhao
Xinyu Huang
Chen Feng
Liu Ren
37
0
0
24 Aug 2024
Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception
Jiaru Zhong
Haibao Yu
Tianyi Zhu
Jiahui Xu
Wenxian Yang
Zaiqing Nie
Chao Sun
19
3
0
20 Aug 2024
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
Yutao Zhu
Xiaosong Jia
Xinyu Yang
Junchi Yan
ViT
24
2
0
13 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
27
1
0
12 Aug 2024
PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction
Nan Peng
Xun Zhou
Mingming Wang
Xiaojun Yang
Songming Chen
Guisong Chen
31
2
0
24 Jul 2024
LangOcc: Self-Supervised Open Vocabulary Occupancy Estimation via Volume Rendering
Simon Boeder
Fabian Gigengack
Benjamin Risse
31
7
0
24 Jul 2024
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
40
0
0
22 Jul 2024
GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation
Florian Chabot
Nicolas Granger
G. Lapouge
3DGS
19
3
0
19 Jul 2024
Monocular Occupancy Prediction for Scalable Indoor Scenes
Hongxiao Yu
Yu-Quan Wang
Yuntao Chen
Zhaoxiang Zhang
46
6
0
16 Jul 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
Chunliang Li
Wencheng Han
Junbo Yin
Sanyuan Zhao
Jianbing Shen
25
3
0
15 Jul 2024
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
Zheng Jiang
Jinqing Zhang
Yanan Zhang
Qingjie Liu
Zhenghui Hu
Baohui Wang
Yunhong Wang
19
2
0
14 Jul 2024
PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
Jinhua Zhang
Hualian Sheng
Sijia Cai
Bing Deng
Qiao Liang
Wen Li
Ying Fu
Jieping Ye
Shuhang Gu
DiffM
32
2
0
08 Jul 2024
StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction
Jiaheng Zhuang
Guoan Wang
Siyu Zhang
Xiyang Wang
Hangning Zhou
Ziyao Xu
Chi Zhang
Zhiheng Li
22
1
0
28 Jun 2024
MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection
Michelle Adeline
Junn Yong Loo
Vishnu Monn Baskaran
36
0
0
25 Jun 2024
Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint
Xinglong Sun
Barath Lakshmanan
Maying Shen
Shiyi Lan
Jingde Chen
Jose Alvarez
VLM
31
3
0
17 Jun 2024
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Wenjie Wang
Yehao Lu
Guangcong Zheng
Shuigen Zhan
Xiaoqing Ye
Zichang Tan
Jingdong Wang
Gaoang Wang
Xi Li
50
9
0
13 Jun 2024
DualAD: Disentangling the Dynamic and Static World for End-to-End Driving
Simon Doll
Niklas Hanselmann
Lukas Schneider
Richard Schulz
Marius Cordts
Markus Enzweiler
Hendrik P. A. Lensch
27
5
0
10 Jun 2024
Bootstrapping Referring Multi-Object Tracking
Yani Zhang
Dongming Wu
Wencheng Han
Xingping Dong
37
5
0
07 Jun 2024
UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking
Lijun Zhou
Tao Tang
Pengkun Hao
Zihang He
Kalok Ho
...
Haiyang Sun
Kun Zhan
Peng Jia
Xianpeng Lang
Xiaodan Liang
VOT
39
4
0
04 Jun 2024
Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Enhui Ma
Lijun Zhou
Tao Tang
Zhan Zhang
Dong Han
...
Peng Jia
Xianpeng Lang
Haiyang Sun
Di Lin
Kaicheng Yu
VGen
16
20
0
03 Jun 2024
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
Wenchao Sun
Xuewu Lin
Yining Shi
Chuang Zhang
Haoran Wu
Sifa Zheng
24
23
0
30 May 2024
Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?
Yifan Bai
Dongming Wu
Yingfei Liu
Fan Jia
Weixin Mao
...
Yucheng Zhao
Jianbing Shen
Xing Wei
Tiancai Wang
Xiangyu Zhang
MLLM
16
9
0
28 May 2024
MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method
Pan Liao
Feng Yang
Di Wu
Liu Bo
24
1
0
24 May 2024
Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Xianpeng Liu
Ce Zheng
Ming Qian
Nan Xue
C. L. P. Chen
Zhebin Zhang
Chen Li
Tianfu Wu
26
2
0
20 May 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
39
1
0
17 May 2024
RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception
Xiaosu Zhu
Hualian Sheng
Sijia Cai
Bing Deng
Shaopeng Yang
Qiao Liang
Ken Chen
Lianli Gao
Jingkuan Song
Jieping Ye
27
4
0
16 May 2024
ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers
Jinke Li
Xiao He
Chonghua Zhou
Xiaoqiang Cheng
Yang Wen
Dan Zhang
ViT
28
11
0
07 May 2024
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Shihao Wang
Zhiding Yu
Xiaohui Jiang
Shiyi Lan
Min Shi
Nadine Chang
Jan Kautz
Ying Li
Jose M. Alvarez
LRM
31
47
0
02 May 2024
Inverse Neural Rendering for Explainable Multi-Object Tracking
Julian Ost
Tanushree Banerjee
Mario Bijelic
Felix Heide
22
0
0
18 Apr 2024
TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation
T. Monninger
Vandana Dokkadi
Md Zafar Anwar
Steffen Staab
19
2
0
17 Apr 2024
SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving
Diankun Zhang
Guoan Wang
Runwen Zhu
Jianbo Zhao
Xiwu Chen
...
Haotian Yao
Chi Zhang
Xiaojun Liu
Xiaoguang Di
Bin Li
26
10
0
10 Apr 2024
MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues
Xiahan Chen
Mingjian Chen
Sanli Tang
Yi Niu
Jiang Zhu
23
2
0
08 Apr 2024
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras
Zhongyu Xia
ZhiWei Lin
Xinhao Wang
Yongtao Wang
Yun Xing
Shengxiang Qi
Nan Dong
Ming-Hsuan Yang
31
4
0
03 Apr 2024
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Binyuan Huang
Yuqing Wen
Yucheng Zhao
Yaosi Hu
Yingfei Liu
...
Tiancai Wang
Chi Zhang
Chang Wen Chen
Zhenzhong Chen
Xiangyu Zhang
27
15
0
28 Mar 2024
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection
Zhiwei Lin
Zhe Liu
Zhongyu Xia
Xinhao Wang
Yongtao Wang
Shengxiang Qi
Yang Dong
Nan Dong
Le Zhang
Ce Zhu
19
34
0
25 Mar 2024
CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking
Nicolas Baumann
Michael Baumgartner
Edoardo Ghignone
Jonas Kühne
Tobias Fischer
Yung-Hsu Yang
Marc Pollefeys
Michele Magno
20
8
0
22 Mar 2024
Lifting Multi-View Detection and Tracking to the Bird's Eye View
Torben Teepe
Philipp Wolters
Johannes Gilg
Fabian Herzog
Gerhard Rigoll
44
7
0
19 Mar 2024
SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras
Yingqi Tang
Zhaotie Meng
Guoliang Chen
Erkang Cheng
3DPC
19
0
0
15 Mar 2024
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception
Yiheng Li
Hongyang Li
Zehao Huang
Hong Chang
Naiyan Wang
39
2
0
15 Mar 2024
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
Jialv Zou
Bencheng Liao
Qian Zhang
Wenyu Liu
Xinggang Wang
25
0
0
13 Mar 2024
SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection
Hongcheng Zhang
Liu Liang
Pengxin Zeng
Xiao Song
Zhe Wang
27
7
0
12 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
49
12
0
12 Mar 2024
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
Xiaofeng Wang
Zheng Zhu
Xinze Chen
Guan Huang
Xiaoyi Bao
Xingang Wang
VGen
20
14
0
11 Mar 2024