ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.04838
  4. Cited By
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
v1v2v3v4v5v6v7 (latest)

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

Computer Vision and Pattern Recognition (CVPR), 2019
10 December 2019
Pei Sun
Henrik Kretzschmar
Xerxes Dotiwalla
Aurelien Chouard
Vijaysai Patnaik
P. Tsui
James Guo
Yin Zhou
Yuning Chai
Benjamin Caine
Vijay Vasudevan
Wei Han
Jiquan Ngiam
Hang Zhao
Aleksei Timofeev
Scott Ettinger
Maxim Krivokon
A. Gao
Aditya Joshi
Sheng Zhao
Shuyang Cheng
Yu Zhang
Jonathon Shlens
Zhifeng Chen
Dragomir Anguelov
ArXiv (abs)PDFHTML

Papers citing "Scalability in Perception for Autonomous Driving: Waymo Open Dataset"

50 / 1,848 papers shown
Mcity Data Engine: Iterative Model Improvement Through Open-Vocabulary Data Selection
Mcity Data Engine: Iterative Model Improvement Through Open-Vocabulary Data Selection
Daniel Bogdoll
Rajanikant Ananta
Abeyankar Giridharan
Isabel Moore
Gregory Stevens
Henry X. Liu
VLM
363
0
0
30 Apr 2025
Breaking Down Monocular Ambiguity: Exploiting Temporal Evolution for 3D Lane Detection
Breaking Down Monocular Ambiguity: Exploiting Temporal Evolution for 3D Lane Detection
Huan Zheng
Wencheng Han
Tianyi Yan
Cheng-Zhong Xu
Jianbing Shen
400
1
0
29 Apr 2025
Floating Car Observers in Intelligent Transportation Systems: Detection Modeling and Temporal Insights
Floating Car Observers in Intelligent Transportation Systems: Detection Modeling and Temporal Insights
Jeremias Gerner
Klaus Bogenberger
Stefanie Schmidtner
136
0
0
29 Apr 2025
A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes
A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes
Nicolas Münger
M. Ronecker
Xavier Diaz
Michael Karner
Daniel Watzenig
Jan Skaloud
3DPC
964
0
0
25 Apr 2025
Depth3DLane: Monocular 3D Lane Detection via Depth Prior Distillation
Depth3DLane: Monocular 3D Lane Detection via Depth Prior Distillation
Dongxin Lyu
Han Huang
Cheng Tan
Zimu Li
MDE
292
0
0
25 Apr 2025
Dynamic Camera Poses and Where to Find Them
Dynamic Camera Poses and Where to Find ThemComputer Vision and Pattern Recognition (CVPR), 2025
C. Rockwell
Joseph Tung
Nayeon Lee
Xuan Li
David Fouhey
Chen-Hsuan Lin
405
14
0
24 Apr 2025
A Decade of You Only Look Once (YOLO) for Object Detection: A Review
A Decade of You Only Look Once (YOLO) for Object Detection: A ReviewIEEE Access (IEEE Access), 2025
Leo Thomas Ramos
Angel D. Sappa
500
5
0
24 Apr 2025
Learning Isometric Embeddings of Road Networks using Multidimensional Scaling
Learning Isometric Embeddings of Road Networks using Multidimensional Scaling
Juan Carlos Climent Pardo
364
1
0
24 Apr 2025
Highly Accurate and Diverse Traffic Data: The DeepScenario Open 3D Dataset
Highly Accurate and Diverse Traffic Data: The DeepScenario Open 3D Dataset
Oussema Dhaouadi
Johannes Meier
Luca Wahl
Jacques Kaiser
Luca Scalerandi
Nick Wandelburg
Zhuolun Zhou
Nijanthan Berinpanathan
Holger Banzhaf
Zorah Lähner
348
5
0
24 Apr 2025
CaRL: Learning Scalable Planning Policies with Simple Rewards
CaRL: Learning Scalable Planning Policies with Simple Rewards
Bernhard Jaeger
D. Dauner
Jens Beißwenger
Simon Gerstenecker
Kashyap Chitta
Andreas Geiger
564
10
0
24 Apr 2025
Gaussian Splatting is an Effective Data Generator for 3D Object Detection
Gaussian Splatting is an Effective Data Generator for 3D Object Detection
F. G. Zanjani
Davide Abati
Auke Wiggers
Dimitris Kalatzis
Jens Petersen
Hong Cai
A. Habibian
3DGS
894
1
0
23 Apr 2025
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Duy-Tho Le
Trung Pham
Jianfei Cai
H. Rezatofighi
320
2
0
23 Apr 2025
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment
Xuzhao Li
Chenming Wu
Zhao Yang
Zhihao Xu
Dingkang Liang
Yanzhe Zhang
Ji Wan
Jiadong Wang
VGen
407
6
0
22 Apr 2025
Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models
Pose Optimization for Autonomous Driving Datasets using Neural Rendering Models
Quentin Herau
Nathan Piasco
Moussâb Bennehar
Luis Rolado
D. Tsishkou
Bingbing Liu
Cyrille Migniot
Pascal Vasseur
C. Demonceaux
183
1
0
22 Apr 2025
Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends
Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends
M. Tami
Mohammed Elhenawy
Huthaifa I. Ashqar
330
1
0
21 Apr 2025
Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding
Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding
Tong Zeng
Longfeng Wu
Liang Shi
Dawei Zhou
Feng Guo
203
7
0
20 Apr 2025
Seurat: From Moving Points to Depth
Seurat: From Moving Points to DepthComputer Vision and Pattern Recognition (CVPR), 2025
Seokju Cho
Jiahui Huang
S. Kim
Joon-Young Lee
3DPCMDE
286
9
0
20 Apr 2025
Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction
Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction
Wenyu Li
Sidun Liu
Peng Qiao
Yong Dou
353
3
0
18 Apr 2025
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Shumin Wang
Zhuoran Yang
Liwen Wang
ZhiPeng Tang
Heng Li
Lehan Pan
Sha Zhang
Jie Peng
Jianmin Ji
Y. Zhang
3DPC
295
0
0
17 Apr 2025
AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis
AerialMegaDepth: Learning Aerial-Ground Reconstruction and View SynthesisComputer Vision and Pattern Recognition (CVPR), 2025
Khiem Vuong
Anurag Ghosh
Deva Ramanan
S. Narasimhan
Shubham Tulsiani
230
14
0
17 Apr 2025
Collaborative Perception Datasets for Autonomous Driving: A Review
Collaborative Perception Datasets for Autonomous Driving: A ReviewIEEE Sensors Journal (IEEE Sens. J.), 2025
N. Wang
Deyong Shang
Yan Gong
X. S. Hu
Tong Zhao
Hongyu Pan
Yanwen Huang
Xiaoyu Wang
J. Lu
436
9
0
17 Apr 2025
Regist3R: Incremental Registration with Stereo Foundation Model
Regist3R: Incremental Registration with Stereo Foundation Model
Sidun Liu
Wenyu Li
Peng Qiao
Yong Dou
3DV
433
7
0
16 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
562
2
0
15 Apr 2025
E2E Parking Dataset: An Open Benchmark for End-to-End Autonomous Parking
E2E Parking Dataset: An Open Benchmark for End-to-End Autonomous Parking
Kejia Gao
Liguo Zhou
Mingjun Liu
Alois C. Knoll
303
1
0
15 Apr 2025
Decoupled Diffusion Sparks Adaptive Scene Generation
Decoupled Diffusion Sparks Adaptive Scene Generation
Yunsong Zhou
Naisheng Ye
William Ljungbergh
Tianyu Li
Jiazhi Yang
Zetong Yang
Hongzi Zhu
Christoffer Petersson
Hongyang Li
268
9
0
14 Apr 2025
ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking
ReferGPT: Towards Zero-Shot Referring Multi-Object Tracking
Tzoulio Chamiti
Leandro Di Bella
Adrian Munteanu
Nikos Deligiannis
332
6
0
12 Apr 2025
A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds
A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds
Jizong Peng
Tze Ho Elden Tse
Kai Xu
Wenchao Gao
Angela Yao
3DGS
306
0
0
12 Apr 2025
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
Vinal Asodia
Zhenhua Feng
Saber Fallah
OffRL
279
0
0
11 Apr 2025
TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing
TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing
Neil Reichlin
Nicolas Baumann
Edoardo Ghignone
Michele Magno
202
0
0
11 Apr 2025
Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review
Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review
Jörg Gamerdinger
Sven Teufel
Oliver Bringmann
173
1
0
11 Apr 2025
RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions
RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions
Youngwan Jin
Michal Kovac
Yagiz Nalcakan
Hyeongjin Ju
Hanbin Song
Sanghyeop Yeo
Shiho Kim
282
0
0
10 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
459
6
0
09 Apr 2025
Model-Agnostic Policy Explanations with Large Language Models
Model-Agnostic Policy Explanations with Large Language Models
Zhang Xi-Jia
Yue (Sophie) Guo
Shufei Chen
Simon Stepputtis
Matthew C. Gombolay
Katia Sycara
Joseph Campbell
LM&RoLRM
331
3
0
08 Apr 2025
Targetless LiDAR-Camera Calibration with Neural Gaussian Splatting
Targetless LiDAR-Camera Calibration with Neural Gaussian Splatting
Haebeom Jung
Namtae Kim
Jungwoo Kim
Jaesik Park
3DGS
898
1
0
06 Apr 2025
Systematic Literature Review on Vehicular Collaborative Perception - A Computer Vision Perspective
Systematic Literature Review on Vehicular Collaborative Perception - A Computer Vision Perspective
Lei Wan
Jianxin Zhao
Andreas Wiedholz
Manuel Bied
Mateus Martinez de Lucena
Abhishek Dinkar Jagtap
Andreas Festag
Antônio Augusto Fröhlich
Hannan Ejaz Keen
Alexey Vinel
482
2
0
06 Apr 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Multimodal Fusion and Vision-Language Models: A Survey for Robot VisionInformation Fusion (Inf. Fusion), 2025
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
439
37
0
03 Apr 2025
MinkOcc: Towards real-time label-efficient semantic occupancy prediction
MinkOcc: Towards real-time label-efficient semantic occupancy prediction
Samuel Sze
Daniele De Martini
Lars Kunze
3DPC
301
2
0
03 Apr 2025
CornerPoint3D: Look at the Nearest Corner Instead of the Center
CornerPoint3D: Look at the Nearest Corner Instead of the Center
Ruixiao Zhang
Runwei Guan
Xinyu Chen
Adam Prugel-Bennett
Xiaohao Cai
3DPC
287
1
0
03 Apr 2025
WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
Chaojun Ni
Xiaofeng Wang
Zheng Zhu
Wei Wang
Haoyun Li
Guosheng Zhao
Jie Li
Wenkang Qin
Guan Huang
Wenjun Mei
3DGSViTVGen
950
17
0
03 Apr 2025
Scene-Centric Unsupervised Panoptic Segmentation
Scene-Centric Unsupervised Panoptic SegmentationComputer Vision and Pattern Recognition (CVPR), 2025
Oliver Hahn
Christoph Reich
Nikita Araslanov
Daniel Cremers
Christian Rupprecht
Stefan Roth
OCL
329
6
0
02 Apr 2025
Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles
Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous VehiclesIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025
Sorin Grigorescu
Mihai V. Zaha
AI4CE
276
1
0
02 Apr 2025
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and BenchmarkingScandinavian Conference on Image Analysis (SCIA), 2025
Ulas Gunes
Matias Turkulainen
Xuqian Ren
Dieter Büchler
Arno Solin
Esa Rahtu
3DV
303
2
0
02 Apr 2025
UnIRe: Unsupervised Instance Decomposition for Dynamic Urban Scene Reconstruction
UnIRe: Unsupervised Instance Decomposition for Dynamic Urban Scene Reconstruction
Yunxuan Mao
R. Xiong
Longji Xu
Yiyi Liao
3DPC
1.0K
1
0
01 Apr 2025
ADGaussian: Generalizable Gaussian Splatting for Autonomous Driving with Multi-modal Inputs
ADGaussian: Generalizable Gaussian Splatting for Autonomous Driving with Multi-modal Inputs
Qi Song
Chenghong Li
Haotong Lin
Sida Peng
Rui Huang
3DGS
383
3
0
01 Apr 2025
Intrinsic-feature-guided 3D Object Detection
Intrinsic-feature-guided 3D Object Detection
Wanjing Zhang
Chenxing Wang
3DPC
247
0
0
01 Apr 2025
Zero-Shot 4D Lidar Panoptic Segmentation
Zero-Shot 4D Lidar Panoptic SegmentationComputer Vision and Pattern Recognition (CVPR), 2025
Yushan Zhang
Aljosa Osep
Laura Leal-Taixé
Tim Meinhardt
3DPC
342
5
0
01 Apr 2025
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Xingyu Chen
Yue Chen
Yuliang Xiu
Andreas Geiger
Anpei Chen
3DPCVGen
393
45
0
31 Mar 2025
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving
Yuping Wang
Xiangyu Huang
Xiaokang Sun
Mingxuan Yan
Shuo Xing
Zhengzhong Tu
Jiachen Li
360
13
0
31 Mar 2025
A Benchmark for Vision-Centric HD Mapping by V2I Systems
A Benchmark for Vision-Centric HD Mapping by V2I Systems
Miao Fan
Shanshan Yu
Shengtong Xu
Kun Jiang
Haoyi Xiong
Xiangzeng Liu
3DV
195
0
0
31 Mar 2025
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
Yongbin Li
Yujiao Shi
Tao Lin
Xiangrui Liu
Wenxiao Cai
Zhengyang Liang
Bo Zhao
LRM
598
35
0
31 Mar 2025
Previous
123456...353637
Next