Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2405.12821
Cited By
v1
v2
v3 (latest)
Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension
21 May 2024
Runwei Guan
Ruixiao Zhang
Ningwei Ouyang
Tao Huang
Ka Lok Man
Xiaohao Cai
Ming Xu
Jeremy S. Smith
Eng Gee Lim
Yutao Yue
Hui Xiong
Re-assign community
ArXiv (abs)
PDF
HTML
Github (44★)
Papers citing
"Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension"
50 / 61 papers shown
AerialMind: Towards Referring Multi-Object Tracking in UAV Scenarios
Chenglizhao Chen
Shaofeng Liang
Runwei Guan
Xiaolou Sun
Haocheng Zhao
Haiyun Jiang
Tao Huang
Henghui Ding
Qing-Long Han
389
2
0
26 Nov 2025
RoadSceneVQA: Benchmarking Visual Question Answering in Roadside Perception Systems for Intelligent Transportation System
Runwei Guan
Rongsheng Hu
Shangshu Chen
Ningyuan Xiao
Xue Xia
...
Ningwei Ouyang
Shaofeng Liang
Yuxuan Fan
Wanjie Sun
Yutao Yue
174
2
0
23 Nov 2025
DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion
IEEE International Conference on Robotics and Automation (ICRA), 2025
Mengmeng Liu
M. Yang
Jiuming Liu
Yunpeng Zhang
Jiangtao Li
Sander Oude Elberink
G. Vosselman
Hao Cheng
199
2
0
07 Sep 2025
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding
Zhan Shi
Song Wang
Junbo Chen
Jianke Zhu
353
1
0
02 Aug 2025
AI2MMUM: AI-AI Oriented Multi-Modal Universal Model Leveraging Telecom Domain Large Model
IEEE Wireless Communications Letters (WCL), 2025
Tianyu Jiao
Zhuoran Xiao
Yihang Huang
Chenhui Ye
Yijia Feng
...
Fangkun Liu
Yin Xu
Dazhi He
Yunfeng Guan
Wenjun Zhang
200
5
0
15 May 2025
Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving
Runwei Guan
Tao Huang
Ningwei Ouyang
Shaofeng Liang
Daizong Liu
...
Lianqing Zheng
Ming Xu
Yutao Yue
Guoqiang Mao
Hui Xiong
363
1
0
11 Mar 2025
MetaOcc: Spatio-Temporal Fusion of Surround-View 4D Radar and Camera for 3D Occupancy Prediction with Dual Training Strategies
Long Yang
Lianqing Zheng
W. Ai
Minghao Liu
Sen Li
...
Shengyu Yan
Jie Bai
Zhixiong Ma
Tao Huang
Xichan Zhu
993
3
0
26 Jan 2025
DriveLM: Driving with Graph Visual Question Answering
European Conference on Computer Vision (ECCV), 2023
Chonghao Sima
Katrin Renz
Kashyap Chitta
Lawrence Yunliang Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
953
439
0
17 Jan 2025
RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar
Liye Jia
Runwei Guan
Haocheng Zhao
Qiuchi Zhao
Ka Lok Man
Jeremy S. Smith
Limin Yu
Yutao Yue
453
6
0
04 Jan 2025
radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction
IEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2024
Yanmei Zhang
Rui Yang
Yutao Yue
Eng Gee Lim
377
8
0
11 Oct 2024
radarODE: An ODE-Embedded Deep Learning Model for Contactless ECG Reconstruction from Millimeter-Wave Radar
IEEE Transactions on Mobile Computing (IEEE TMC), 2024
Yizheng Wu
Jun Cen
Xingyi Li
Rui Yang
Yutao Yue
Guo-Shing Lin
361
14
0
03 Aug 2024
A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Daizong Liu
Yang Liu
Wencan Huang
Wei Hu
LM&Ro
419
35
0
09 Jun 2024
Radar Spectra-Language Model for Automotive Scene Parsing
Mariia Pushkareva
Yuri Feldman
Csaba Domokos
K. Rambach
Dotan Di Castro
322
5
0
04 Jun 2024
DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection
IEEE Transactions on Intelligent Vehicles (TIV), 2024
F. Fent
Andras Palffy
Holger Caesar
310
32
0
03 Apr 2024
DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment
Jiuming Liu
Dong Zhuo
Zhiheng Feng
Siting Zhu
Chensheng Peng
Yanfeng Guo
Hesheng Wang
558
37
0
27 Mar 2024
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection
Zhiwei Lin
Zhe Liu
Zhongyu Xia
Xinhao Wang
Yongtao Wang
Shengxiang Qi
Yang Dong
Nan Dong
Le Zhang
Ce Zhu
332
130
0
25 Mar 2024
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar
Runwei Guan
Liye Jia
Fengyufan Yang
Shanliang Yao
Erick Purwanto
...
Eng Gee Lim
Jeremy S. Smith
Ka Lok Man
Xuming Hu
Yutao Yue
452
22
0
19 Mar 2024
Large Multimodal Agents: A Survey
Junlin Xie
Zhihong Chen
Ruifei Zhang
Xiang Wan
Guanbin Li
LM&Ro
LLMAG
261
102
0
23 Feb 2024
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding
Senqiao Yang
Jiaming Liu
Ray Zhang
Mingjie Pan
Zoey Guo
Xiaoqi Li
Zehui Chen
Shiyang Feng
Yandong Guo
Shanghang Zhang
3DV
446
127
0
21 Dec 2023
Mono3DVG: 3D Visual Grounding in Monocular Images
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yangfan Zhan
Yuan. Yuan
Zhitong Xiong
MDE
295
37
0
13 Dec 2023
PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Weixin Mao
Tiancai Wang
Diankun Zhang
Junjie Yan
Osamu Yoshie
3DPC
222
16
0
29 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
397
479
0
21 Nov 2023
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models
IEEE International Conference on Robotics and Automation (ICRA), 2023
Tsun-Hsuan Wang
Alaa Maalouf
Wei Xiao
Yutong Ban
Alexander Amini
Guy Rosman
S. Karaman
Daniela Rus
250
75
0
26 Oct 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
361
160
0
22 Oct 2023
Language Prompt for Autonomous Driving
AAAI Conference on Artificial Intelligence (AAAI), 2023
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
Jianbing Shen
VLM
541
143
0
08 Sep 2023
ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Runwei Guan
Shanliang Yao
Xiaohui Zhu
Ka Lok Man
Yong Yue
Jeremy S. Smith
Eng Gee Lim
Yutao Yue
440
13
0
20 Aug 2023
SMURF: Spatial Multi-Representation Fusion for 3D Object Detection with 4D Imaging Radar
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Tao Huang
Qiuchi Zhao
Weiyi Xiong
Wei Chen
Qinghua Han
Bing Zhu
514
112
0
20 Jul 2023
RCM-Fusion: Radar-Camera Multi-Level Fusion for 3D Object Detection
IEEE International Conference on Robotics and Automation (ICRA), 2023
Ji Song Kim
Minjae Seong
Geonho Bang
Dongsuk Kum
Junwon Choi
672
32
0
17 Jul 2023
Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar
Runwei Guan
Shanliang Yao
Xiaohui Zhu
Ka Lok Man
Eng Gee Lim
Jeremy S. Smith
Yong 0001Yue
Yutao Yue
VOS
252
32
0
14 Jul 2023
WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces
Shanliang Yao
Runwei Guan
Zhaodong Wu
Yi Ni
Zile Huang
...
H. Seo
Ka Lok Man
Jieming Ma
Xiaohui Zhu
Yutao Yue
409
87
0
13 Jul 2023
LXL: LiDAR Excluded Lean 3D Object Detection with 4D Imaging Radar and Camera Fusion
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Weiyi Xiong
Tao Huang
Wei Chen
Qingyan Han
Yu Xia
Bing Zhu
568
115
0
03 Jul 2023
4D Millimeter-Wave Radar in Autonomous Driving: A Survey
Zeyu Han
Jiahao Wang
Zikun Xu
Shuocheng Yang
Lei He
Shaobing Xu
Jianqiang Wang
Keqiang Li
552
60
0
07 Jun 2023
GRES: Generalized Referring Expression Segmentation
Computer Vision and Pattern Recognition (CVPR), 2023
Chang Liu
Henghui Ding
Xudong Jiang
415
281
0
01 Jun 2023
Language-Guided 3D Object Detection in Point Cloud for Autonomous Driving
Wenhao Cheng
Junbo Yin
Wei Li
Ruigang Yang
Jianbing Shen
3DPC
239
22
0
25 May 2023
PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds
Computer Vision and Pattern Recognition (CVPR), 2023
Jin-Yang Li
Chenxu Luo
Xiaodong Yang
3DPC
404
145
0
08 May 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
313
205
0
20 Apr 2023
CRN: Camera Radar Net for Accurate, Robust, Efficient 3D Perception
IEEE International Conference on Computer Vision (ICCV), 2023
Youngseok Kim
Juyeb Shin
Sanmin Kim
In-Jae Lee
Junwon Choi
Dongsuk Kum
719
125
0
03 Apr 2023
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance
IEEE International Conference on Computer Vision (ICCV), 2023
Zoey Guo
Yiwen Tang
Renrui Zhang
Dong Wang
Zhigang Wang
Bin Zhao
Xuelong Li
615
83
0
29 Mar 2023
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking
Computer Vision and Pattern Recognition (CVPR), 2023
Yukang Chen
Jianhui Liu
Xiangyu Zhang
Xiaojuan Qi
Jiaya Jia
3DPC
366
414
0
20 Mar 2023
Referring Multi-Object Tracking
Computer Vision and Pattern Recognition (CVPR), 2023
Dongming Wu
Wencheng Han
Tiancai Wang
Xingping Dong
Xiangyu Zhang
Jianbing Shen
264
128
0
06 Mar 2023
LidarCLIP or: How I Learned to Talk to Point Clouds
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Georg Hess
Adam Tonderski
Christoffer Petersson
Kalle AAstrom
Lennart Svensson
DiffM
409
33
0
13 Dec 2022
Language Conditioned Spatial Relation Reasoning for 3D Object Grounding
Neural Information Processing Systems (NeurIPS), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
337
145
0
17 Nov 2022
RaLiBEV: Radar and LiDAR BEV Fusion Learning for Anchor Box Free Object Detection Systems
Yanlong Yang
Tao Huang
Wei Chen
Qinghua Han
Gang Ma
Bing Zhu
471
44
0
11 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Computer Vision and Pattern Recognition (CVPR), 2022
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Jiaming Song
Xiaogang Wang
Yu Qiao
VLM
648
1,046
0
10 Nov 2022
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2022
Yanmin Wu
Xinhua Cheng
Renrui Zhang
Zesen Cheng
Jian Zhang
405
119
0
29 Sep 2022
CenterFormer: Center-based Transformer for 3D Object Detection
European Conference on Computer Vision (ECCV), 2022
Zixiang Zhou
Xian Zhao
Yu Wang
Panqu Wang
H. Foroosh
3DPC
ViT
254
192
0
12 Sep 2022
K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather Conditions
Neural Information Processing Systems (NeurIPS), 2022
Dong-Hee Paek
Seung-Hyun Kong
Kevin Tirta Wijaya
631
186
0
16 Jun 2022
Multi-View Transformer for 3D Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2022
Shijia Huang
Yilun Chen
Jiaya Jia
Liwei Wang
456
188
0
05 Apr 2022
Deep Instance Segmentation with Automotive Radar Detection Points
Tao Huang
Weiyi Xiong
Liping Bai
Yu Xia
Wei Chen
Wanli Ouyang
Bing Zhu
466
73
0
05 Oct 2021
YOLOX: Exceeding YOLO Series in 2021
Zheng Ge
Songtao Liu
Feng Wang
Zeming Li
Jian Sun
ObjD
861
5,590
0
18 Jul 2021
1
2
Next
Page 1 of 2