Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.12415
Cited By
VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation
19 March 2024
Hao Wang
Jiayou Qin
Ashish Bastola
Xiwen Chen
John Suchanek
Zihao Gong
Abolfazl Razi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation"
18 / 18 papers shown
Title
A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design
Jie Tian
Martin Taylor Sobczak
Dhanush Patil
Jixin Hou
Lin Pang
...
Yuval Golan
Xiaoming Zhai
Hongyue Sun
Kenan Song
X. U. Wang
LLMAG
AI4CE
53
0
0
25 Mar 2025
Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework
Zhuo Zhi
Chen Feng
Adam Daneshmend
Mine Orlu
Andreas Demosthenous
L. Yin
Da Li
Ziquan Liu
Miguel R. D. Rodrigues
LRM
59
1
0
11 Mar 2025
LLM-Glasses: GenAI-driven Glasses with Haptic Feedback for Navigation of Visually Impaired People
Issatay Tokmurziyev
Miguel Altamirano Cabrera
Muhammad Haris Khan
Yara Mahmoud
Luis Moreno
Dzmitry Tsetserukou
34
0
0
04 Mar 2025
Can LVLMs and Automatic Metrics Capture Underlying Preferences of Blind and Low-Vision Individuals for Navigational Aid?
Na Min An
Eunki Kim
Wan Ju Kang
Sangryul Kim
Hyunjung Shim
James Thorne
36
0
0
15 Feb 2025
Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation
Muhammad Taha Tariq
Congqing Wang
Yasir Hussain
82
0
0
28 Jan 2025
Driving Towards Inclusion: A Systematic Review of AI-powered Accessibility Enhancements for People with Disability in Autonomous Vehicles
Ashish Bastola
Julian Brinkley
Hao Wang
Abolfazl Razi
A. Moshayedi
Abolfazl Razi
46
5
0
10 Jan 2025
LLM-assisted Physical Invariant Extraction for Cyber-Physical Systems Anomaly Detection
Danial Abshari
Chenglong Fu
Meera Sridhar
34
6
0
17 Nov 2024
HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems
Yasaman Haghighi
Celine Demonsant
Panagiotis Chalimourdas
Maryam Tavasoli Naeini
Jhon Kevin Munoz
Bladimir Bacca
Silvan Suter
Matthieu Gani
Alexandre Alahi
EgoV
29
1
0
30 Sep 2024
Motor Focus: Ego-Motion Prediction with All-Pixel Matching
Hao Wang
Jiayou Qin
Xiwen Chen
Ashish Bastola
John Suchanek
Zihao Gong
Abolfazl Razi
24
1
0
25 Apr 2024
Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing
Hochul Hwang
Sunjae Kwon
Yekyung Kim
Donghyun Kim
27
11
0
09 Feb 2024
GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models
Haicheng Liao
Huanming Shen
Zhenning Li
Chengyue Wang
Guofa Li
Yiming Bie
Chengzhong Xu
34
26
0
06 Dec 2023
Prompt Engineering for Healthcare: Methodologies and Applications
Jiaqi Wang
Enze Shi
Sigang Yu
Zihao Wu
Chong Ma
...
Dajiang Zhu
Yixuan Yuan
Dinggang Shen
Tianming Liu
Shu Zhang
LM&MA
42
106
0
28 Apr 2023
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
139
430
0
10 Jul 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
315
8,261
0
28 Jan 2022
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Tsung-Yi Lin
Weicheng Kuo
Yin Cui
VLM
ObjD
223
897
0
28 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
On Extractive and Abstractive Neural Document Summarization with Transformer Language Models
Sandeep Subramanian
Raymond Li
Jonathan Pilault
C. Pal
229
212
0
07 Sep 2019
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
281
35,677
0
08 Jun 2015
1