Are vision language models robust to uncertain inputs?

Papers citing "Are vision language models robust to uncertain inputs?"

No papers