Street-View Image Generation from a Bird's-Eye View LayoutIEEE Robotics and Automation Letters (RA-L), 2023 |
YFACC: A Yorùbá speech-image dataset for cross-lingual keyword
localisation through visual groundingSpoken Language Technology Workshop (SLT), 2022 |
Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural
Language QuestionThe VLDB journal (VLDBJ), 2022 |
Multimodal Image Synthesis and Editing: The Generative AI EraIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 |