PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
Engineering Applications of Artificial Intelligence (Eng. Appl. Artif. Intell.), 2023
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
ACM Multimedia (ACM MM), 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation
IEEE Robotics and Automation Letters (RA-L), 2021
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation
Computer Vision and Image Understanding (CVIU), 2019