Learning Language Structures through GroundingAAAI Conference on Artificial Intelligence (AAAI), 2024 Freda Shi |
Kiki or Bouba? Sound Symbolism in Vision-and-Language ModelsNeural Information Processing Systems (NeurIPS), 2023 |
Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining
on Visual Language UnderstandingComputer Vision and Pattern Recognition (CVPR), 2023 |
Does Vision Accelerate Hierarchical Generalization of Neural Language
Learners?International Conference on Computational Linguistics (COLING), 2023 |
Towers of Babel: Combining Images, Language, and 3D Geometry for
Learning Multimodal VisionIEEE International Conference on Computer Vision (ICCV), 2021 |
Video-aided Unsupervised Grammar InductionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021 |
Visually Grounded Compound PCFGsConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |