Text encoders bottleneck compositionality in contrastive vision-language
modelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
CMR3D: Contextualized Multi-Stage Refinement for 3D Object DetectionACM Multimedia Asia (MA), 2022 |
Multi-Class Multi-Instance Count Conditioned Adversarial Image
GenerationIEEE International Conference on Computer Vision (ICCV), 2021 |
Toward automatic comparison of visualization techniques: Application to
graph visualizationVisual Informatics (VI), 2019 |
Interpretable Counting for Visual Question AnsweringInternational Conference on Learning Representations (ICLR), 2017 |
Be Precise or Fuzzy: Learning the Meaning of Cardinals and Quantifiers
from VisionConference of the European Chapter of the Association for Computational Linguistics (EACL), 2017 |