Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification. Computer Vision and Pattern Recognition (CVPR), 2025 |
HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction. International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024 |
Learning Combinatorial Prompts for Universal Controllable Image Captioning. International Journal of Computer Vision (IJCV), 2023 |
DEVICE: Depth and Visual Concepts Aware Transformer for OCR-based Image Captioning. Pattern Recognition (Pattern Recogn.), 2023 |
OSIC: A New One-Stage Image Captioner Coined. International Joint Conference on Artificial Intelligence (IJCAI), 2022 |
PreSTU: Pre-Training for Scene-Text Understanding. IEEE International Conference on Computer Vision (ICCV), 2022 |