CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept
MatchingNeural Information Processing Systems (NeurIPS), 2024 |
Crafting Dynamic Virtual Activities with Advanced Multimodal ModelsInternational Symposium on Mixed and Augmented Reality (ISMAR), 2024 |
Renovating Names in Open-Vocabulary Segmentation BenchmarksNeural Information Processing Systems (NeurIPS), 2024 |
Audio-Visual Instance SegmentationComputer Vision and Pattern Recognition (CVPR), 2023 Ruohao Guo Yaru Chen Yanyu Qi Wenzhen Yue Dantong Niu ...Wenzhen Yue Ji Shi Qixun Wang Peiliang Zhang Buwen Liang |
Temporal Transductive Inference for Few-Shot Video Object SegmentationInternational Journal of Computer Vision (IJCV), 2022 |