
Title |
|---|
![]() METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship DetectionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025 |
![]() Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual SceneComputer Vision and Pattern Recognition (CVPR), 2025 |
![]() Video-of-Thought: Step-by-Step Video Reasoning from Perception to CognitionInternational Conference on Machine Learning (ICML), 2024 |
![]() Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-ImageNeural Information Processing Systems (NeurIPS), 2024 |
![]() Effectively Leveraging CLIP for Generating Situational Summaries of Images and VideosInternational Journal of Computer Vision (IJCV), 2024 |