M3: 3D-Spatial MultiModal MemoryInternational Conference on Learning Representations (ICLR), 2025 |
HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video UnderstandingComputer Vision and Pattern Recognition (CVPR), 2025 |
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency ModelEuropean Conference on Computer Vision (ECCV), 2024 |