Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation LearningInternational Conference on Learning Representations (ICLR), 2025 |
Contextual AD Narration with Interleaved Multimodal SequenceComputer Vision and Pattern Recognition (CVPR), 2024 |