COST: Contrastive One-Stage Transformer for Vision-Language Small Object TrackingInformation Fusion (Inf. Fusion), 2025 |
MambaTrack: Exploiting Dual-Enhancement for Night UAV TrackingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
RingMo-Aerial: An Aerial Remote Sensing Foundation Model With Affine Transformation Contrastive LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024 |
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal AlignmentACM Multimedia (ACM MM), 2023 |
MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with
Depth InformationInterspeech (Interspeech), 2023 |