Lightweight Video Denoising Using a Classic Bayesian Backbone. IEEE International Conference on Multimedia and Expo (ICME), 2024. Clement Bled, François Pitié |
GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer. International Joint Conference on Artificial Intelligence (IJCAI), 2024 |
POA: Pre-training Once for Models of All Sizes. European Conference on Computer Vision (ECCV), 2024 |
Towards Flexible Evaluation for Generative Visual Question Answering. ACM Multimedia (MM), 2024 |
LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera. Yukai Ma, Jianbiao Mei, Xuemeng Yang, Licheng Wen, Weihua Xu, Jiangning Zhang, Ding Wang, Yong-Jin Liu, Xingxing Zuo |
Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation. Interspeech (Interspeech), 2024 |
What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages. Annual Meeting of the Association for Computational Linguistics (ACL), 2024 |