Learning Music Audio Representations With Limited Data | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025 |
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning | International Conference on Learning Representations (ICLR), 2023 |
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training | Neural Information Processing Systems (NeurIPS), 2022 |
How does the pre-training objective affect what large language models learn about linguistic properties? | Annual Meeting of the Association for Computational Linguistics (ACL), 2022 |
What's Hidden in a One-layer Randomly Weighted Transformer? | Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little | Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |