ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.05364
22
0

Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation

7 April 2025
Manvi Agarwal
Changhong Wang
Gaël Richard
ArXivPDFHTML
Abstract

While music remains a challenging domain for generative models like Transformers, a two-pronged approach has recently proved successful: inserting musically-relevant structural information into the positional encoding (PE) module and using kernel approximation techniques based on Random Fourier Features (RFF) to lower the computational cost from quadratic to linear. Yet, it is not clear how such RFF-based efficient PEs compare with those based on rotation matrices, such as Rotary Positional Encoding (RoPE). In this paper, we present a unified framework based on kernel methods to analyze both families of efficient PEs. We use this framework to develop a novel PE method called RoPEPool, capable of extracting causal relationships from temporal sequences. Using RFF-based PEs and rotation-based PEs, we demonstrate how seemingly disparate PEs can be jointly studied by considering the content-context interactions they induce. For empirical validation, we use a symbolic music generation task, namely, melody harmonization. We show that RoPEPool, combined with highly-informative structural priors, outperforms all methods.

View on arXiv
@article{agarwal2025_2504.05364,
  title={ Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation },
  author={ Manvi Agarwal and Changhong Wang and Gael Richard },
  journal={arXiv preprint arXiv:2504.05364},
  year={ 2025 }
}
Comments on this paper