36
0

The Harmonic Structure of Information Contours

Main:9 Pages
6 Figures
Bibliography:6 Pages
13 Tables
Appendix:9 Pages
Abstract

The uniform information density (UID) hypothesis proposes that speakers aim to distribute information evenly throughout a text, balancing production effort and listener comprehension difficulty. However, language typically does not maintain a strictly uniform information rate; instead, it fluctuates around a global average. These fluctuations are often explained by factors such as syntactic constraints, stylistic choices, or audience design. In this work, we explore an alternative perspective: that these fluctuations may be influenced by an implicit linguistic pressure towards periodicity, where the information rate oscillates at regular intervals, potentially across multiple frequencies simultaneously. We apply harmonic regression and introduce a novel extension called time scaling to detect and test for such periodicity in information contours. Analyzing texts in English, Spanish, German, Dutch, Basque, and Brazilian Portuguese, we find consistent evidence of periodic patterns in information rate. Many dominant frequencies align with discourse structure, suggesting these oscillations reflect meaningful linguistic organization. Beyond highlighting the connection between information rate and discourse structure, our approach offers a general framework for uncovering structural pressures at various levels of linguistic granularity.

View on arXiv
@article{tsipidi2025_2506.03902,
  title={ The Harmonic Structure of Information Contours },
  author={ Eleftheria Tsipidi and Samuel Kiegeland and Franz Nowak and Tianyang Xu and Ethan Wilcox and Alex Warstadt and Ryan Cotterell and Mario Giulianelli },
  journal={arXiv preprint arXiv:2506.03902},
  year={ 2025 }
}
Comments on this paper