ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2511.08548
60
0

A Matter of Interest: Understanding Interestingness of Math Problems in Humans and Language Models

11 November 2025
Shubhra Mishra
Yuka Machino
Gabriel Poesia
Albert Q. Jiang
Joy Hsu
Adrian Weller
Challenger Mishra
David Broman
Joshua B. Tenenbaum
M. Jamnik
Cedegao E. Zhang
Katherine M. Collins
ArXiv (abs)PDFHTML
Main:6 Pages
30 Figures
Bibliography:3 Pages
7 Tables
Appendix:30 Pages
Abstract

The evolution of mathematics has been guided in part by interestingness. From researchers choosing which problems to tackle next, to students deciding which ones to engage with, people's choices are often guided by judgments about how interesting or challenging problems are likely to be. As AI systems, such as LLMs, increasingly participate in mathematics with people -- whether for advanced research or education -- it becomes important to understand how well their judgments align with human ones. Our work examines this alignment through two empirical studies of human and LLM assessment of mathematical interestingness and difficulty, spanning a range of mathematical experience. We study two groups: participants from a crowdsourcing platform and International Math Olympiad competitors. We show that while many LLMs appear to broadly agree with human notions of interestingness, they mostly do not capture the distribution observed in human judgments. Moreover, most LLMs only somewhat align with why humans find certain math problems interesting, showing weak correlation with human-selected interestingness rationales. Together, our findings highlight both the promises and limitations of current LLMs in capturing human interestingness judgments for mathematical AI thought partnerships.

View on arXiv
Comments on this paper