ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.10018
63
19
v1v2 (latest)

Dynamic pricing and assortment under a contextual MNL demand

19 October 2021
Vineet Goyal
Noémie Périvier
ArXiv (abs)PDFHTML
Abstract

We consider dynamic multi-product pricing and assortment problems under an unknown demand over T periods, where in each period, the seller decides on the price for each product or the assortment of products to offer to a customer who chooses according to an unknown Multinomial Logit Model (MNL). Such problems arise in many applications, including online retail and advertising. We propose a randomized dynamic pricing policy based on a variant of the Online Newton Step algorithm (ONS) that achieves a O(dTlog⁡(T))O(d\sqrt{T}\log(T))O(dT​log(T)) regret guarantee under an adversarial arrival model. We also present a new optimistic algorithm for the adversarial MNL contextual bandits problem, which achieves a better dependency than the state-of-the-art algorithms in a problem-dependent constant κ2\kappa_2κ2​ (potentially exponentially small). Our regret upper bound scales as O~(dκ2T+log⁡(T)/κ2)\tilde{O}(d\sqrt{\kappa_2 T}+ \log(T)/\kappa_2)O~(dκ2​T​+log(T)/κ2​), which gives a stronger bound than the existing O~(dT/κ2)\tilde{O}(d\sqrt{T}/\kappa_2)O~(dT​/κ2​) guarantees.

View on arXiv
Comments on this paper