Corruption-Robust Lipschitz Contextual Search

I study the problem of learning a Lipschitz function with corrupted binary signals. The learner tries to learn a -Lipschitz function that the adversary chooses. There is a total of rounds. In each round , the adversary selects a context vector in the input space, and the learner makes a guess to the true function value and receives a binary signal indicating whether the guess is high or low. In a total of rounds, the signal may be corrupted, though the value of is \emph{unknown} to the learner. The learner's goal is to incur a small cumulative loss. This work introduces the new algorithmic technique \emph{agnostic checking} as well as new analysis techniques. I design algorithms which: for the symmetric loss, the learner achieves regret with and with ; for the pricing loss, the learner achieves regret .
View on arXiv