Multi-Agent Reinforcement Learning for Long-Term Network Resource
Allocation through Auction: a V2X ApplicationComputer Communications (Comput. Commun.), 2022 |
Nonstochastic Multiarmed Bandits with Unrestricted DelaysNeural Information Processing Systems (NeurIPS), 2019 |
Linear Bandits with Stochastic Delayed FeedbackInternational Conference on Machine Learning (ICML), 2018 |