Calibration Matters: Tackling Maximization Bias in Large-scale
Advertising Recommendation SystemsInternational Conference on Learning Representations (ICLR), 2022 |
Action Candidate Driven Clipped Double Q-learning for Discrete and
Continuous Action TasksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022 |
Action Candidate Based Clipped Double Q-learning for Discrete and
Continuous Action TasksAAAI Conference on Artificial Intelligence (AAAI), 2021 |
Regularized Softmax Deep Multi-Agent -LearningNeural Information Processing Systems (NeurIPS), 2021 |