Deep reinforcement learning from human preferences. Neural Information Processing Systems (NeurIPS), 2017 |
Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks. International Conference on Learning Representations (ICLR), 2017 |
Constrained Policy Optimization. International Conference on Machine Learning (ICML), 2017 |
Towards A Rigorous Science of Interpretable Machine Learning. Finale Doshi-Velez, Been Kim |
Strongly-Typed Agents are Guaranteed to Interact Safely. International Conference on Machine Learning (ICML), 2017 |
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks. International Conference on Learning Representations (ICLR), 2016 |