
Title |
|---|
On-line Policy Improvement using Monte-Carlo Search. Neural Information Processing Systems (NeurIPS), 1996 |
Enhancing Chess Reinforcement Learning with Graph Representation. Neural Information Processing Systems (NeurIPS), 2024 |
Online 3D Bin Packing Reinforcement Learning Solution with Buffer. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022 |
Scaling Laws Under the Microscope: Predicting Transformer Performance from Small Scale Experiments. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |