v1v2 (latest)

Optimal Execution with Reinforcement Learning

10 November 2024

Yadh Hafsi

Edoardo Vittori

ArXiv (abs)PDF HTML

Main:7 Pages

9 Figures

Bibliography:1 Pages

3 Tables

Abstract

This study investigates the development of an optimal execution strategy through reinforcement learning, aiming to determine the most effective approach for traders to buy and sell inventory within a finite time horizon. Our proposed model leverages input features derived from the current state of the limit order book and operates at a high frequency to maximize control. To simulate this environment and overcome the limitations associated with relying on historical data, we utilize the multi-agent market simulator ABIDES, which provides a diverse range of depth levels within the limit order book. We present a custom MDP formulation followed by the results of our methodology and benchmark the performance against standard execution strategies. Results show that the reinforcement learning agent outperforms standard strategies and offers a practical foundation for real-world trading applications.

View on arXiv

Comments on this paper