347

Realised Volatility Forecasting: Machine Learning via Financial Word Embedding

Main:74 Pages
13 Figures
Bibliography:6 Pages
30 Tables
Abstract

This study develops a financial word embedding using 15 years of business news. Our results show that this specialised language model produces more accurate results than general word embeddings, based on a financial benchmark we established. As an application, we incorporate this word embedding into a simple machine learning model to enhance the HAR model for forecasting realised volatility. This approach statistically and economically outperforms established econometric models. Using an explainable AI method, we also identify key phrases in business news that contribute significantly to volatility, offering insights into language patterns tied to market dynamics.

View on arXiv
Comments on this paper