Data Center Cooling System Optimization Using Offline Reinforcement Learning

17 February 2025

Abstract

The recent advances in information technology and artificial intelligence have fueled a rapid expansion of the data center (DC) industry worldwide, accompanied by an immense appetite for electricity to power the DCs. In a typical DC, around 30~40% of the energy is spent on the cooling system rather than on computer servers, posing a pressing need for developing new energy-saving optimization technologies for DC cooling systems. However, optimizing such real-world industrial systems faces numerous challenges, including but not limited to a lack of reliable simulation environments, limited historical data, and stringent safety and control robustness requirements. In this work, we present a novel physics-informed offline reinforcement learning (RL) framework for energy efficiency optimization of DC cooling systems. The proposed framework models the complex dynamical patterns and physical dependencies inside a server room using a purposely designed graph neural network architecture that is compliant with the fundamental time-reversal symmetry. Because of its well-behaved and generalizable state-action representations, the model enables sample-efficient and robust latent space offline policy learning using limited real-world operational data. Our framework has been successfully deployed and verified in a large-scale production DC for closed-loop control of its air-cooling units (ACUs). We conducted a total of 2000 hours of short and long-term experiments in the production DC environment. The results show that our method achieves 14~21% energy savings in the DC cooling system, without any violation of the safety or operational constraints. Our results have demonstrated the significant potential of offline RL in solving a broad range of data-limited, safety-critical real-world industrial control problems.

View on arXiv

@article{zhan2025_2501.15085,
  title={ Data Center Cooling System Optimization Using Offline Reinforcement Learning },
  author={ Xianyuan Zhan and Xiangyu Zhu and Peng Cheng and Xiao Hu and Ziteng He and Hanfei Geng and Jichao Leng and Huiwen Zheng and Chenhui Liu and Tianshun Hong and Yan Liang and Yunxin Liu and Feng Zhao },
  journal={arXiv preprint arXiv:2501.15085},
  year={ 2025 }
}

Comments on this paper