Operational Optimization of PV-Assisted Heat Pump Systems Using Reinforcement Learning

Authors

DOI:

https://doi.org/10.52825/isec.v2i.3335

Keywords:

Heat Pump, PV, Reinforcement Learning, Cost Optimization

Abstract

Heat pumps are increasingly integrated with PV systems to supply domestic hot water and space heating. End user purchase prices for electricity are typically significantly higher than feed-in tariffs for PV energy. From a financial perspective, it therefore makes sense to increase the self-consumption of local PV electricity and reduce the amount of energy drawn from the grid. One approach is to overheat the thermal energy storage when the heat pump can be operated with PV energy. However, with increasing temperature, heat losses increase and the heat pump’s COP decreases, making excessive overheating uneconomical. The optimal degree varies with time and depends on various factors, such as heat demand, storage capacity and the purchase-to-feed-in price ratio. Using detailed system simulations, we inves-tigate the application of self-adaptive reinforcement learning agents that can dynamically determine the optimal degree of overheating. The agents continuously evaluate environmental and operational inputs to make informed control decisions, enhancing cost-effectiveness. The results of our empirical evaluation demonstrate that the agents reliably learn to avoid unnec-essary hot water overheating and adapt their space heating overheating strategy to the char-acteristics of each system, achieving net electricity costs close to the optimal solution. Across multiple configurations, agents operating in reduced environments—i.e., with fewer state vari-ables—converge more quickly and develop more stable control policies. In contrast, restrict-ing the action space alone does not consistently enhance performance. Although the overall cost reduction potential is modest, amounting to EUR 35–90 per year, the proposed approach has the important advantage of guaranteeing that user comfort is never violated.

Downloads

Download data is not yet available.

References

[1] M. Battaglia, R. Haberl, E. Bamberger, und M. Haller, „Increased self-consumption and grid flexibility of PV and heat pump systems with thermal and electrical storage“, En-ergy Procedia, Bd. 135, S. 358–366, Okt. 2017, doi: 10.1016/j.egypro.2017.09.527.

[2] A. Thür, T. Calabrese, und W. Streicher, „Smart grid and PV driven ground heat pump as thermal battery in small buildings for optimized electricity consumption“, Solar Energy, Bd. 174, S. 273–285, Nov. 2018, doi: 10.1016/j.solener.2018.08.087.

[3] M. Pinamonti, A. Prada, und P. Baggio, „Rule-Based Control Strategy to Increase Photo-voltaic Self-Consumption of a Modulating Heat Pump Using Water Storages and Building Mass Activation“, Energies, Bd. 13, Nr. 23, Art. Nr. 23, Jan. 2020, doi: 10.3390/en13236282.

[4] A. Heinz und R. Rieberer, „Energetic and economic analysis of a PV-assisted air-to-water heat pump system for renovated residential buildings with high-temperature heat emission system“, Applied Energy, Bd. 293, S. 116953, Juli 2021, doi: 10.1016/j.apenergy.2021.116953.

[5] A. Toradmal, T. Kemmler, und B. Thomas, „Boosting the share of onsite PV-electricity utilization by optimized scheduling of a heat pump using buildings thermal inertia“, Applied Thermal Engineering, Bd. 137, S. 248–258, Juni 2018, doi: 10.1016/j.applthermaleng.2018.03.052.

[6] E. Psimopoulos, E. Bee, J. Widén, und C. Bales, „Techno-economic analysis of control algorithms for an exhaust air heat pump system for detached houses coupled to a photo-voltaic system“, Applied Energy, Bd. 249, S. 355–367, Sep. 2019, doi: 10.1016/j.apenergy.2019.04.080.

[7] R. Thygesen und B. Karlsson, „Simulation of a proposed novel weather forecast control for ground source heat pumps as a mean to evaluate the feasibility of forecast controls’ influence on the photovoltaic electricity self-consumption“, Applied Energy, Bd. 164, S. 579–589, Feb. 2016, doi: 10.1016/j.apenergy.2015.12.013.

[8] M. F. Pichler, „Model Predictive Control for heating in residential buildings - PV-led heat pump operation in a single family house“, PHD thesis, Graz University of Technology, In-stitute of Thermal Engineering; https://diglib.tugraz.at/download.php?id=5f31215885404&location=browse, 2016. [Online]. Verfügbar unter: https://diglib.tugraz.at/download.php?id=5f31215885404&location=browse

[9] G. Han, H.-J. Joo, H.-W. Lim, Y.-S. An, W.-J. Lee, und K.-H. Lee, „Data-driven heat pump operation strategy using rainbow deep reinforcement learning for significant reduc-tion of electricity cost“, Energy, Bd. 270, S. 126913, Mai 2023, doi: 10.1016/j.energy.2023.126913.

[10] M. A. Adesanya u. a., „Deep reinforcement learning for PID parameter tuning in green-house HVAC system energy Optimization: A TRNSYS-Python cosimulation approach“, Expert Systems with Applications, Bd. 252, S. 124126, Okt. 2024, doi: 10.1016/j.eswa.2024.124126.

[11] L. Langer und T. Volling, „A reinforcement learning approach to home energy manage-ment for modulating heat pumps and photovoltaic systems“, Applied Energy, Bd. 327, S. 120020, Dez. 2022, doi: 10.1016/j.apenergy.2022.120020.

[12] M. Biemann, F. Scheller, X. Liu, und L. Huang, „Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control“, Applied Energy, Bd. 298, S. 117164, Sep. 2021, doi: 10.1016/j.apenergy.2021.117164.

[13] A. Manjavacas, A. Campoy-Nieves, J. Jiménez-Raboso, M. Molina-Solana, und J. Gómez-Romero, „An experimental evaluation of deep reinforcement learning algorithms for HVAC control“, Artif Intell Rev, Bd. 57, Nr. 7, S. 173, Juni 2024, doi: 10.1007/s10462-024-10819-x.

[14] S. Brandi, M. S. Piscitelli, M. Martellacci, und A. Capozzoli, „Deep reinforcement learning to optimise indoor temperature control and heating energy consumption in buildings“, En-ergy and Buildings, Bd. 224, S. 110225, Okt. 2020, doi: 10.1016/j.enbuild.2020.110225.

[15] S. Schmitz, K. Brucke, P. Kasturi, E. Ansari, und P. Klement, „Forecast-based and data-driven reinforcement learning for residential heat pump operation“, Applied Energy, Bd. 371, S. 123688, Okt. 2024, doi: 10.1016/j.apenergy.2024.123688.

[16] X. Deng, Y. Zhang, Y. Zhang, und H. Qi, „Towards optimal HVAC control in non-stationary building environments combining active change detection and deep reinforce-ment learning“, Building and Environment, Bd. 211, S. 108680, März 2022, doi: 10.1016/j.buildenv.2021.108680.

[17] TRNSYS 18, „TRNSYS 18: a Transient System Simulation Program, Volume 4: Mathe-matical Reference. Solar Energy Laboratory, University of Wisconsin-Madison“. 2021.

[18] M. Mörth u. a., „Grey-Box Model for Efficient Building Simulations: A Case Study of an Integrated Water-Based Heating and Cooling System“, Buildings, Bd. 15, Nr. 11, Art. Nr. 11, Jan. 2025, doi: 10.3390/buildings15111959.

[19] R. Dott u. a., „Models of sub-components and validation for the IEA SHC Task 44/HPP Annex 38 Part C: heat pump models“. 2013. [Online]. Verfügbar unter: https://wvvvv.iea-shc.org/Data/Sites/1/publications/T44A38_Rep_C2_C_HeatPumpModels_Final_Draft_Revised_130610.pdf

[20] A. Heinz, F. Gritzer, und A. Thür, „The effect of using a desuperheater in an air-to-water heat pump system supplying a multi-family building“, Journal of Building Engineering, Bd. 49, S. 104002, Mai 2022, doi: 10.1016/j.jobe.2022.104002.

[21] N. Bernier, B. Marcotte, und M. Kummert, Calling Python from TRNSYS with CFFI. (11. April 2022). Zenodo. doi: 10.5281/zenodo.6523104.

Published

2026-05-20

How to Cite

Heinz, A., Hofer, B., & Wotawa, F. (2026). Operational Optimization of PV-Assisted Heat Pump Systems Using Reinforcement Learning. International Sustainable Energy Conference - Proceedings, 2. https://doi.org/10.52825/isec.v2i.3335

Conference Proceedings Volume

Section

Novel Approaches in Data Usage, Digitisation, Modelling and System Assessment