Forecasting geothermal temperature in western Yemen with Bayesian-optimized machine learning regression models
DOI:
https://doi.org/10.1186/s40517-024-00324-3Keywords:
Machine learning, Geothermal temperature forecasting, Optimization, YemenAbstract
Geothermal energy is a sustainable resource for power generation, particularly in Yemen. Efficient utilization necessitates accurate forecasting of subsurface temperatures, which is challenging with conventional methods. This research leverages machine learning (ML) to optimize geothermal temperature forecasting in Yemen’s western region. The data set, collected from 108 geothermal wells, was divided into two sets: set 1 with 1402 data points and set 2 with 995 data points. Feature engineering prepared the data for model training. We evaluated a suite of machine learning regression models, from simple linear regression (SLR) to multi-layer perceptron (MLP). Hyperparameter tuning using Bayesian optimization (BO) was selected as the optimization process to boost model accuracy and performance. The MLP model outperformed others, achieving high values and low error values across all metrics after BO. Specifically, MLP achieved of 0.999, with MAE of 0.218, RMSE of 0.285, RAE of 4.071%, and RRSE of 4.011%. BO significantly upgraded the Gaussian process model, achieving an of 0.996, a minimum MAE of 0.283, RMSE of 0.575, RAE of 5.453%, and RRSE of 8.717%. The models demonstrated robust generalization capabilities with high values and low error metrics (MAE and RMSE) across all sets. This study highlights the potential of enhanced ML techniques and the novel BO in optimizing geothermal energy resource exploitation, contributing significantly to renewable energy research and development.
References
Aggarwal CC. An introduction to outlier analysis. In: Outlier analysis. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6396-2_1. 2013.
Al-Fakih A, Li K. Study of geothermal energy resources of Yemen for electric power generation. GRC Trans. 2019:42(2018).
Al-Fakih A, Kaka S. Application of artificial intelligence in static formation temperature estimation. Arab J Sci Eng. 2023;48:16791–804. https://doi.org/10.1007/s13369-023-08096-x.
Al-Fakih A, Al-khudafi A. Unlocking the potential of geothermal energy in Yemen: a comparative analysis with global trends. 49th Workshop on Geothermal Reservoir Engineering Stanford University, Stanford, California, 2024:1-10.
Al-Fakih A, Abdulraheem A, Kaka S. Application of machine learning and deep learning in geothermal resource development: Trends and perspectives. Deep Underground Science and Engineering. 2024. https://doi.org/10.1002/dug2.12098.
Alnethary M, Sharian A, Mattash M, Minissale A. Evaluation of the geothermal explorations In Yemen (Western Area And The Red Sea). 49th Workshop on Geothermal Reservoir Engineering Stanford University, Stanford, California, 2024;1–12.
Al-Sabri A, Al-Kohlani T, Al-Nethary M, Sharian A, Al-Dukhain A, Al-Hosam A, Al-Hosam M, Sultan M. Geothermal exploration in some interest geothermal area in the Republic of Yemen. IOP Conf Ser Earth Environ Sci. 2019:249(1). https://doi.org/10.1088/1755-1315/249/1/012003.
Al-wesabi, I., Zhijian, F., Bosah, C. P., & Dong, H. (2022). A review of Yemen’s current energy situation, challenges, strategies, and prospects for using renewable energy systems. Environmental Science and Pollution Research, 29(36), 53907–53933. https://doi.org/10.1007/s11356-022-21369-6.
Aminzadeh F, Temizel C, Hajizadeh Y. References. In Aminzadeh F, Temizel C, Hajizadeh Y, editors. Artificial intelligence and data analytics for energy exploration and production. 2022. https://doi.org/10.1002/9781119879893.refs.
Breiman L. Random forests. Mach Learn. 2001;45(1):5–32. https://doi.org/10.1023/A:1010933404324.
Breiman L, Friedman JH, Olshen RA, Stone CJ. Classification and regression trees. Biometrics. 1984;40:874.
Breunig MM, Kriegel H-P, Ng RT, Sander J. LOF: identifying density-based local outliers. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dalles, 15–18 May 2000, 2000:93–104. https://doi.org/10.1145/342009.335388.
Broomhead DS, Lowe D. Multivariable functional interpolation and adaptive networks. Complex Syst. 1988:2.
Bourhis P, Cousin B, Rotta AF, Laloui L. Machine learning enhancement of thermal response tests for geothermal potential evaluations at site and regional scales. Geothermics. 2021;95:102132. https://doi.org/10.1016/j.geothermics.2021.102132.
Degen D, Cacace M, Moulaeifard M. The value of scientific machine learning for geothermal applications. EGU General Assembly. 2022.
Duplyakin D, Beckers KF, Siler DL, Martin MJ, Heasler P. Modeling subsurface performance of a geothermal reservoir using machine learning. Energies, 2022;15(967).
Draper NR, Smith H. Applied regression analysis. New Jersey: Wiley;1998. 326.
Ewees AA, Vo Thanh H, Al-qaness MAA, Abd Elaziz M, Samak AH. Smart predictive viscosity mixing of CO2-N2 using optimized dendritic neural networks to implicate for carbon capture utilization and storage. J Environ Chem Eng. 2024;12(2):112210. https://doi.org/10.1016/j.jece.2024.112210.
Gudala M, Kumar S, Ghafoor K. Numerical investigations on a geothermal reservoir using fully coupled thermo-hydro-geomechanics with integrated RSM-machine learning and ARIMA models. Geothermics. 2021;96(January):102174. https://doi.org/10.1016/j.geothermics.2021.102174.
Guodong Chen, Jiu Jimmy Jiao, Chuanyin Jiang, Xin Luo, Surrogate-assisted level-based learning evolutionary search for geothermal heat extraction optimization, Renewable and Sustainable Energy Reviews, Part B, 113860, ISSN 2024;189:1364–0321, https://doi.org/10.1016/j.rser.2023.113860.
Haklidir FST. Prediction of reservoir temperatures using hydrogeo-chemical data. Western Anatolia Geothermal Systems (Turkey): A Machine Learning Approach. Natural Resources Research; 2019. https://doi.org/10.1007/s11053-019-09596-0.
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The WEKA data mining software: an update. ACM SIGKDD Explor Newsl. 2009;11:10–8. https://doi.org/10.1145/1656274.1656278.
Han JW, Kamber M, Pei J. Data mining concepts and techniques. 3rd ed. Waltham: Morgan Kaufmann Publishers; 2012.
Hashim Alkipsy, E. I., Raju, V., & Kumar, H. (2020). A Review of the Challenges of Energy Sector and Prospects of Renewable Energy Utilization in Yemen. Global Journal of Management and Business Research, 1–7. https://doi.org/10.34257/gjmbravol20is8pg1.
Ishitsuka K, Kobayashi Y, Watanabe N, Yamaya Y, Bjarkason E, Suzuki A, Mogi T, Asanuma H, Kajiwara T, Sugimoto T, Saito R. Bayesian and neural network approaches to estimate deep temperature distribution for assessing a supercritical geothermal system: evaluation using a numerical model. Nat Resour Res. 2021;30(5):3289–314. https://doi.org/10.1007/s11053-021-09874-w.
Jiang A, Qin Z, Faulder D, Cladouhos TT, Jafarpour B. A multiscale recurrent neural network model for predicting energy production from geothermal reservoirs. Geothermics. 2023;110:102643. https://doi.org/10.1016/j.geothermics.2022.102643.
Kamra AA. Yemen geothermal resources. Trans Geother Resour Council. 2006;30:637–42.
Kohavi R. A Study of cross-validation and Bootstrap for Accuracy Estimation and Model Selection. In Ijcai. 1995;14:1137–45.
Kubati, M. Al, Mattash, M. A., Alnethary, M. F., Minissale, A., & Vaselli, O. (2015). Geothermal Exploration and Geothermometric Characteristics of Western Area in Yemen. World Geothermal Congress 2015 Melbourne, Australia, April, 19–25.
Kullick J, Hackl CM. Dynamic modeling and simulation of deep geothermal electric submersible pumping systems. Energies. 2017;10:1659. https://doi.org/10.3390/en10101659.
Kutner MH, Nachtsheim CJ, Neter J, Li W. Applied linear statistical models. 5th ed. Irwin, New York: McGraw-Hill; 2005.
Maktoubian J, Taskhiri MS, Turner P. Intelligent Predictive Maintenance (IPdM) in forestry: a review of challenges and opportunities. Forests. 2021;12:1495. https://doi.org/10.3390/f12111495.
Malkomes G, Schaff C, Garnett R. Bayesian optimization for automated model selection. 30th Conference on Neural Information Processing Systems (NIPS 2016), Nips 2016.
Mann S, Singh G. Application of M5P model tree and artificial neural networks for traffic noise prediction on highways of India. Civil Environ Eng Rep. 2024;34(2):45–62. https://doi.org/10.59440/ceer/188375.
Minissale A. A simple geochemical prospecting method for geothermal resources in flat areas areas. Geothermics. 2018;72(March):258–67. https://doi.org/10.1016/j.geothermics.2017.12.001.
Moraga J, Duzgun HS, Cavur MS. The geothermal artificial intelligence for geothermal exploration keywords list of abbreviations. Renew Energy. 2022. https://doi.org/10.1016/j.renene.2022.04.113.
Nwokediegwu ZQS, Ibekwe KI, Ilojianya VI, Etukudoh EA, Ayorinde OB. Renewable energy technologies in engineering: a review of current developments and future prospects. Eng Sci Technol J. 2024;5(2):367–84.
Platt J. Sequential minimal optimization: a fast algorithm for training support vector machines. Tech. Rep. Microsoft Research, Technical Report msr-tr-98-14. 1998.
Poorya S, Lialestani M, Parcerisa D, Himi M, Shahri AA. Generating 3D geothermal maps in Catalonia, Spain using a hybrid adaptive multitask deep learning procedure. Energies, 2022;15(4602).
Quinlan JR. Simplifying decision trees. Int J Man-Mach Stud. 1987;27(3):221–34. https://doi.org/10.1016/S0020-7373(87)80053-6.
Quinlan JR. Learning with continuous classes. Proceedings of Australian Joint Conference on Artificial Intelligence, Hobart 16–18 November 1992, 1992;343-348.
Rasmussen, C.E. (2004). Gaussian Processes in Machine Learning. In: Bousquet, O. von Luxburg, U. & R¨atsch, G. (eds) Advanced Lectures on Machine Learning. ML 2003. Lecture Notes in Computer Science, vol 3176. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28650-9-4.
Rosenblatt, F. (1962). Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Spartan Books, Washington DC. http://catalog.hathitrust.org/Record/000203591.
Santoso RK, Degen D, Knapp D, Pechnig R. Uncertainty quantification with a physics-based machine learning method for geothermal-well targeting: a case study of The Hague. Netherlands: EGU General Assembly; 2024.
Santoyo E, Acevedo-Anicasio A, Díaz-González L. Evaluation of artificial neural networks for the prediction of deep reservoir temperatures using the gas-phase composition of geothermal fluids. Comput Geosci. 2019;129(May):49–68. https://doi.org/10.1016/j.cageo.2019.05.004.
Shahdi, A., Lee, S., Karpatne, A. et al. Exploratory analysis of machine learning methods in predicting subsurface temperature and geothermal gradient of Northeastern United States. Geotherm Energy 9, 18 (2021). https://doi.org/10.1186/s40517-021-00200-4.
Song H, Diethe T, Kull M, Flach P. Distribution calibration for regression. arXiv preprint arXiv:1905.06023. 2019.
Spichak VV. Neural network approach to the temperature estimation. In Electromagnetic geothermometry (Issue 1999, pp. 57-75). https://doi.org/10.1016/B978-0-12-802210-8/00003-4. 2015.
Sutarmin YD. Subsurface temperature prediction in the geothermal field with neural network using 3D MT data inversion and borehole temperature data. AIP Conf Proc. 2021;2320(040008):1–5.
Tukey JW. Exploratory data analysis. Reading, Massachusetts: Addison-Wesley; 1977.
Varol Altay E, Gurgenc E, Altay O, Dikici A. Hybrid artificial neural network based on a metaheuristic optimization algorithm for the prediction of reservoir temperature using hydrogeochemical data of different geothermal areas in Anatolia (Turkey). Geothermics. 2022;104:102476. https://doi.org/10.1016/j.geothermics.2022.102476.
Vo Thanh H, Zhang H, Rahimi M, Ashraf U, Migdady H, Daoud MS, Abualigah L. Enhancing carbon sequestration: innovative models for wettability dynamics in CO2-brine-mineral systems. J Environ Chem Eng. 2024a;12(5):113435. https://doi.org/10.1016/j.jece.2024.113435.
Vo Thanh H, Rahimi M, Tangparitkul S, Promsuk N. Modeling the thermal transport properties of hydrogen and its mixtures with greenhouse gas impurities: a data-driven machine learning approach. Int J Hydrogen Energy. 2024b;83:1–12. https://doi.org/10.1016/j.ijhydene.2024.08.100.
Vo Thanh H, Dai Z, Du Z, Yin H, Yan B, Soltanian MR, Xiao T, McPherson B, Abualigah L. Artificial intelligence-based prediction of hydrogen adsorption in various kerogen types: implications for underground hydrogen storage and cleaner production. Int J Hydrogen Energy. 2024c;57:1000–9. https://doi.org/10.1016/j.ijhydene.2024.01.115.
Wang, L., Dernoncourt, F., & Bui, T. (2020). Bayesian optimization for selecting efficient machine learning models. ArXiv, abs/2008.00386. Retrieved from https://api.semanticscholar.org/CorpusID:220936142.
Wei P, Bamisile O, Adun H, Cai D, Obiora S, Li J, Huang Q. Bibliographical progress in hybrid renewable energy systems’ integration, modelling, optimization, and artificial intelligence applications: A critical review and future research perspective. Energy Sources Part A Recov Util Environ Effects. 2023;45(1):2058–88. https://doi.org/10.1080/15567036.2023.2181888.
Xu S, Hu CN. Predicting terrestrial heat flow in North China using multiple geological and geophysical datasets based on machine. Energies, 2023;16(1620).
Zayed ME, Shboul B, Yin H, Zhao J, Zayed AAA. Recent advances in geothermal energy reservoirs modeling: challenges and potential of thermo-fluid integrated models for reservoir heat extraction and geothermal energy piles design. J Energy Storage. 2023;62:106835. https://doi.org/10.1016/j.est.2023.106835.
Zhang H, Wang P, Rahimi M. Vo, Thanh H, Wang Y, Dai Z, Zheng Q, Cao Y. Catalyzing net-zero carbon strategies: enhancing CO2 flux prediction from underground coal fires using optimized machine learning models. J Clean Prod. 2024;441:141043. https://doi.org/10.1016/j.jclepro.2024.141043.
Downloads
Published
Repository
Section
License
Copyright (c) 2026 "This Open Access article is distributed under the Creative Commons Attribution 4.0 International License (CC BY 4.0), permitting unrestricted use, distribution, and adaptation provided the original author and source are properly credited."

This work is licensed under a Creative Commons Attribution 4.0 International License.
Deprecated: json_decode(): Passing null to parameter #1 ($json) of type string is deprecated in /home/eiuedunetcp/public_html/journals.eiu.edu.ye/plugins/generic/citations/CitationsPlugin.php on line 68