Evaluating the Prediction Performance of Random Forest in Classification of Carbonate Lithology

Authors

  • Ibrahim A. Farea Emirates International University Author
  • Abdulla Ali Aldambi Faculty of Science, Department of Geology, University of Aden, Aden, Yemen Author
  • Abdulrahman A. Kadi Department of Petroleum Engineering, Department of Oil and Gas Field Development Engineering, China University of Petroleum Beijing, Aden, Yemen Author
  • Hamzah A. Al-Sharifi Department of Petroleum Engineering, Department of Oil and Gas Field Development Engineering, China University of Petroleum Beijing, Beijing, China Author

DOI:

https://doi.org/10.20428/jst.v30i9.3186

Keywords:

Carbonate lithology prediction, Random Forest, machine learning in geoscience, reservoir characterization, feature importance

Abstract

Accurate lithology prediction in carbonate reservoirs is essential for hydrocarbon exploration but remains challenging due to their complex heterogeneity. Traditional methods (e.g., seismic and well-log analysis) often fail to capture subtle lithological variations, while machine learning approaches such as Random Forest (RF) remain underexplored for carbonates. Previous research has not sufficiently compared Random Forest with advanced models such as XGBoost and deep learning approaches, nor provided detailed feature importance analyses specific to carbonate lithology classification. This study employs a dataset comprising 4,624 samples characterized by ten petrophysical properties to evaluate the classification performance of RF. Our optimized RF framework demonstrates superior accuracy while reducing dependence on costly core sampling, thereby improving the precision of carbonate reservoir models.

Author Biography

  • Ibrahim A. Farea, Emirates International University

    Ibrahim A. Farea

    Department of Oil and Gas Engineering, Faculty of Engineering and IT, Emirates International University, Sanaa, Yemen

Published

2025-09-14

Section

Articles

How to Cite

Farea, I. A., Aldambi, A. A., Kadi, A. A., & Al-Sharifi, H. A. (2025). Evaluating the Prediction Performance of Random Forest in Classification of Carbonate Lithology. Emirates International University Digital Repository, 1(1). https://doi.org/10.20428/jst.v30i9.3186
