Macêdo, Bruno da Silva and Wayo, Dennis Delali Kwesi and Campos, Deivid and De Santis, Rodrigo Barbosa and Martinho, Alfeu Dias and Yaseen, Zaher Mundher and Saporetti, Camila M. and Goliatt, Leonardo (2025) Data-driven total organic carbon prediction using feature selection methods incorporated in an automated machine learning framework. Scientific Reports, 15 (1). pp. 1-19. ISSN 2045-2322. (Published)
PDF: Data-driven total organic carbon prediction using feature selection.pdf (4MB). Available under License Creative Commons Attribution Non-commercial No Derivatives.
Abstract
An accurate assessment of shale gas resources is highly important for the sustainable development of these energy resources. Total organic carbon (TOC) analysis is therefore fundamental for understanding the distribution and quality of hydrocarbon source rocks within a shale gas reservoir. Elevated TOC is often associated with the presence of source rocks, indicating the potential for oil and gas production. TOC assessment is performed using laboratory methods, which can be time-consuming and costly. Data-driven models have been successfully applied to model the relationship between TOC and other constituents and to predict TOC content. However, these methods depend on extensive parameter adjustments that must be carefully carried out for different sedimentary environments. In this context, Automated Machine Learning (AutoML) is an alternative for accurately predicting TOC while avoiding time-consuming fine-tuning steps in model development. This study aims to develop an AutoML strategy for estimating TOC using well log data. The procedure automates preprocessing and the search for the best method parameters, reducing the execution time. Among the methods evaluated, Extremely Randomized Trees (XT) performed best on the test set (R = 0.8632, MSE = 0.1806). The proposed strategy provides a powerful data-driven method that supports real-world use of well data to assist in data analysis and subsequent decision-making.
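For readers unfamiliar with the two test-set metrics quoted in the abstract, the following is a minimal sketch of how R and MSE can be computed from measured and predicted TOC values. It assumes that R denotes the Pearson correlation coefficient, which the abstract does not state explicitly. The function names `pearson_r` and `mse` and the sample values are hypothetical and are not taken from the paper.

```python
import math


def pearson_r(y_true, y_pred):
    """Pearson correlation between measured and predicted values.

    Assumption: the abstract's "R" is the Pearson coefficient.
    """
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    # Sum of co-deviations and sums of squared deviations.
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


def mse(y_true, y_pred):
    """Mean squared error, in squared units of the target variable."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical TOC values for illustration only (not data from the study).
measured = [1.2, 2.5, 0.8, 3.1, 1.9]
predicted = [1.0, 2.7, 1.1, 2.8, 2.0]

print("R   =", round(pearson_r(measured, predicted), 4))
print("MSE =", round(mse(measured, predicted), 4))
```

R measures how closely predictions track the measured trend, while MSE penalizes the size of the errors, so the two are typically reported together.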
Item Type: | Article |
---|---|
Additional Information: | Indexed by Scopus |
Uncontrolled Keywords: | Article; Data analysis; Energy resource; Feature selection; Gas |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science; T Technology > TD Environmental technology. Sanitary engineering; T Technology > TP Chemical technology |
Faculty/Division: | Institute of Postgraduate Studies; Faculty of Chemical and Process Engineering Technology |
Depositing User: | Mrs. Nurul Hamira Abd Razak |
Date Deposited: | 18 Jul 2025 07:05 |
Last Modified: | 18 Jul 2025 07:05 |
URI: | http://umpir.ump.edu.my/id/eprint/45116 |