Application of supervised machine learning and unsupervised data compression models for pore pressure prediction employing drilling, petrophysical, and well log data

应用监督式机器学习和非监督式数据压缩模型，利用钻井、岩石物理和测井数据进行孔隙压力预测。

阅读：1

作者：Siddique,Abu Bakker,Munshi,Tanveer Alam,Rakin,Nazmul Islam,Hashan,Mahamudul,Chnapa,Sushmita Sarker,Jahan,Labiba Nusrat

期刊：	Scientific Reports	影响因子：	3.900
时间：	2025	起止号：	2025 Jul 9;15(1):24706
doi：	10.1038/s41598-025-89199-3

Abstract

Accurate determination of pore pressure is critical in the design of wells, determining a safe range of mud properties, and estimating the required mud weight to ensure wellbore stability. Conventional techniques for forecasting pore pressure, such as the Eaton, Bower, or compressibility methods, have certain constraints. These methods depend on empirical relationships and constants that can differ between basins. This study proposes an effective data-driven approach that utilizes machine learning algorithms to forecast reservoir pore pressure. A total of five machine learning algorithms, namely multivariable regression (MVR), polynomial regression (PR), random forest (RF), CatBoost regression, and multilayer perception (MLP), are applied in this research. Hybrid stacking modeling is employed for the first time to forecast pore pressure and to improve the accuracy and robustness of the results by combining different methodologies. Principal component analysis is also utilized (PCA) to extract features, hence expediting the entire process by reducing dimensionality. To accomplish the objectives, 1811 recordings are selected from the Volve Field, situated approximately 200 km west of Stavanger, Norway. These recordings encompass depth data; well logs, including NPHI, GR, DT, RD, RHOB, RS, and RT; drilling activities, specifically ROP; and petrophysical parameters, including BVW, K, PHIF, SW, and VCL. Pore pressure is used as the output level to generate data-driven models. 70% of the dataset is used for training the machine learning models, while the remaining 30% is reserved for testing the models to evaluate their performance and generalization capability. Data standardization is conducted to ensure that the utilized data is statistically well-distributed, devoid of measurement mistakes, and impervious to instrumental noise. Regression metrics, such as mean MAE, R(2), Adjusted R(2,) RMSE, MinE, and MaxE are employed to evaluate the efficacy of the models. The results suggest that the stacking model, which integrates CatBoost and Random Forest (RF) as base models and Polynomial Regression (PR) as the meta-model, achieves an R(2) of 0.9846, an adjusted R(2) of 0.9842, MAE of 11.20 and an RMSE of 22.747 on the testing dataset. This makes it the most accurate model for pore pressure prediction, followed closely by CatBoost. The MVR, exhibiting an R(2) of 0.896 and an RMSE of 57.931, is the least efficient model. A thorough comparison of all analyzed models indicates that the algorithms, ranked by performance metrics, are Stack_2, CatBoost, Stack_1, RF, PR, Stack_3, MLP, and MVR. Hybrid stacking improves performance even without hyperparameter tuning. PCA significantly speeds up the entire process by lowering the number of dimensions, hence enhancing the cost-effectiveness of the procedure. Using a few petrophysical, drilling, and well log data, the methodology presented in this work can help engineers and researchers quickly and precisely determine the reservoir pore pressure, validating the safe and cost-effective drilling operations.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。

肿瘤免疫

炎症

T细胞

线粒体

凋亡

转录调控

巨噬细胞

自噬

传染病

氧化应激

肠道菌群

磷酸化

血管生成

囊泡

3D/类器官

单细胞

中性粒细胞

外泌体

DNA甲基化

miRNA

药物研究

铁死亡

细胞衰老

乙酰化

缺氧低氧

泛素化

树突状细胞

组蛋白修饰

炎性小体

肿瘤微环境

lncRNA

代谢重编程

焦亡

m6A/m5C/m7G

内质网应激

空间多组学

细胞基因治疗

治疗耐药

相分离

Treg

上皮间质转化

免疫代谢

染色质重塑

脂质过氧化

脂代谢

蛋白质稳态

铁代谢

细胞极性

氨基酸代谢

碱基编辑

cGAS-STING

肠脑轴

蛋白降解

乳酸化

翻译调控

circRNA

piRNA

肿瘤异质性

NK 细胞

氧化脂质

MDSC

NETosis

低氧缺氧

溶酶体功能

细胞干性

琥珀酰化

CAR-NK

RNA 编辑

冷应激

Tfh

巴豆酰化

器官芯片

表观遗传记忆

铜死亡

器官纤维化

线粒体未折叠蛋白反应

空间代谢组

程序性坏死

自噬流

肠肝轴

丙酰化

MAIT 细胞