Temporal Trends and Patient Stratification in Lung Cancer: A Comprehensive Clustering Analysis from Timis County, Romania

罗马尼亚蒂米什县肺癌的时间趋势和患者分层:一项综合聚类分析

阅读:2

Abstract

Background/Objectives: Lung cancer remains a major cause of cancer-related mortality, with regional differences in incidence and patient characteristics. This study aimed to verify and quantify a perceived dramatic increase in lung cancer cases at a Romanian center, identify distinct patient phenotypes using unsupervised machine learning, and characterize contributing factors, including demographic shifts, changes in the healthcare system, and geographic patterns. Methods: A comprehensive retrospective analysis of 4206 lung cancer patients admitted between 2013 and 2024 was conducted, with detailed molecular characterization of 398 patients from 2023 to 2024. Temporal trends were analyzed using statistical methods, while k-means clustering on 761 clinical features identified patient phenotypes. The geographic distribution, smoking patterns, respiratory comorbidities, and demographic factors were systematically characterized across the identified clusters. Results: We confirmed an 80.5% increase in lung cancer admissions between pre-pandemic (2013-2020) and post-pandemic (2022-2024) periods, exceeding the 51.1% increase in total hospital admissions and aligning with national Romanian trends. Five distinct patient clusters emerged: elderly never-smokers (28.9%) with the highest metastatic rates (44.3%), heavy-smoking males (27.4%), active smokers with comprehensive molecular testing (31.7%), young mixed-gender cohort (7.3%) with balanced demographics, and extreme heavy smokers (4.8%) concentrated in rural areas (52.6%) with severe comorbidity burden. Clusters demonstrated significant differences in age (p < 0.001), smoking intensity (p < 0.001), geographic distribution (p < 0.001), as well as molecular characteristics. COPD prevalence was exceptionally high (44.8-78.9%) across clusters, while COVID-19 history remained low (3.4-8.3%), suggesting a limited direct association between the pandemic and cancer. Conclusions: This study presents the first comprehensive machine learning-based stratification of lung cancer patients in Romania, confirming genuine epidemiological increases beyond healthcare system artifacts. The identification of five clinically meaningful phenotypes-particularly rural extreme smokers and age-stratified never-smokers-demonstrates the value of unsupervised clustering for regional healthcare planning. These findings establish frameworks for targeted screening programs, personalized treatment approaches, and resource allocation strategies tailored to specific high-risk populations while highlighting the potential of artificial intelligence in identifying actionable clinical patterns for the implementation of precision medicine.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。