Abstract
The concomitant development and evolution of lung computed tomography (CT) and artificial intelligence (AI) have made non-invasive lung imaging a key component of clinical care of patients. However, the scarcity of labeled CT data and the limited generative capacity of existing models have constrained their clinical utility. Here, we present LCTfound, a large-scale vision foundation model designed to overcome these limitations. Trained on a multi-center dataset comprising 105,184 CT scans, LCTfound leverages diffusion-based pretraining and joint encoding of imaging and clinical information to support 8 tasks, including CT enhancement, virtual computed tomography angiography (CTA), sparse-view reconstruction, lesion segmentation, diagnosis, prognosis, cancer pathological response prediction, and three-dimensional surgical navigation. In comprehensive multicenter evaluations, LCTfound consistently outperforms leading baseline models, delivering a unified, broadly deployable solution that both augments clinical decision-making and elevates CT image quality across diverse practice settings. LCTfound establishes a scalable foundation for next-generation clinical imaging intelligence, uniting large AI model with precision healthcare.