Conclusions
Determination of DDX49, EGFR, and T-stage could form a novel prediction model to improve the diagnostic efficacy of lymph node metastasis in clinical application.
Methods
This was an early experimental laboratory trial. The model identification data included the RNA sequence data of 10 patients from our clinical data and 188 patients with lung cancer from The Cancer Genome Atlas dataset. The model development and validation data consisted of RNA sequence data for 537 cases from the Gene Expression Omnibus dataset. We explore the predictive value of the model on two independent clinical data.
Results
A higher specificity of diagnostic model for patients with lung cancer with lymph node metastases consisted of DDX49, EGFR, and tumor stage (T-stage), which were the independent predictive factors. The area under the curve value, specificity, and sensitivity for predicting lymph node metastases were 0.835, 70.4%, and 78.9% at RNA expression level in the training group, and 0.681, 73.2%, and 75.7% at RNA expression level in the validation group as shown as in result part. To verify the predictive performance of the combined model for lymph node metastases, we downloaded the GSE30219 data set (n = 291) and the GSE31210 data set (n = 246) from the Gene Expression Omnibus (GEO) database as the training group and validation group, respectively. In addition, the model had a higher specificity for predicting lymph node metastases in independent tissue samples. Conclusions: Determination of DDX49, EGFR, and T-stage could form a novel prediction model to improve the diagnostic efficacy of lymph node metastasis in clinical application.
