Novel Machine Learning of DNA Methylation Patterns to Diagnose Complex Disease: Identification of Cerebral Palsy with Concurrent Epilepsy

利用新型机器学习方法分析DNA甲基化模式以诊断复杂疾病:识别伴有癫痫的脑瘫

阅读:1

Abstract

Spastic cerebral palsy (CP) is a common pediatric-onset disability with an estimated prevalence of 0.2%. It is a complex condition characterized by muscle stiffness, contractures, and abnormal movement. Spastic CP is difficult to diagnose. Although nearly all affected children are born with it or acquire it immediately after birth, many are not identified until after 19 months of age with the diagnosis often not confirmed until 5 years of age. In addition, CP frequently co-occurs with other complex conditions that can complicate diagnosis and treatment. For example, an estimated 42% of spastic CP cases have co-occurring epilepsy. Recent studies indicate that altered DNA methylation patterns in peripheral blood cells are associated with CP and may have diagnostic value.Accordingly, the purpose of this study is to assess the diagnostic value of methylation in CP with more complex disease states. We evaluated machine learning classification for detecting CP based on DNA methylation pattern analysis in the context of co-occurrent epilepsy. Blood samples from 30 study participants diagnosed with epilepsy (n=4), spastic CP (n=10), both (n=8), or neither (n=8) were analyzed by Illumina MethylationEpic arrays. A novel machine learning algorithm using a Support Vector Machine (SVM) or Linear Discriminant Analysis (LDA) was developed to identify methylation loci that classified CP from controls and to measure the classification ability of identified methylation loci. The isolation of informative methylation loci was performed in a binary comparison between CP and controls, as well as in a 4-way comparison that included epilepsy. Median F1 scores for SVM-based analysis were 0.67 in 4-class comparison, and 1.0 in the binary classification. SVM outperformed LDA (median F1 0.57 and 0.86, respectively). Overall, the novel machine learning based algorithm was able to classify study participants with spastic CP and/or epilepsy from controls with significant performance.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。