From Normal Cognition to Dementia: Using Natural Language Processing to Identify Cognitive Stages from Clinical Notes

从正常认知到痴呆:利用自然语言处理技术从临床记录中识别认知阶段

阅读:1

Abstract

BACKGROUND: This study responds to the urgent need for automated and reliable methods to detect cognitive impairments on a large scale. It leverages natural language processing (NLP) techniques to predict dementia and mild cognitive impairment (MCI) using clinical notes from electronic health records (EHR). METHOD: Our study used an EHR dataset from Massachusetts General Brigham, which included clinical notes from a 2‐year period (2017‐2018) covering 12 types of patient encounters. Sentence segmentation and keyword‐specific extraction were performed using the NLTK tool, focusing on dementia and activities of daily living, thus providing a comprehensive base for our analysis. Our analysis, designed for classifying cognitive stages into normal cognition, MCI, and dementia, involved three binary classification tasks. We employed two innovative NLP methods based on a transformer‐based language model, the Universal Sentence Encoder (USE). The first, Random Sampling, involved extracting and randomly batching sentences containing relevant keywords, each batch processed through the Universal Sentence Encoder (USE) to generate unique 512‐dimensional embeddings. The second method, Encounter‐based Sampling, grouped sentences by their corresponding note encounter types and keyword categories, creating 24 distinct embeddings for each patient. These methods, leveraging the USE’s deep learning capabilities, provided nuanced approach to classifying cognitive stages, enhancing the predictive accuracy of our model. RESULT: We evaluated the performance of various classification tasks on the data containing 531 Normal, 153 MCI, and 229 Dementia subjects. These classifications include classifying between normal cognition and dementia (AUC 97.8% in random sampling, AUC 93.6% in encounter‐based sampling), normal cognition and MCI (AUC 81% in random sampling, AUC 74.6% in encounter‐based sampling), as well as normal cognition and cognitive impairment (i.e., merged MCI and dementia stages; AUC 92.4% in random sampling, AUC 85.4% in encounter‐based sampling). Our NLP‐based approaches also significantly outperforms the baseline model based on the number of relevant sentences by a margin of 6‐13% in AUC in these three classification tasks. CONCLUSION: Our study harnessed NLP techniques for enhanced dementia staging accuracy using EHR clinical notes. We introduced a highly accurate, fully automated approach with scalability potential, promising to transform hospital practices in managing clinical notes.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。