Automated Logical Observation Identifiers Names and Codes mapping with biomedical natural language processing models: enabling scalable health information exchange via the Open Concept Lab

将自动化逻辑观察标识符名称和代码与生物医学自然语言处理模型进行映射:通过开放概念实验室实现可扩展的健康信息交换

阅读:1

Abstract

OBJECTIVES: Efficient exchange of health information requires consistent representation of clinical concepts across laboratories, hospitals, and public health systems. LOINC supports this interoperability by standardizing laboratory test codes, but mapping remains difficult when datasets are incomplete, inconsistently formatted, or structurally diverse. These challenges often create a mismatch between algorithmic performance in controlled settings and real-world deployment. This study aimed to develop a biomedical natural language processing (NLP) approach for mapping heterogeneous laboratory test strings to LOINC v2.81 and to compare its performance with established algorithms in the Open Concept Lab (OCL) Mapper. MATERIALS AND METHODS: We implemented a ScispaCy-based pipeline (ScispaCy-LOINC) that identifies clinical entities, links them to UMLS Concept Unique Identifiers, assembles LOINC codes from LOINC parts, and ranks candidates using a weighted scoring system. Overall and ranked performance was evaluated against 2 OCL algorithms, Elasticsearch Keyword Retrieval (OCL-Keyword) and MiniLM Semantic Search (OCL-Semantic), on 2 datasets: MIMIC-IV lab_d_items and a LOINC-mapped subset of the CIEL interface terminology v2025-07-15. RESULTS: In MIMIC-IV, the ScispaCy-LOINC achieved the highest coverage, correctly identifying the LOINC code in 42.3% of cases, outperforming OCL-Keyword (19.5%) and OCL-Semantic (21.4%). In the CIEL dataset, OCL-Semantic achieved the highest coverage (54.4%), followed by OCL-Keyword (46.9%) and ScispaCy-LOINC (28.4%). DISCUSSION: These results indicate that ScispaCy-LOINC is particularly effective for noisier or structurally sparse inputs, whereas OCL-based approaches perform better for more standardized terminologies, highlighting complementary algorithmic strengths. CONCLUSION: ScispaCy-LOINC offers a flexible approach to LOINC mapping and demonstrates complementary strengths relative to existing OCL algorithms. These findings support the development of an integrated framework that combines algorithmic strategies to improve robustness across diverse clinical datasets.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。