FragHub: A Mass Spectral Library Data Integration Workflow

FragHub:质谱库数据集成工作流程

阅读:8
作者:Axel Dablanc, Solweig Hennechart, Amélie Perez, Guillaume Cabanac, Yann Guitton, Nils Paulhe, Bernard Lyan, Emilien L Jamin, Franck Giacomoni, Guillaume Marti

Abstract

Open mass spectral libraries (OMSLs) are critical for metabolite annotation and machine learning, especially given the rising volume of untargeted metabolomic studies and the development of annotation pipelines. Despite their importance, the practical application of OMSLs is hampered by the lack of standardized file formats, metadata fields, and supporting ontology. Current libraries, often restricted to specific topics or matrices, such as natural products, lipids, or the human metabolome, may limit the discovery potential of untargeted studies. The goal of FragHub is to provide users with the capability to integrate various OMSLs into a single unified format, thereby enhancing the annotation accuracy and reliability. FragHub addresses these challenges by integrating multiple OMSLs into a single comprehensive database, supporting various data formats, and harmonizing metadata. It also proposes some generic filters for the mass spectrum using a graphical user interface. Additionally, a workflow to generate in-house libraries compatible with FragHub is proposed. FragHub dynamically segregates libraries based on ionization modes and chromatography techniques, thereby enhancing data utility in metabolomic research. The FragHub Python code is publicly available under a MIT license, at the following repository: https://github.com/eMetaboHUB/FragHub. Generated data can be accessed at 10.5281/zenodo.11057687.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。