Abstract
Background: Untargeted metabolomics enables comprehensive profiling of biological systems, but accurate metabolite annotation remains a critical bottleneck due to incomplete spectral libraries and structural isomerism. The use of in silico annotation tools can increase the coverage of annotated compounds, but it remains unclear whether these tools, in the absence of reference standards, can reliably annotate real-world experimental LC-HRMS data and whether they are sufficient for this task. Methods: This study assesses the performance and limitations of four widely used in silico structure prediction tools (MassFrontier, MetFrag, MS-FINDER, and SIRIUS/CSI:FingerID) when applied to an experimentally acquired feature set previously used to differentiate patients with depressive disorders from healthy controls. To ensure uniform evaluation across tools under realistic but optimized conditions, the quality of MS/MS data was improved using a parallel reaction monitoring method, allowing acquisition of interpretable fragmentation spectra for 26 of the 28 detected features. Results: For most features, all tools were able to suggest structure candidates. However, none of the tools proved sufficient as a standalone solution for reliable metabolite annotation. Due to their different algorithms, each tool had strengths and weaknesses in fragmentation interpretation, candidate generation, and ranking, resulting in incomplete or inconsistent annotations. While the combined application of all four tools provided a substantial improvement in putative annotation over conventional spectral library matching, the in silico structure prediction tools often prioritized chemically implausible, biologically irrelevant, or artifactual candidates. Consequently, manual expert evaluation was required to assess the chemical plausibility and biological relevance of the proposed structures. This ultimately reduced the number of biologically plausible metabolites putatively associated with disease to ten. Conclusions: Overall, these results demonstrate that existing in silico annotation tools can substantially support the annotation of experimental metabolomics data, but are insufficient on their own. Reliable identification of metabolites in complex biological matrices still depends on high-quality MS/MS data acquisition, the combined use of complementary tools, and mandatory post-annotation expert curation.