Abstract
Blood analysis is the most common in biomedical applications and a reference metabolome will be critical for effective annotation and for guiding scientific investigations. However, compiling such a reference is hindered by many technical challenges, despite the availability of large amount of metabolomics data today. Based on a new set of data structures and tools, we have assembled a consensus serum metabolome (CSM) from over 100,000 mass spectrometry acquisitions of more than 200 million spectra. This provides a comprehensive survey of human blood chemistry, revealing the frequency dependent nature of metabolome and exposome. Major gaps are found between CSM and the current databases. The CSM enables community-level data alignment and significantly improves annotation quality of LC-MS metabolomics.