Abstract
An organism's metabolic profile provides vital information pertaining to its physiology or pathology. To monitor these biochemical changes, Nuclear Magnetic Resonance (NMR) spectroscopy has found success in non-invasively observing metabolite changes within intact samples in an untargeted manner. However, biological samples are chemically complex, comprised of many different constituents (amino acids, carbohydrates, and lipids) at varying concentrations depending on physiological and pathological conditions. Due to the narrow spectral window of proton NMR, compound resonance frequencies can often overlap, making the identification and monitoring of metabolites difficult and time consuming, particularly when dealing with large numbers of samples. Here, we introduce a Python program (ROIAL-NMR) to systematically identify potential metabolites from defined proton NMR spectral regions-of-interest (ROIs), which are identified from complex biological samples (i.e., human serum, saliva, sweat, urine, CSF, and tissues) using the Human Metabolome Database (HMDB) as a reference platform. Briefly, for disease-versus-control studies, the program considers disease types and utilizes study-defined ROIs together with their differing intensity levels, according to sample types, in differentiating disease from control to propose potential metabolites represented by these ROIs in an output table. In this report, we illustrate the utility of the program with one of our recent studies, where we measured proton NMR spectra of serum samples taken from lung cancer (LC) patients, with and without Alzheimer's disease and related dementia (ADRD). The program successfully identified 88 metabolites, with 66 differentiating LC from control patients, and 80 distinguishing LC patients with ADRD from those without ADRD to provide important information regarding pathophysiology in complex biological samples.