Abstract
MOTIVATION: Synthetic microbial communities offer an opportunity to conduct reductionist research in tractable model systems. However, deriving abundances of highly related strains within these communities is currently unreliable. 16S rRNA gene sequencing does not resolve abundance at the strain level and other methods such as quantitative polymerase chain reaction (qPCR) scale poorly and are resource prohibitive for complex communities. We present StrainR2, which utilizes shotgun metagenomic sequencing to provide high accuracy strain-level abundances for all members of a synthetic community, provided their genomes. RESULTS: Both in silico, and using sequencing data derived from gnotobiotic mice colonized with a synthetic fecal microbiota, StrainR2 resolves strain abundances with greater accuracy and efficiency than other tools utilizing shotgun metagenomic sequencing reads. We demonstrate that StrainR2's accuracy is comparable to that of qPCR on a subset of strains resolved using absolute quantification. AVAILABILITY AND IMPLEMENTATION: Software is available at GitHub and implemented in C, R, and Bash. Software is supported on Linux and MacOS, with packages available on Bioconda or as a Docker container. The source code at the time of publication is also available on figshare at the doi: 10.6084/m9.figshare.29420780.