Abstract
Metagenotyping of metagenomic data has recently attracted increasing attention as it resolves intraspecies diversity by identifying single nucleotide variants. Furthermore, gene copy number analysis within species provides a deeper understanding of metabolic functions in microbial communities. However, a platform for examining metagenotyping results based on relevant grouping data is lacking. Here, we have developed the R package, stana, for the processing and analysis of metagenotyping results. The package consists of modules for preprocessing, statistical analysis, functional analysis and visualization. An interactive analysis environment for exploring the metagenotyping results was also developed and publicly released with over 1000 publicly available metagenome samples related to human diseases. Three examples exploring the relationship between the metagenotypes of the gut microbiome and human diseases are presented-end-stage renal disease, Crohn's disease and Parkinson's disease. The results suggest that stana facilitated the confirmation of the original study's findings and the generation of a new hypothesis. The GitHub repository for the package is available at https://github.com/noriakis/stana.