Abstract
The average pairwise dissimilarity (APD) between one plant sample and other assayed samples based on genetic markers was developed in 2006 to assess genetic distinctness and genetic redundancy in a plant germplasm collection. With the availability of abundant genomic variants across a genome, APD can be expanded to measure individual genomic distinctness. This study was conducted to assess the applicability of APD estimates in measuring the individual genomic distinctness of 1789 indica and 854 japonica rice samples based on published genome-wide single-nucleotide polymorphism (SNP) and structural variant (SV) data. It was found that the acquired APD estimates were weakly or not correlated between the SNP and SV data sets in the indica or japonica samples, respectively. For the indica samples, the APD estimates based on the SNP and SV data ranged from 0.1779 to 0.3277 and from 0.2297 to 0.4096, respectively. For the japonica samples, the SNP-based and SV-based APD estimates varied from 0.1774 to 0.3029 and from 0.1534 to 0.3459, respectively. These APD estimates were highly negatively correlated with the estimates of individual inbreeding coefficients and can identify the most genomically distinct rice germplasm that are compatible with those revealed through principal component analysis. Also, a reliable APD estimation was found to require 5000 to 10,000 random genomic SNPs or SVs. These findings together are significant, not only in demonstrating the informativeness of APD estimates in the identification of individuals with variable genomic distinctness, but also in providing guidance for APD applications to measure individual genomic distinctness.