The advent of high-dimensional single-cell data has necessitated the development of dimensionality-reduction tools. t-Distributed stochastic neighbor embedding (t-SNE) and uniform manifold approximation and projection (UMAP) are the two most frequently used approaches, allowing clear visualization of complex single-cell datasets. Despite the need for quantitative comparison, t-SNE and UMAP have largely remained visualization tools due to the lack of robust statistical approaches. Here, we have derived a statistical test for evaluating the difference between dimensionality-reduced datasets using the Kolmogorov-Smirnov test on the distributions of cross entropy of single cells within each dataset. As the approach uses the inter-relationship of single cells for comparison, the resulting statistic is robust and capable of identifying true biological variation. Further, the test provides a valid distance between single-cell datasets, allowing the organization of multiple samples into a dendrogram for quantitative comparison of complex datasets. These results demonstrate the largely untapped potential of dimensionality-reduction tools for biomedical data analysis beyond visualization.
A cross entropy test allows quantitative statistical comparison of t-SNE and UMAP representations.
交叉熵检验可以对 t-SNE 和 UMAP 表示进行定量统计比较
阅读:5
作者:Roca Carlos P, Burton Oliver T, Neumann Julika, Tareen Samar, Whyte Carly E, Gergelits Vaclav, Veiga Rafael V, Humblet-Baron Stéphanie, Liston Adrian
| 期刊: | Cell Reports Methods | 影响因子: | 4.500 |
| 时间: | 2023 | 起止号: | 2023 Jan 13; 3(1):100390 |
| doi: | 10.1016/j.crmeth.2022.100390 | ||
特别声明
1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。
2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。
3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。
4、投稿及合作请联系:info@biocloudy.com。
