Abstract
Single-cell RNA sequencing (scRNA-seq) provides a high-resolution view of cellular heterogeneity, yet accurate cell-type annotation remains challenging due to data sparsity, technical noise, and variability across tissues, platforms, and species. Many existing annotation tools depend on a single form of prior knowledge, such as marker genes or reference profiles, which can limit performance when these resources are incomplete or inconsistent. Here, we present scANMF, a prior- and graph-regularized non-negative matrix factorization framework that integrates marker-gene information, partial label supervision, and the local manifold structure into a unified annotation model. scANMF factorizes the expression matrix into interpretable gene-factor and cell-factor representations, enabling accurate annotation in settings with limited or noisy prior information. Across multiple real scRNA-seq collections, scANMF achieved a high annotation accuracy in within-dataset, cross-platform, and cross-species evaluations. The method remained stable under varying levels of label sparsity and marker-gene noise and showed a broad robustness to hyperparameter choices. Ablation analyses indicated that marker priors, label supervision, and graph regularization contribute complementary information to the overall performance. These results support scANMF as a practical and robust framework for single-cell annotation, particularly in applications where high-quality prior knowledge is restricted.