Abstract
Background: The modernization of Traditional Chinese Medicine (TCM) is hindered by a "structure-blind" bottleneck: establishing molecular mechanisms for complex formulations with uncharacterized chemical constituents. Conventional computational screening fails in these scenarios due to a heavy reliance on pre-determined structures. We developed NovelHTI, an inductive graph-based framework designed to reverse-engineer protein targets directly from standardized clinical symptom profiles. Methods: NovelHTI implements a "Phenotype-to-Target" paradigm by integrating heterogeneous graph neural networks with systemic pathway constraints. Unlike traditional transductive models, NovelHTI leverages multi-view feature fusion of symptom semantics and biological pathways to enable de novo prediction for unseen herbs. The framework was evaluated across 698 herbs and 7854 targets, benchmarking against advanced GNNs (HAN) and non-graph classifiers (XGBoost) under strict cold-start and knowledge erosion simulations. Results: NovelHTI maintains high precision (>84%) and balanced performance (F1-score >77%), outperforming baselines by over 33% (ROC-AUC) in realistic imbalanced screening, where traditional models typically fail (AUC ≈ 0.51). Robustness analysis confirmed stable performance (>0.83 AUC) despite 30% structural data incompleteness. Notably, retrospective validation successfully "rediscovered" emerging mechanisms (e.g., the Artemisinin-GPX4 ferroptosis axis) elucidated in 2021-2024 literature, which were entirely latent in the training data. Conclusions: NovelHTI provides a robust computational prioritization filter that effectively bridges macroscopic phenotypes and microscopic pharmacology. By enabling mechanism-driven target de-risking, this framework optimizes resource allocation for downstream experimental validation and accelerates TCM-based drug discovery.