Abstract
Despite rapidly growing volumes of clinical data, precision medicine still faces a structural data deficit: most patients and rare disease variants are sparsely sampled, labels are noisy, and counterfactual outcomes for alternative treatments are fundamentally unobservable. This position paper argues that overcoming these limits will require hybrid systems that couple multiscale virtual tissue models, synthetic data generation, and AI/ML within risk-aware digital twin frameworks. Using a structured narrative synthesis of three literatures-synthetic health data, virtual tissues and medical digital twins, and hybrid mechanistic-AI architectures including numerical weather prediction-we develop a conceptual framework centered on a mechanistic core linked to AI via forward (mechanistic → synthetic data → AI), backward (AI → mechanistic), and closed (patient-anchored digital twin) loops. We analyze how complex-systems behavior, biological adaptability, and sparse observations bound what medical digital twins can meaningfully predict, motivating ensemble and population-level forecasts rather than exact individual replicas. We then survey emerging implementation patterns, parameter-space exploration methods, and computational envelopes for using virtual tissues to generate biologically constrained synthetic cohorts and to calibrate hybrid digital twins. Finally, we adapt risk- and context-informed verification, validation, and governance frameworks to a four-layer stack spanning mechanistic cores, synthetic data products, AI components, and clinical workflows, with explicit attention to bias, drift, and provenance. We conclude that near-term impact is most likely from population- and cohort-level digital twins that support stratification and short-horizon decision support, while laying the groundwork for more individualized, trustworthy hybrids as biological and methodological uncertainties are better characterized.