Abstract
Hepatocellular carcinoma (HCC) remains a major cause of cancer-related mortality worldwide, and its management is limited by heterogeneous risk profiles, suboptimal surveillance performance, diagnostic uncertainty in chronically diseased livers, and difficulty individualizing prognosis after treatment. The aim of this narrative review was to critically evaluate artificial intelligence (AI) applications across the HCC care continuum, with emphasis on their intended clinical role, reported performance, evidence maturity, and barriers to implementation. A major strength of this review is that it moves beyond a descriptive catalog of models by structuring the literature around clinically relevant decision points and by explicitly distinguishing emerging proof-of-concept tools from applications with stronger translational potential. Across risk stratification, surveillance, imaging-based diagnosis, pathology, treatment-response prediction, and prognostication, we found that AI consistently demonstrates promise, particularly for identifying patients at higher future HCC risk, improving lesion detection and characterization on ultrasound, CT, MRI, and contrast-enhanced ultrasound, assisting histopathologic classification, and predicting outcomes such as microvascular invasion, recurrence, survival, and response to locoregional therapies. However, we also found that the evidence base remains highly uneven: many diagnostic studies are retrospective and lesion-enriched rather than embedded in true surveillance populations, many prognostic models lack robust external validation and calibration assessment, and reference standards, imaging protocols, and dataset composition vary substantially across studies. These findings are clinically relevant because they highlight both where AI may offer near-term value and why most published systems are not yet ready for routine use. Overall, AI in HCC should be viewed as a rapidly evolving but still transitional field. Its future impact will depend not only on higher-performing algorithms but on clearly defined clinical use cases, multicenter and prospective validation, transparent reporting, workflow-aware evaluation, and implementation strategies that support safe, equitable, and scalable adoption.