Abstract
Background: Hepatocellular carcinoma (HCC) patients without microvascular invasion (MVI) face significant postoperative early recurrence (ER) risks, yet prognostic determinants remain understudied. Existing models often rely on linear assumptions. This study aimed to develop and validate an interpretable machine learning model using routine clinical parameters to predict early recurrence (ER) in MVI-negative HCC patients. Methods: We retrospectively analyzed 578 MVI-negative HCC patients undergoing radical resection. Seven machine learning (ML) algorithms were systematically benchmarked using clinical/laboratory/imaging features optimized via recursive feature elimination (RFE) and hyperparameter tuning. Model interpretability was achieved via SHapley Additive exPlanations (SHAP). Results: The CatBoost model demonstrated superior performance (AUC: 0.7957, Accuracy: 0.7290). SHAP analysis identified key predictors: tumor capsule absence, elevated HBV-DNA and CA125 levels, larger tumor diameter, and lower body weight significantly increased ER risk. Individualized SHAP force plots enhanced clinical interpretability. Conclusions: The CatBoost model exhibits robust predictive performance for ER in MVI-negative HCC, offering a clinically interpretable tool for personalized risk stratification and optimization of postoperative management strategies.