Abstract
Background: Accurate differentiation of kidney diseases such as cysts, tumors, stones, and normal tissue from computed tomography (CT) images remains challenging due to overlapping visual characteristics and variability in data distributions. While deep learning approaches have shown promising results, many existing studies rely on image-level data splitting and focus primarily on accuracy, which may lead to overly optimistic performance and limited clinical reliability. Methods: This study proposes KDH-Net (Kidney Disease Hybrid Network), a hybrid deep learning framework for multiclass kidney disease characterization that integrates EfficientNetB0, ResNet50, and MobileNetV2 through feature-level fusion. A two-stage training strategy is adopted to enhance optimization stability. To ensure realistic performance assessment, experiments on the primary dataset are conducted under a patient-level evaluation protocol, eliminating potential data leakage. The framework further incorporates calibration analysis, statistical validation, and explainable artificial intelligence to evaluate prediction reliability and interpretability. Results: On the patient-level dataset, KDH-Net achieves an overall accuracy of 0.93 with a macro-average F1-score of 0.91, demonstrating balanced performance across all classes. Confidence analysis indicates meaningful alignment between prediction confidence and correctness, while Grad-CAM visualizations highlight anatomically relevant regions associated with each class. Conclusions: The results demonstrate that KDH-Net provides a stable, reliable, and interpretable framework for kidney CT characterization. The proposed system is designed to support clinical decision-making by offering trustworthy predictions under realistic evaluation conditions, rather than replacing clinical expertise.