Abstract
BACKGROUND: Orthopaedic surgeons treating skeletally immature patients with knee pathology rely on an accurate evaluation of skeletal age. Although the Greulich and Pyle atlas remains the gold standard for bone age estimation, it requires additional hand imaging and radiation exposure for patients who already have readily available knee imaging. PURPOSE: To develop a deep learning (DL) model for bone age estimation based on knee radiographs. STUDY DESIGN: Cohort study (diagnosis); Level of evidence, 3. METHODS: This was an institutional review board-approved study. A total of 2374 cropped knee images-80% for training and 20% for a test set-from patients aged ≤18 years at an orthopaedic specialty hospital with paired hand films obtained within 3 months of knee imaging were used. Patients with previous physeal trauma about the knee, genetic disorders including skeletal dysplasias and endocrinopathies affecting maturation, were excluded. A ConvNeXT model was trained on sex, skeletal age, and chronological age as input values using 5-fold validation. Model performance was evaluated using mean absolute error (MAE) and gradient-based saliency maps to assess model interpretability. RESULTS: The model achieved an MAE of 5.02 months relative to the assigned ground-truth bone ages, nearly half that of the abbreviated Fels method (9.59 months). An even lower MAE of 3.43 months was observed relative to the pretrained pseudo-labels generated by a hand DL model used to annotate the training set. Bland-Altman analysis revealed near-zero bias, indicating close agreement between predicted and ground truth bone age. This model outperformed both prior simple and artificial intelligence-based skeletal maturity estimation tools of the knee. CONCLUSION: This highly accurate DL model for knee skeletal maturity estimation demonstrates that automated bone age estimation from routine knee radiographs is feasible. We anticipate that this model can serve as a valuable diagnostic assistant for orthopaedic surgeons and radiologists evaluating skeletally immature patients and help to augment their clinical decision-making. External validation and model refinement are important for adoption into everyday practice.