Abstract
BACKGROUND: The alignment of clinical decisions in pediatric dentistry with evidence-based guidelines is crucial for treatment success. The potential of artificial intelligence applications to provide decision support in accordance with the guidelines of the American Academy of Pediatric Dentistry (AAPD) is gaining increasing attention. This study aims to comprehensively compare the ability of OpenAI ChatGPT-5 and Google Gemini Advanced 2.0 Flash models to provide responses consistent with the 2020–2024 AAPD pulp treatment guidelines. MATERIALS AND METHODS: AAPD’s Pulp Treatment in Primary and Immature Permanent Teeth (2020), Vital Pulp Therapy in Primary Teeth (2024), and Non-vital Pulp Therapy in Primary Teeth (2020) guidelines, a total of 27 questions were prepared, 9 of which are yes/no (dichotomous) and 18 are open-ended. The questions were posed by two different researchers over a period of 10 days, three times a day (morning, afternoon, evening), using both models (ChatGPT-5 and Gemini Advanced 2.0 Flash) in new chat sessions. A total of 1,620 responses were classified according to predefined, guideline-based criteria as ‘correct,’ ‘incorrect,’ or ‘consult an expert,’ with reference to the corresponding recommendations in the AAPD guidelines. The data were analysed using the Pearson Chi-Square test. RESULTS: The ChatGPT-5 model provided accurate information consistent with guidelines in 72.8% of its responses, whereas this rate was 67.2% for Gemini Advanced. The difference in response distributions between the two models is statistically significant (Chi-Square test, p = 0.000). These results indicate that ChatGPT-5 provides more consistent and guideline-compliant responses to pediatric pulp therapy decision support questions compared to Gemini Advanced (2.0 Flash). CONCLUSION: Artificial Intelligence-based chatbots have potential as clinical decision support tools in pediatric dentistry. ChatGPT-5 demonstrated higher accuracy than Gemini Advanced by providing responses that were largely consistent with the pulp treatment protocols outlined in the AAPD guidelines. However, both models can provide incorrect or insufficient guidance in some critical situations. Therefore, these tools should be viewed as auxiliary clinical decision support tools and should not replace the dentist’s expert knowledge and clinical judgment in final decisions. Developing artificial intelligence models that more comprehensively incorporate evidence-based guidelines in the field of pediatric dentistry will enhance their reliability in clinical practice. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12903-026-07897-4.