ChatGPT Provides Accurate but Incomplete Responses and Reliably Adjusts Readability to Prompts for Hamstring Injury Frequently Asked Questions

ChatGPT 提供准确但不完整的回复，并能可靠地根据腿筋损伤常见问题的提示调整可读性

阅读：1

作者：Fenn,Thomas W,Farronato,Dominic M,Wells,Douglas K,Reahl,George B,Gwathmey,F Winston,Su,Charles A

期刊：	Arthroscopy Sports Medicine and Rehabilitation	影响因子：	0.000
时间：	2025	起止号：	2025 Aug;7(4):101200
doi：	10.1016/j.asmr.2025.101200

Abstract

PURPOSE: To evaluate the accuracy of ChatGPT's responses to frequently asked questions (FAQs) about hamstring injuries and to determine, if prompted, whether ChatGPT could appropriately tailor the reading level to that suggested. METHODS: A preliminary list of 15 questions on hamstring injuries was developed from various FAQ sections on patient education websites from a variety of institutions, from which the 10 most frequently cited questions were selected. Three queries were performed, inputting the questions into ChatGPT-4.0: (1) unprompted, naïve, (2) additional prompt specifying the response being tailored to a seventh-grade reading level, and (3) additional prompt specifying the response being tailored to a college graduate reading level. The responses from the unprompted query were independently evaluated by two of the authors. To assess the quality of the answers, a grading system was applied: (A) correct and sufficient response; (B) correct but insufficient response; (C) response containing both correct and incorrect information; and (D) incorrect response. In addition, the readability of each response was measured using the Flesch-Kinkaid Reading Ease Score (FRES) and Grade Level (FKGL) scales. RESULTS: Ten responses were evaluated. Inter-rater reliability was 0.6 regarding grading. Of the initial query, 2 of 10 responses received a grade of A, seven were graded as B, and one were graded as C. The average cumulative FRES and FKGL scores of the initial query was 61.64 and 10.28, respectively. The average cumulative FRES and FKGL scores of the secondary query were 75.2 and 6.1, respectively. Finally, the average FRES and FKGL scores of the third query were 12.08 and 17.23. CONCLUSIONS: ChatGPT showed generally satisfactory accuracy in responding to questions regarding hamstring injuries, although certain responses lacked completeness or specificity. On initial, unprompted queries, the readability of responses aligned with a tenth-grade level. However, when explicitly prompted, ChatGPT reliably adjusted the complexity of its responses to both a seventh-grade and a graduate-level reading standard. These findings suggest that although ChatGPT may not consistently deliver fully comprehensive medical information, it possesses the capacity to adapt its output to meet specific readability targets. CLINICAL RELEVANCE: Artificial intelligence models like ChatGPT have the potential to serve as a supplemental educational tool for patients with orthopaedic to aid medical-decision making. It is important that we continually review the quality of they medical information generated by these artificial models as the evolve and improve.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。

肿瘤免疫

炎症

T细胞

线粒体

凋亡

转录调控

巨噬细胞

自噬

传染病

氧化应激

肠道菌群

磷酸化

血管生成

囊泡

3D/类器官

单细胞

中性粒细胞

外泌体

DNA甲基化

miRNA

药物研究

铁死亡

细胞衰老

乙酰化

缺氧低氧

泛素化

树突状细胞

炎性小体

组蛋白修饰

肿瘤微环境

lncRNA

代谢重编程

焦亡

m6A/m5C/m7G

内质网应激

空间多组学

细胞基因治疗

治疗耐药

相分离

Treg

上皮间质转化

免疫代谢

染色质重塑

脂质过氧化

蛋白质稳态

脂代谢

细胞极性

铁代谢

氨基酸代谢

碱基编辑

cGAS-STING

肠脑轴

蛋白降解

乳酸化

翻译调控

circRNA

piRNA

肿瘤异质性

NK 细胞

氧化脂质

MDSC

NETosis

低氧缺氧

溶酶体功能

琥珀酰化

细胞干性

CAR-NK

冷应激

RNA 编辑

Tfh

巴豆酰化

器官芯片

表观遗传记忆

铜死亡

器官纤维化

线粒体未折叠蛋白反应

空间代谢组

程序性坏死

自噬流

MAIT 细胞

肠肝轴

丙酰化