Generative Artificial Intelligence Enhancements for Reducing Image-based Training Data Requirements

生成式人工智能增强技术可减少基于图像的训练数据需求

阅读:2

Abstract

OBJECTIVE: Training data fuel and shape the development of artificial intelligence (AI) models. Intensive data requirements are a major bottleneck limiting the success of AI tools in sectors with inherently scarce data. In health care, training data are difficult to curate, triggering growing concerns that the current lack of access to health care by under-privileged social groups will translate into future bias in health care AIs. In this report, we developed an autoencoder to grow and enhance inherently scarce datasets to alleviate our dependence on big data. DESIGN: Computational study with open-source data. SUBJECTS: The data were obtained from 6 open-source datasets comprising patients aged 40-80 years in Singapore, China, India, and Spain. METHODS: The reported framework generates synthetic images based on real-world patient imaging data. As a test case, we used autoencoder to expand publicly available training sets of optic disc photos, and evaluated the ability of the resultant datasets to train AI models in the detection of glaucomatous optic neuropathy. MAIN OUTCOME MEASURES: Area under the receiver operating characteristic curve (AUC) were used to evaluate the performance of the glaucoma detector. A higher AUC indicates better detection performance. RESULTS: Results show that enhancing datasets with synthetic images generated by autoencoder led to superior training sets that improved the performance of AI models. CONCLUSIONS: Our findings here help address the increasingly untenable data volume and quality requirements for AI model development and have implications beyond health care, toward empowering AI adoption for all similarly data-challenged fields. FINANCIAL DISCLOSURES: The authors have no proprietary or commercial interest in any materials discussed in this article.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。