Democratizing social media for health research: LLM-powered data analytics platform for NCDs

Abstract

Non-Communicable Diseases (NCDs) cause over 41 million deaths globally each year, predominantly in low- and middle-income countries, yet public access to relevant information from social media is hindered by the restrictive licensing of existing social listening tools. This study introduces NCDs Listener, an open-source tool designed to simplify the extraction, summarization, and visualization of NCD-related knowledge from social media comments (Facebook and Reddit posts) in both English and Thai. The tool uses keyword matching and the BERT model for knowledge extraction, followed by descriptive statistical analysis. A generative AI model, Google Gemini 2.0 Flash, then summarizes the extracted knowledge into human-readable sentences, focusing on medical and healthcare insights. Preliminary results indicate that NCDs Listener improves dashboard comprehension for both general users and data scientists, with general users showing higher comprehension. Furthermore, both user groups preferred medically focused generative AI summaries over general summaries (p-value <0.001). These findings suggest that NCDs Listener not only provides immediate insights but also establishes a foundation for advanced data analysis, fostering new opportunities for understanding complex social phenomena and predicting emerging trends. The source code is available at the project page: https://ratchanontt.github.io/NCDsListenerWebpage/.
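The keyword-matching and descriptive-statistics steps mentioned in the abstract can be illustrated with a minimal sketch. It is not the NCDs Listener implementation: the keyword lists, the function names `match_ncds` and `summarize`, and the sample comments are all assumptions made for illustration, and the BERT and Gemini stages are omitted. Plain substring matching is used because Thai text is written without spaces between words.

```python
from collections import Counter

# Hypothetical keyword lists (English and Thai); the tool's actual
# vocabulary is not given in the abstract.
NCD_KEYWORDS = {
    "diabetes": ["diabetes", "insulin", "blood sugar", "เบาหวาน"],
    "hypertension": ["hypertension", "high blood pressure", "ความดันโลหิตสูง"],
}


def match_ncds(comment):
    """Return the set of NCD labels whose keywords occur in the comment."""
    text = comment.lower()
    return {
        label
        for label, terms in NCD_KEYWORDS.items()
        if any(term in text for term in terms)
    }


def summarize(comments):
    """Count matching comments per NCD label and their share of all comments."""
    counts = Counter()
    for comment in comments:
        counts.update(match_ncds(comment))
    total = len(comments)
    return {
        label: {"count": n, "share": n / total}
        for label, n in counts.items()
    }


if __name__ == "__main__":
    sample = [
        "My blood sugar is high",
        "ผมเป็นเบาหวาน",
        "nice weather",
        "High blood pressure runs in my family",
    ]
    for label, stats in summarize(sample).items():
        print(label, stats)
```

In a fuller pipeline, the matched comments would presumably be passed to a classifier and then to a summarization model; the per-label counts here stand in for the kind of descriptive statistics a dashboard could display.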
