Building an analytical framework for tobacco-related information on social media: an exploratory analysis with generative AI assistance

构建社交媒体上烟草相关信息的分析框架:基于生成式人工智能辅助的探索性分析

阅读:2

Abstract

BACKGROUND: The propagation of tobacco-related information that is inconsistent with public health guide significantly impacts public health, particularly affecting people with less access to reliable information sources (such as those with lower education), who may also suffer disproportionate tobacco-related morbidity and mortality. This study develops a multi-dimensional analytical framework for identifying and categorizing tobacco-related information on social media. Using a dataset of tweets, the framework was constructed through qualitative analysis, which was then compared with an exploratory, AI-assisted analysis to assess the capabilities of current automated tools. METHODS: A collection of 3.4 million tweets related to tobacco and nicotine was refined to 842,754 after removing irrelevant and duplicate posts. LDA topic modeling identified six unique topics, from which two randomly selected samples of tweets were drawn to perform qualitative analysis and AI-assisted analysis to identify categories of tobacco information. RESULTS: The identified tobacco-related information was categorized by three dimensions (1) content, including safety and health effects, cessation, substance, and policy; (2) type of falsehood, which included fabrication and unsubstantiated claims, misrepresentations, and distortions; and (3) source, ranging from individuals and retail stores to advocacy groups and influencers. A notable finding was the prevalence of policy-related discussions of tobacco information on Twitter (X), highlighting this often-overlooked domain. The controversy over vaping has amplified pro-vaping voices on social media, with content frequently misinterpreting scientific findings, policies, and expert opinions, reflecting more nuanced and difficult to recognize falsehood in the misleading content. CONCLUSION: This study offers a comprehensive framework for analyzing tobacco-related information on social media, emphasizing key issues in policy debates and the presence of conspiracy narratives. This framework can inform the design of interventions for less informed populations and enhance data annotation for machine learning tasks.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。