Abstract
BACKGROUND: Multiple lines of prior research have documented that author conflicts of interest (COI) can compromise the integrity of the biomedical research enterprise. However, continuing research that would investigate why, how, and in what circumstances COI is most risky is stymied by the difficulty in accessing disclosure statements, which are not widely represented in available databases. OBJECTIVE: In this study, we describe a new open access dataset of COI disclosures extracted from published biomedical journals articles. METHODS: To develop the dataset, we used ClinCalc's Top 300 drugs lists for 2017 and 2018 to identify 319 of the most commonly used drugs. Search strategies for each product were developed using the National Library of Medicine's Medical Subject Headings browser and deployed using the eUtilities API in April 2021. We identified the 150 most relevant articles for each product and extracted COI disclosure statements from PubMed, PubMed Central, or retrieved articles as necessary. RESULTS: Conflicts of Interest PublicatiON Disclosures (COIPonD) is a new dataset that captures author-reported COI disclosures for biomedical research articles published in a wide range of journals and subspecialties. COIPonD captures author-reported disclosure information (including lack of disclosure) for over 38,000 PubMed-indexed articles published between 1949 and 2022. The collected articles are indexed by discussed drug products with a focus on the 319 most commonly used drugs in the United States. CONCLUSIONS: COIPonD should accelerate research efforts to understand the effects of COI on the biomedical research enterprise. In particular, this dataset should facilitate new studies of COI effects across disciplines and subspecialties.