Abstract
BACKGROUND AND OBJECTIVE: The new user cohort design has emerged as a best practice for the estimation of drug effects from observational data. However, despite its advantages, this design requires the selection and evaluation of comparators for appropriateness, a process that can be challenging. The objective of this work was to introduce an empirical approach to rank candidate comparators in terms of their similarity to a target drug in high-dimensional covariate space. METHODS: We generated new user cohorts for each RxNorm ingredient and Anatomic Therapeutic Chemical level 4 class in five administrative claims databases then extracted aggregated pre-treatment covariate data for each cohort across five clinically oriented domains. We formed all pairs of cohorts with ≥ 1000 patients and computed a scalar similarity score, defined as the average of cosine similarities computed within each domain, for each pair. We then generated ranked lists of candidate comparators for each cohort. RESULTS: Across up to 1350 cohorts forming 922,761 comparisons, drugs that were more similar in the Anatomic Therapeutic Chemical hierarchy had higher cohort similarity scores. The most similar candidate comparators for each of six example drugs corresponded to alternative treatments used in the target drug's indication(s), and choosing the top-ranked comparator for randomly selected drugs tended to produce balance on most covariates. This approach also ranked highly those comparators chosen in high-quality published new user cohort design studies. CONCLUSION: Empirical comparator recommendations may serve as a useful aid to investigators and could ultimately enable the automated generation of new user cohort design-derived evidence, a process that has previously been limited to self-controlled designs.