Abstract
In epidemiologic research, estimating treatment effects from observational data is challenging. Targeted maximum likelihood estimation (TMLE) is a robust method for this task: it combines the precision of correctly specified models with the versatility of flexible, data-adaptive machine learning algorithms. However, pairing TMLE with complex machine learning algorithms can introduce bias and confidence interval undercoverage. The double cross-fit TMLE (DC-TMLE) approach addresses this issue, improving accuracy and reducing bias in observational analyses. DC-TMLE nonetheless remains underexplored in epidemiologic research, primarily because comprehensive methodologic guidance is lacking and its computational implementation is complex. To fill this gap, we provide a detailed, reproducible guide to implementing DC-TMLE in R, aimed specifically at epidemiologic applications. We demonstrate the method using an openly available clinical data set, illustrating its relevance and adaptability for epidemiologic analysis. Our aim is to facilitate broader adoption of DC-TMLE in epidemiologic studies and thereby promote more accurate and reliable treatment effect estimates from observational data.