Abstract
Understanding the interplay among clinical variables-such as demographics, symptoms, and laboratory results-and their relationships with disease outcomes is critical for advancing diagnostics and understanding mechanisms in complex diseases. Existing methods fail to capture indirect or directional relationships, while existing Bayesian network learning methods are computationally expensive and only infer general associations without focusing on disease outcomes. Here we introduce random walk- and genetic algorithm-based network inference (RAMEN), a method for Bayesian network inference that uses absorbing random walks to prioritize outcome-relevant variables and a genetic algorithm for efficient network refinement. Applied to COVID-19 (Biobanque québécoise de la COVID-19), intensive care unit (ICU) septicemia (MIMIC-III), and COPD (CanCOLD) datasets, RAMEN reconstructs networks linking clinical markers to disease outcomes, such as elevated lactate levels in ICU patients. RAMEN demonstrates advantages in computational efficiency and scalability compared to existing methods. By modeling outcome-specific relationships, RAMEN provides a robust tool for uncovering critical disease mechanisms, advancing diagnostics, and enabling personalized treatment strategies.