Abstract
BACKGROUND: Genetic mutations have proven to be the epicenters of cancer and disease progression. Traditional WXS sequencing and BLOSUM scoring can be used to infer the evolutionary conservation of amino acid substitutions, though these approaches are not informed by probable base pair sequence changes. Within gene mutation analysis, most tools focus on amino acid conservation or codon switching independently, limiting their ability to contextualize observed mutations against stochastic mutational processes. In the clinical setting, variants of unspecified significance remain difficult to interpret, as clinicians are often unable to determine whether observed mutations arise from oncogenic selection or from stochastic mutational degradation. METHODS: We analyzed mutation sequences from the TCGA BRCA cohort for TP53 and PIK3CA and developed a model that integrates BLOSUM scoring with statistical modeling of base pair changes to evaluate deviation from codon-aware neutral expectations. Observed mutational distributions were compared against a stochastic neutral model to assess statistical significance. RESULTS: Within the TCGA BRCA cohort, TP53 mutations were significantly more evolutionarily radical than expected under the codon-aware neutral model, while PIK3CA mutations were significantly more evolutionarily conservative, as determined using chi-square testing. These opposing patterns are consistent with the distinct functional roles of TP53 and PIK3CA in oncogenesis, where TP53 is inhibited through disruptive loss-of-function mutations, whereas PIK3CA is recurrently mutated in a manner that preserves protein structure and promotes constitutive pathway activation. This contrast reflects selective pressure toward disabling tumor suppressor function while maintaining persistent oncogenic signaling. CONCLUSIONS: Codon-aware neutral modeling provides a statistical framework for distinguishing mutations that deviate from stochastic expectations and may aid in the interpretation of variants of unspecified significance. By contextualizing mutational severity relative to neutral processes, this approach offers insight into tumor evolution and may support prognostic assessment without relying on predefined gene-level neutrality.