Abstract
AP-1 transcription factors are a network of cellular regulators that combine in different dimer pairs to control a range of pathways involved in differentiation, growth, and cell death. They dimerize via leucine zipper coiled-coil domains that are preceded by a basic DNA binding domain. Depending on which AP-1 transcription factors dimerize, different DNA sequences will be recognized resulting in differential gene expression. The affinity of AP-1 transcription factors for each other dictates which dimers form. The relative concentration of AP-1 transcription factors varies with tissue type and environment, adding another layer of control to this integral network of cellular regulation. The development of artificial intelligence (AI)-based protein structure prediction methods gives us a new technique to investigate or predict how dimerization affects combinatorial control. All versions of AlphaFold2 and AlphaFold3 are AI/deep learning programs that predict 3D structures of proteins from an amino acid sequence and multiple sequence alignments of homologous proteins. To fully realize the potential of AI for structural biology, it is essential to understand its current capabilities and limitations. In this study, we used the classical example of an AP-1 dimer: Fos and Jun, and an array of over 2000 experimentally tested human leucine zippers to interrogate how AlphaFold models leucine zipper domains and if AlphaFold can be used to differentiate between probable and improbable dimer interfaces. We found that AlphaFold predicts highly confident leucine zipper dimers, even for dimer pairs such as the FosB homodimer, for which electrostatics are known to prevent their formation in vivo. This is an important case study concerning high-confidence but low-accuracy protein structure prediction.