Abstract
BACKGROUND: The WRKY gene family is identified as one of the most prominent transcription factor families in plants and is involved in various biological processes such as metabolism, growth and development, and response to biotic and abiotic stresses. In many plant species, the WRKY gene family was widely studied and analyzed but little to no information for Fortunella hindsii. However, the completion of the whole genome sequencing of Fortunella hindsii allowed us to investigate the genome-wide analysis of WRKY proteins. OBJECTIVE: The main objective of this study was to analyze and identify the WRKY gene family in Fortunella hindsii genome. METHODOLOGY: Various bioinformatics approaches have been used to conduct this study. RESULTS: We constituted 46 members of the Fortunella hindsii WRKY gene family, which were unevenly distributed on all nine chromosomes. The phylogenetic relationship of predicted WRKY proteins of Fortunella hindsii with the WRKY proteins of Arabidopsis showed that 46 FhWRKY genes were divided into three main groups (G1, G2, G3) with five subgroups (2A, 2B, 2C, 2D, and 2E) of G2 group. Domain, conserved motif identification, and gene structure were conducted and the results found that these FhWRKY proteins have conserved identical characteristics within groups and maintain differences between groups. In silico subcellular localization, results showed that FhWRKY genes are located in the nucleus. The cis-regulatory element analysis identified several key CREs that are significantly associated with light, hormone responses, and stress. The gene ontology analysis of these predicted FhWRKY genes showed that these genes are significantly enriched in sequence-specific DNA binding, transcriptional activity, cellular biosynthesis, and metabolic processes. CONCLUSION: Therefore, overall, our results provided an excellent foundation for further functional characterization of WRKY genes with an aim of Fortunella hindsii citrus crop improvement.