Abstract
To clarify the phylogenetic relationship between Abutilon theophrasti M. and other Malvaceae plants, the chloroplast genome of A. theophrasti was assembled, annotated, and analyzed. The complete chloroplast genome was sequenced using the Illumina NovaSeq 6000 platform. Bioinformatics methods were employed to systematically analyze its genomic structure, repetitive sequences, nucleic acid diversity, and codon preference. Additionally, a phylogenetic tree was constructed by integrating chloroplast genomic sequences from other Malvaceae species. The results showed that the chloroplast genome of A. theophrasti was 160,440 bp in length with a GC content of 36.89%, exhibiting a typical tetrad structure. A total of 130 coding genes were annotated, including 85 mRNA genes, 37 tRNA genes, and 8 rRNA genes, with no pseudogenes detected. Codon preference analysis indicates that leucine (Leu) is the most frequently used amino acid. There are 31 codons with a relative synonymous codon usage (RSCU) value greater than 1, most of which end with A or U. The genome contains 61 scattered repeat sequences and 288 simple repeat sequences (SSR). Ka/Ks analysis revealed that the overall chloroplast genes of A. theophrasti undergo purifying selection, while genes such as psbK and rps12 are subjected to positive selection, which may be associated with adaptive evolution. Phylogenetically, A. theophrasti is most closely related to its congener A. indicum, followed by a clade comprising M. cathayensis and Malva crispa of the genus Malva. This study enhances the understanding of the phylogenetic relationship of A. theophrasti and provides a theoretical basis for the genetic improvement and breeding strategies of A. theophrasti and other Malvaceae plants.