Abstract
Background: Sinorhizobium fredii HH103 is a fast-growing rhizobial strain capable of infecting a broad range of legumes, including plants forming determinate and indeterminate nodules, such as Glycine max (its natural host) and Glycyrrhiza uralensis, respectively. Previous studies reported the sequence and annotation of the genome of this strain (7.25 Mb), showing the most complex S. fredii genome sequenced to date. It comprises seven replicons: one chromosome and six plasmids. Among these plasmids, pSfHH103d, also known as the symbiotic plasmid pSymA, harbors most of the genes involved in symbiosis. Due to limitations of the sequencing technology used at the time and the presence of high number of clusters of transposable elements, this plasmid could only be partially assembled as four separated contigs. Methods: In this work, we have used a combination of PacBio and Illumina sequencing technologies to resolve these complex regions, obtaining an updated genome sequence (7.27 Mb). Results: This updated version includes an increase in size of the largest replicons (chromosome, pSfHH103d, and pSfHH103e) and a complete and closed symbiotic plasmid (pSfHH103d or pSymA). Additionally, we carried out a re-annotation of the updated genome, merging the previous annotation and the new one found in the remaining gaps. Notably, we found a high number of transposable elements in the HH103 genome, especially in three plasmids (pSfHH103b, pSfHH103c, and pSymA), a feature that is common among S. fredii strains. Conclusions: The combination of PacBio and Illumina sequencing technologies has allowed us to obtain a complete version of the HH103 pSymA. The presence of a high number of mobile elements seems to be a general characteristic among S. fredii strains, a fact that might be related to a high genome plasticity.