Abstract
Cellulose, the most abundant organic polymer in soil, is degraded by the action of microbial communities. Cellulolytic taxa are widespread in soils, enhancing the biodegradation of cellulose by the synergistic action of different cellulase enzymes. β-glucosidases are the last enzymes responsible for the degradation of cellulose by producing glucose from the conversion of the disaccharide cellobiose. Different soils from the states of Delaware, Maryland, New Jersey, and New York were analyzed by direct DNA extraction, PCR analysis, and next generation sequencing of amplicon sequences coding for β-glucosidase genes. To determine the community structure and diversity of microorganisms carrying β-glucosidase genes, amplicon sequence variant analysis was performed. Results showed that the majority of β-glucosidase genes did not match any known phylum or genera with an average of 84% of sequences identified as unclassified. The forest soil sample from New York showed the highest value with 95.62%. When identification was possible, the bacterial phyla Pseudomonadota, Actinomycetota, and Chloroflexota were found to be dominant microorganisms with β-glucosidase genes in soils. The Delaware soil showed the highest diversity with phyla and genera showing the presence of β-glucosidase gene sequences in bacteria, fungi, and plants. However, the Chloroflexota genus Kallotanue was detected in 3 out of the 4 soil locations. When phylogenetic analysis of unclassified β-glucosidase genes was completed, most sequences aligned with the Chloroflexota genus Kallotenue and the Pseudomonadota species Sphingomonas paucimobilis. Since most sequences did not match known phyla, there is tremendous potential to discover new enzymes for possible biotechnological and pharmaceutical applications.