Abstract
Phylogenetic relationships are crucial for solving various biological questions, serving as a fundamental knowledge in biology. However, the application of phylogenetic trees has been limited by inadequate coverage of updated published phylogenies and the scarcity of reliable comprehensive datasets. In this study, we present a novel approach for automatically extracting phylogenetic data and integrating relevant species information from scientific papers and public databases. On this basis, we constructed a dataset TreeHub, including 135,502 corresponding phylogenetic trees from 7,879 phylogenetic research articles across 609 academic journals. This database will serve as a reliable and accessible resource for the scientific community, accelerating innovations in biodiversity studies and evolutionary theory based on high-density data.