Abstract
The NIH Molecular Libraries Initiative (MLI), launched in 2004 with the initial goals of identifying chemical probes for characterizing gene function and druggability, has produced PubChem, a chemical genomics knowledgebase for fostering translation of basic research into new therapeutic strategies. This paper assesses progress toward these goals by evaluating the novelty of MLI targets and their propensity to undergo biochemically or therapeutically relevant modulation, as well as the degree of chemical diversity and biogenic bias in the MLI screening set. Our analyses suggest that while MLI target selection has not yet been fully optimized for biochemical diversity, it covers biologically interesting pathway space that complements established drug targets. We find the MLI screening set to be chemically diverse and to have greater biogenic bias than comparable collections of commercially available compounds. We suggest biogenic enhancements such as the incorporation of more metabolite-like chemotypes.