Pairwise Neural Networks for Ranking Molecular Structures Based on Properties

基于性质对分子结构进行排序的成对神经网络

阅读:4

Abstract

The rapid discovery and design of new molecules drive innovation in science and technology, advancing energy storage, catalysis, and drug development. Traditionally, exploring chemical space involves costly quantum-chemical calculations or slow experimental screening, which limits the speed of identifying promising candidates. Machine learning has emerged as a groundbreaking approach to accelerate molecular discovery by predicting key properties directly from molecular structures. Moreover, in many cases, if we can rank molecular structures, it is not necessary to know the exact value of a molecular property. In other words, a ranker model can be useful for molecular screening. In this work, we develop a deep learning model to rank molecular structures using a siamese network approach and pairwise learning to learn the ranking. According to different properties of the QM7x and QO2Mol data sets, the results show that the performance of the learn-to-rank Siamese architecture outperforms standard pointwise regression for predicting absolute energetic properties, such as total and orbital energies, while traditional pointwise regression remains effective for derived (e.g., HOMO-LUMO gap) or nonenergy properties (e.g., dipole moment). To further validate the robustness of the proposed framework, we extended our evaluation to include the Uni-Mol molecular representation model. Experiments with Uni-Mol V1 and V2 across various model sizes (84 M to 1.1 B parameters) confirm that the pairwise learning-to-rank objective consistently outperforms standard pointwise regression, even when using highly expressive pretrained Transformer backbones.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。