SUMMARY: Despite the rapid growth of machine learning in biomolecular applications, information about protein dynamics is underutilized. Here, we introduce Nearl, an automated pipeline designed to extract dynamic features from large ensembles of molecular dynamics trajectories. Nearl aims to identify intrinsic patterns of molecular motion and to provide informative features for predictive modeling tasks. We implement two classes of dynamic features, termed marching observers and property-density flow, to capture local atomic motions while maintaining a view of the global configuration. Complemented by standard voxelization techniques, Nearl transforms substructures of proteins into three-dimensional (3D) grids, suitable for contemporary 3D convolutional neural networks (3D-CNNs). The pipeline leverages graphics processing unit (GPU) acceleration, adheres to the FAIR principles for research software, and prioritizes flexibility and user-friendliness, allowing customization of input formats and feature extraction. AVAILABILITY AND IMPLEMENTATION: The source code of Nearl is hosted at https://github.com/miemiemmmm/Nearl and archived at https://doi.org/10.5281/zenodo.15320286. The documentation is hosted on ReadTheDocs at https://nearl.readthedocs.io/en/latest/. All pre-built models are implemented in PyTorch and available on GitHub.
Nearl: extracting dynamic features from molecular dynamics trajectories for machine learning tasks.
阅读:5
作者:Zhang Yang, Vitalis Andreas
| 期刊: | Bioinformatics | 影响因子: | 5.400 |
| 时间: | 2025 | 起止号: | 2025 Jul 1; 41(7):btaf321 |
| doi: | 10.1093/bioinformatics/btaf321 | ||
特别声明
1、本文转载旨在传播信息,不代表本网站观点,亦不对其内容的真实性承担责任。
2、其他媒体、网站或个人若从本网站转载使用,必须保留本网站注明的“来源”,并自行承担包括版权在内的相关法律责任。
3、如作者不希望本文被转载,或需洽谈转载稿费等事宜,请及时与本网站联系。
4、此外,如需投稿,也可通过邮箱info@biocloudy.com与我们取得联系。
