Efficient trace reconstruction in DNA storage systems using bidirectional beam search

Abstract

As DNA data storage gains popularity, efficient trace reconstruction algorithms are crucial for fast decoding of data from noisy sequenced reads (or "traces"). Existing approaches, often adaptations of multiple sequence alignment or read correction methods, rely on strict assumptions of fixed error rates; as a result, they generalize poorly to more complex datasets and run slowly. We introduce a probabilistic formulation of the trace reconstruction problem that models traces as observations from a k-th order Markov chain. Instead of performing alignment, we take as the consensus the sequence most likely to have been generated by the Markov chain. This formulation motivates bidirectional beam search (BBS), an algorithm that reconstructs the consensus in time linear in its length. Experiments on multiple public Nanopore sequencing datasets demonstrate that BBS achieves top-tier accuracy while running approximately 20× faster than existing methods, showing its potential to improve the efficiency and reliability of DNA data storage systems.
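To make the probabilistic formulation concrete, the following is a minimal sketch, not the authors' BBS implementation. It fits k-th order Markov transition counts from a set of traces and then runs a plain left-to-right beam search for a high-likelihood sequence of a known target length. The bidirectional component described in the abstract is not reproduced here. The function names (`fit_markov`, `beam_search`), the start-padding symbol `^`, the pseudocount, and the default beam width are all assumptions made for illustration.

```python
from collections import defaultdict
import math


def fit_markov(traces, k):
    """Count (length-k context -> next base) transitions over all traces.

    Assumes k >= 1. Each trace is left-padded with k copies of '^'
    (a hypothetical start symbol) so the first bases also have a context.
    """
    counts = defaultdict(lambda: defaultdict(int))
    for trace in traces:
        padded = "^" * k + trace
        for i in range(k, len(padded)):
            counts[padded[i - k:i]][padded[i]] += 1
    return counts


def beam_search(counts, k, length, beam_width=4, alphabet="ACGT", pseudo=0.01):
    """Left-to-right beam search for a high-likelihood sequence.

    Each step extends every kept hypothesis by one base and keeps the
    beam_width best by log-probability, so the work is linear in `length`.
    The pseudocount is an assumed smoothing choice for unseen transitions.
    """
    beams = [(0.0, "^" * k)]  # (log-probability, padded sequence)
    for _ in range(length):
        candidates = []
        for score, seq in beams:
            nxt = counts.get(seq[-k:], {})
            total = sum(nxt.values()) + pseudo * len(alphabet)
            for base in alphabet:
                p = (nxt.get(base, 0) + pseudo) / total
                candidates.append((score + math.log(p), seq + base))
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = candidates[:beam_width]
    return beams[0][1][k:]  # strip the start padding


# Toy traces: three clean copies, one with a deletion, one with an insertion.
traces = ["ACGTACGT", "ACGTACGT", "ACGACGT", "ACGTTACGT", "ACGTACGT"]
consensus = beam_search(fit_markov(traces, k=3), k=3, length=8)
print(consensus)
```

Because the model is built from k-mer transition counts rather than from an alignment, the erroneous traces only add a small amount of probability mass to competing branches, and the search cost grows with the target length and beam width rather than with pairwise alignment of the traces.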
