Abstract
AbstractThe Beltwayand Turnpikeproblems entail the reconstruction of circular and linear one-dimensional point sets from unordered pairwise distances. These problems arise in computational biology when the measurements provide distances but do not associate those distances with the entities that gave rise to them. Such applications include molecular structure determination, genomic sequencing, tandem mass spectrometry, and molecular error-correcting codes (since sequencing and mass spec technologies can give lengths or weights, usually without connecting them to endpoints). Practical algorithms for Turnpike are known when the distance measurements are accurate, but both problems become strongly NP-hard under any level of measurement uncertainty. This is problematic since all known applications experience some degree of uncertainty from uncontrollable factors. Traditional algorithms cope with this complexity by exploring a much larger solution space, leading to exponential blowup in terms of both time and space. To alleviate both issues, we propose a novel alternating optimization algorithm that can scale to large, uncertain distance sets with as many as 100,000 points. This algorithm is space and time-efficient, with each step running inO(mlog(m)) time and requiring onlyworking space for a distance set of sizem. Evaluations of this approach on synthetic and partial digest data showcase improved accuracy and scalability in the presence of uncertain, duplicated, and missing distances. Our implementation of the algorithm is available athttps://github.com/Kingsford-Group/turnpikesolvermm.
Publisher
Cold Spring Harbor Laboratory
Reference27 articles.
1. A fast exact sequential algorithm for the partial digest problem;BMC Bioinformatics,2016
2. Physical mapping of chromosomes using unique probes;Journal of Computational Biology,1995
3. Three observations on linear algebra;Univ. Nac. Tacuman, Rev. Ser. A,1946
4. Stephen Boyd , Stephen P Boyd , and Lieven Vandenberghe . Convex Optimization. Cambridge, Cambridge University Press, 2004.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献