Abstract
AbstractMotivation: As genome graphs are powerful data structures for representing the genetic diversity within populations, they can help identify genomic variations that traditional linear references miss, but their complexity and size makes the analysis of genome graphs challenging. We sought to develop a genome graph analysis tool that helps these analyses to become more accessible by addressing the limitations of existing tools. Specifically, we improve scalability and user-friendliness, and we provide many new statistics for graph evaluation.Results: We developed an efficient, comprehensive, and integrated tool,gretl, to analyse genome graphs and gain insights into their structure and composition by providing a wide range of statistics.gretlcan be utilised to evaluate different graphs, compare the output of graph construction pipelines with different parameters, as well as perform an in-depth analysis of individual graphs, including sample-specific analysis. With the assistance ofgretl, novel patterns of genetic variation and potential regions of interest can be identified, for later, more detailed inspection. We demonstrate thatgretloutperforms other tools in terms of speed, particularly for larger genome graphs.Availability and implementation:gretlis implemented in Rust. Commented source code is available under MIT licence athttps://github.com/MoinSebi/gretl. Examples of how to rungretlare provided in the documentation. Several Jupyter notebooks are part of the repository and can help visualisegretlresults.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献