Abstract
AbstractCertain nucleotide sequences in DNA can change their positions. Such nucleotide sequences might be shorter than a general gene. When we restrict to nucleotide sequences that form complete genes, we can still find genes that change their relative locations in a genome. Thus for different individuals of the same species, the orders of genes might be different. Such spatial difference of gene orders might be affected by temporal difference of gene (mutation) orders, and can be used to explain the order of mutation problem in myeloproliferative neoplasm. A practical problem is to determine such transposable genes in given gene sequences. Through an intuitive rule, we transform the biological problem of determining transposable genes into a rigorous mathematical problem of determining the longest common subsequence. Given several number sequences, determining the longest common subsequence is a classical problem in computer science. Depending on whether the gene sequence is linear or circular, and whether genes have multiple copies, we classify the problem of determining transposable genes into different scenarios and design corresponding algorithms. Specifically, we study the situation where the longest common subsequence is not unique.
Publisher
Cold Spring Harbor Laboratory
Reference83 articles.
1. Repetition-free longest common subsequence;Discret. Appl. Math,2010
2. A model for the intrinsic limit of cancer therapy: Duality of treatment-induced cell death and treatment-induced stemness;PLOS Comput. Biol,2022
3. Transposons: the agents of antibiotic resistance in bacteria
4. Backurs, A. , and Indyk, P. Edit distance cannot be computed in strongly subquadratic time (unless SETH is false). In Proceedings of the Forty-Seventh Annual ACM Symposium on Theory of Computing (2015), pp. 51–58.
5. Bergroth, L. , Hakonen, H. , and Raita, T. A survey of longest common subsequence algorithms. In Proceedings Seventh International Symposium on String Processing and Information Retrieval. SPIRE 2000 (2000), IEEE, pp. 39–48.