KNNCNV: A K-Nearest Neighbor Based Method for Detection of Copy Number Variations Using NGS Data

Published: 2021-12-22 Volume: 9
ISSN:2296-634X
Container-title:Frontiers in Cell and Developmental Biology
Short-container-title:Front. Cell Dev. Biol.

Author:

Xie Kun, Liu Kang, Alvi Haque A K, Chen Yuehui, Wang Shuzhen, Yuan Xiguo

Abstract

Copy number variation (CNV) is a well-known type of genomic mutation associated with the development of human cancers. Detecting CNVs in the human genome is a crucial step in the pipeline that runs from mutation analysis to cancer diagnosis and treatment. Next-generation sequencing (NGS) data provide an unprecedented opportunity for CNV detection at base-level resolution, and many methods have been developed for CNV detection using NGS data. However, owing to the intrinsic complexity of CNV structures and of NGS data itself, accurate CNV detection still faces many challenges. In this paper, we present an alternative method, called KNNCNV (K-Nearest Neighbor based CNV detection), for detecting CNVs using NGS data. Compared to current methods, KNNCNV has two distinctive features: 1) it assigns an outlier score to each genome segment based solely on the distances to its first k nearest neighbors, which is easy to extend to other data types and also improves the power to discover CNVs, especially local CNVs that are likely to be masked by their surrounding regions; 2) it employs the variational Bayesian Gaussian mixture model (VBGMM) to transform these scores into a series of binary labels without a user-defined threshold. To evaluate the performance of KNNCNV, we conduct experiments on both simulated and real sequencing data and compare it with peer methods. The experimental results show that KNNCNV achieves better performance than the other methods in terms of F1-score.
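The first feature described in the abstract, a per-segment outlier score built from k-nearest-neighbor distances, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `knn_outlier_scores`, the choice of the mean of the k smallest distances as the score, the use of a single read-depth value per segment, and the sample depths are all assumptions made for illustration.

```python
def knn_outlier_scores(values, k):
    """Score each value by the mean distance to its k nearest neighbors.

    Assumption: each genome segment is summarized by one number
    (for example its mean read depth); the abstract does not specify this.
    """
    if not 1 <= k < len(values):
        raise ValueError("k must satisfy 1 <= k < len(values)")
    scores = []
    for i, v in enumerate(values):
        # Distances from segment i to every other segment, smallest first.
        dists = sorted(abs(v - w) for j, w in enumerate(values) if j != i)
        scores.append(sum(dists[:k]) / k)
    return scores


# Hypothetical mean read depths for eight segments; index 3 looks amplified.
depths = [30.0, 31.0, 29.0, 60.0, 30.0, 32.0, 28.0, 31.0]
scores = knn_outlier_scores(depths, k=3)
most_anomalous = scores.index(max(scores))
```

Segments whose depth sits far from their nearest neighbors receive large scores. The second step from the abstract, turning scores into binary labels with a VBGMM, is not shown here; one possible route, again an assumption, would be fitting a two-component model such as scikit-learn's `BayesianGaussianMixture` to the scores.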

Publisher

Frontiers Media SA

Subject

Cell Biology, Developmental Biology

References (38 articles)

1. CNVnator: an Approach to Discover, Genotype, and Characterize Typical and Atypical CNVs from Family and Population Genome Sequencing; Abyzov; Genome Res., 2011

2. Outlier Analysis

3. Fast Outlier Detection in High Dimensional Spaces; Angiulli, 2002

4. Control-FREEC: a Tool for Assessing Copy Number and Allelic Content Using Next-Generation Sequencing Data; Boeva; Bioinformatics, 2012

Cited by 8 articles.

1. OTSUCNV: an adaptive segmentation and OTSU-based anomaly classification method for CNV detection using NGS data; BMC Genomics; 2024-01-30

2. Mitophagy genes in ovarian cancer: a comprehensive analysis for improved immunotherapy; Discover Oncology; 2023-12-01

3. Neurophysiologic evidence of motor imagery in lower limb amputees: an event-related potential study; 2023-08-14

4. iPADD: A Computational Tool for Predicting Potential Antidiabetic Drugs Using Machine Learning Algorithms; Journal of Chemical Information and Modeling; 2023-07-27

5. iRNA-ac4C: A novel computational method for effectively detecting N4-acetylcytidine sites in human mRNA; International Journal of Biological Macromolecules; 2023-02