Abstract
The next-generation sequencing technology offers a wealth of data resources for the detection of copy number variations (CNVs) at a high resolution. However, it is still challenging to correctly detect CNVs of different lengths. It is necessary to develop new CNV detection tools to meet this demand. In this work, we propose a new CNV detection method, called CBCNV, for the detection of CNVs of different lengths from whole genome sequencing data. CBCNV uses a clustering algorithm to divide the read depth segment profile, and assigns an abnormal score to each read depth segment. Based on the abnormal score profile, Tukey’s fences method is adopted in CBCNV to forecast CNVs. The performance of the proposed method is evaluated on simulated data sets, and is compared with those of several existing methods. The experimental results prove that the performance of CBCNV is better than those of several existing methods. The proposed method is further tested and verified on real data sets, and the experimental results are found to be consistent with the simulation results. Therefore, the proposed method can be expected to become a routine tool in the analysis of CNVs from tumor-normal matched samples.
Subject
Genetics (clinical),Genetics,Molecular Medicine
Reference52 articles.
1. Copy number variations and cancer.;Adam;Genome Med.,2009
2. A role for XRCC4 in age at diagnosis and breast cancer risk.;Allen-Brady;Cancer Epidemiol. Biomarkers Prevent.,2006
3. Implication of the proliferation and apoptosis associated CSE1L/CAS gene for breast cancer development.;Behrens;Anticancer Res.,2001
4. Accurate whole human genome sequencing using reversible terminator chemistry.;Bentley;Nature,2008
5. The landscape of somatic copy-number alteration across human cancers.;Beroukhim;Nature,2010