A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer-Reference-Cited by-同舟云学术

A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

Published:2014-01 Issue: Volume:13s4 Page:CIN.S15203
ISSN:1176-9351
Container-title:Cancer Informatics
language:en
Short-container-title:Cancer Inform

Author:

Li Ming¹,Wen Yalu²,Fu Wenjiang²³

Affiliation:

1. Division of Biostatistics, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, AR, USA.

2. Department of Epidemiology and Biostatistics, Michigan State University, East Lansing Ml, USA.

3. Department of Mathematics, University of Houston, Houston, TX, USA.

Abstract

Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease.

Publisher

SAGE Publications

Subject

Cancer Research,Oncology

Link

http://journals.sagepub.com/doi/pdf/10.4137/CIN.S15203

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Genome-wide copy number analysis reveals candidate gene loci that confer susceptibility to high-grade prostate cancer;Urologic Oncology: Seminars and Original Investigations;2017-09