WaveCNV: allele-specific copy number alterations in primary tumors and xenograft models from next-generation sequencing.
Bioinformatics
; 30(6): 768-74, 2014 Mar 15.
Article
in En
| MEDLINE
| ID: mdl-24192544
MOTIVATION: Copy number variations (CNVs) are a major source of genomic variability and are especially significant in cancer. Until recently microarray technologies have been used to characterize CNVs in genomes. However, advances in next-generation sequencing technology offer significant opportunities to deduce copy number directly from genome sequencing data. Unfortunately cancer genomes differ from normal genomes in several aspects that make them far less amenable to copy number detection. For example, cancer genomes are often aneuploid and an admixture of diploid/non-tumor cell fractions. Also patient-derived xenograft models can be laden with mouse contamination that strongly affects accurate assignment of copy number. Hence, there is a need to develop analytical tools that can take into account cancer-specific parameters for detecting CNVs directly from genome sequencing data. RESULTS: We have developed WaveCNV, a software package to identify copy number alterations by detecting breakpoints of CNVs using translation-invariant discrete wavelet transforms and assign digitized copy numbers to each event using next-generation sequencing data. We also assign alleles specifying the chromosomal ratio following duplication/loss. We verified copy number calls using both microarray (correlation coefficient 0.97) and quantitative polymerase chain reaction (correlation coefficient 0.94) and found them to be highly concordant. We demonstrate its utility in pancreatic primary and xenograft sequencing data. AVAILABILITY AND IMPLEMENTATION: Source code and executables are available at https://github.com/WaveCNV. The segmentation algorithm is implemented in MATLAB, and copy number assignment is implemented Perl. CONTACT: lakshmi.muthuswamy@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
DNA Copy Number Variations
/
High-Throughput Nucleotide Sequencing
/
Neoplasms
Limits:
Animals
/
Humans
Language:
En
Journal:
Bioinformatics
Journal subject:
INFORMATICA MEDICA
Year:
2014
Type:
Article
Affiliation country:
Canada