RESUMO
Motivation: The ability to produce and analyze whole genome sequencing (WGS) data from samples with structural variations (SV) generated the need to visualize such abnormalities in simplified plots. Conventional two-dimensional representations of WGS data frequently use either circular or linear layouts. There are several diverse advantages regarding both these representations, but their major disadvantage is that they do not use the two-dimensional space very efficiently. We propose a layout, termed the Genome U-Plot, which spreads the chromosomes on a two-dimensional surface and essentially quadruples the spatial resolution. We present the Genome U-Plot for producing clear and intuitive graphs that allows researchers to generate novel insights and hypotheses by visualizing SVs such as deletions, amplifications, and chromoanagenesis events. The main features of the Genome U-Plot are its layered layout, its high spatial resolution and its improved aesthetic qualities. We compare conventional visualization schemas with the Genome U-Plot using visualization metrics such as number of line crossings and crossing angle resolution measures. Based on our metrics, we improve the readability of the resulting graph by at least 2-fold, making apparent important features and making it easy to identify important genomic changes. Results: A whole genome visualization tool with high spatial resolution and improved aesthetic qualities. Availability and implementation: An implementation and documentation of the Genome U-Plot is publicly available at https://github.com/gaitat/GenomeUPlot. Contact: vasmatzis.george@mayo.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
Assuntos
Visualização de Dados , Genoma Humano , Variação Estrutural do Genoma , Software , Sequenciamento Completo do Genoma/métodos , Genômica/métodos , HumanosRESUMO
Copy number variation (CNV) is a common form of structural variation detected in human genomes, occurring as both constitutional and somatic events. Cytogenetic techniques like chromosomal microarray (CMA) are widely used in analyzing CNVs. However, CMA techniques cannot resolve the full nature of these structural variations (i.e. the orientation and location of associated breakpoint junctions) and must be combined with other cytogenetic techniques, such as karyotyping or FISH, to do so. This makes the development of a next-generation sequencing (NGS) approach capable of resolving both CNVs and breakpoint junctions desirable. Mate-pair sequencing (MPseq) is a NGS technology designed to find large structural rearrangements across the entire genome. Here we present an algorithm capable of performing copy number analysis from mate-pair sequencing data. The algorithm uses a step-wise procedure involving normalization, segmentation, and classification of the sequencing data. The segmentation technique combines both read depth and discordant mate-pair reads to increase the sensitivity and resolution of CNV calls. The method is particularly suited to MPseq, which is designed to detect breakpoint junctions at high resolution. This allows for the classification step to accurately calculate copy number levels at the relatively low read depth of MPseq. Here we compare results for a series of hematological cancer samples that were tested with CMA and MPseq. We demonstrate comparable sensitivity to the state-of-the-art CMA technology, with the benefit of improved breakpoint resolution. The algorithm provides a powerful analytical tool for the analysis of MPseq results in cancer.
Assuntos
Aberrações Cromossômicas , Variações do Número de Cópias de DNA/genética , Genoma Humano/genética , Sequenciamento de Nucleotídeos em Larga Escala , Algoritmos , Pontos de Quebra do Cromossomo , Rearranjo Gênico , Humanos , Análise Serial de Tecidos/métodosRESUMO
BACKGROUND: HER2 positive (HER2+) breast cancers involve chromosomal structural alterations that act as oncogenic driver events. METHODS: We interrogated the genomic structure of 18 clinically-defined HER2+ breast tumors through integrated analysis of whole genome and transcriptome sequencing, coupled with clinical information. RESULTS: ERBB2 overexpression in 15 of these tumors was associated with ERBB2 amplification due to chromoanasynthesis with six of them containing single events and the other nine exhibiting multiple events. Two of the more complex cases had adverse clinical outcomes. Chromosomes 8 was commonly involved in the same chromoanasynthesis with 17. In ten cases where chromosome 8 was involved we observed NRG1 fusions (two cases), NRG1 amplification (one case), FGFR1 amplification and ADAM32 or ADAM5 fusions. ERBB3 over-expression was associated with NRG1 fusions and EGFR and ERBB3 expressions were anti-correlated. Of the remaining three cases, one had a small duplication fully encompassing ERBB2 and was accompanied with a pathogenic mutation. CONCLUSION: Chromoanasynthesis involving chromosome 17 can lead to ERBB2 amplifications in HER2+ breast cancer. However, additional large genomic alterations contribute to a high level of genomic complexity, generating the hypothesis that worse outcome could be associated with multiple chromoanasynthetic events.
Assuntos
Neoplasias da Mama/genética , Cromotripsia , Amplificação de Genes , Receptor ErbB-2/genética , Neoplasias da Mama/química , Neoplasias da Mama/patologia , Cromossomos Humanos Par 17 , Estudos de Coortes , Feminino , Humanos , Estadiamento de Neoplasias , Receptor ErbB-2/análiseRESUMO
Genome-wide copy number profiling by single-nucleotide polymorphism (SNP) array is increasingly employed in the clinical diagnostic workup of melanocytic tumors. We present our SNP array results on 675 melanocytic tumors, including 615 histologically ambiguous tumors evaluated by our institution's dermatopathology consultation service and a separate validation cohort of 26 known benign nevi and 34 known malignant melanomas. The total number of somatic copy number abnormalities, sub-chromosomal copy number abnormalities, regions of homozygosity, and abnormalities at disease-associated regions was significantly associated with a diagnosis of malignancy across disease categories. In our study, the number of copy number abnormalities was the factor that best discriminated between benign versus malignant diagnoses, confirming recent published research. Histologically ambiguous tumors had a range and spectrum of abnormalities, including recurrent 11p gains, copy state transitions over kinase genes, and 3p deletions overlapping BAP1 in neoplasms with Spitzoid morphology. Our data suggest that histologically ambiguous melanocytic neoplasms and early primary melanomas have a range of abnormalities that is intermediate between unambiguous benign or malignant melanocytic neoplasms. Careful technical review and an integrated diagnostic approach are essential for the accurate interpretation of SNP array results on histologically ambiguous melanocytic tumors.
Assuntos
Melanoma , Nevo de Células Epitelioides e Fusiformes , Neoplasias Cutâneas , Humanos , Neoplasias Cutâneas/diagnóstico , Neoplasias Cutâneas/genética , Polimorfismo de Nucleotídeo Único , Melanoma/diagnóstico , Melanoma/genética , Aberrações CromossômicasRESUMO
OBJECTIVE: To test the hypothesis that chromosomal rearrangements (CRs) can distinguish low risk of progression (LRP) from intermediate and high risk of progression (IHRP) to prostate cancer (PCa) and if these CRs have the potential to identify men with LRP on needle biopsy that harbor IHRP PCa in the prostate gland. PATIENTS AND METHODS: Mate pair sequencing of amplified DNA from pure populations of Gleason patterns in 154 frozen specimens from 126 patients obtained between August 14, 2001, and July 15, 2011, was used to detect CRs including abnormal junctions and copy number variations. Potential CR biomarkers with higher incidence in IHRP than in LRP to cancer and having significance in PCa biology were identified. Independent validation was performed by fluorescence in situ hybridization in 152 specimens from 124 patients obtained between February 12, 2002, and July 12, 2008. RESULTS: The number of abnormal junctions did not distinguish LRP from IHRP. Loci corresponding to genes implicated in PCa were more frequently altered in IHRP. Integrated analysis of copy number variations and microarray data yielded 6 potential markers that were more frequently detected in Gleason pattern 3 of a Gleason score 7 of PCa than in Gleason pattern 3 of a Gleason score 6 PCa. Five of those were cross-validated in an independent sample set with statistically significant areas under the receiver operating characteristic curves (AUCs) (P≤.01). Probes detecting deletions in PTEN and CHD1 had AUCs of 0.87 (95% CI, 0.77-0.97) and 0.73 (95% CI, 0.60-0.86), respectively, and probes detecting gains in ASAP1, MYC, and HDAC9 had AUCs of 0.71 (95% CI, 0.59-0.84), 0.82 (95% CI, 0.71-0.93), and 0.77 (95% CI, 0.66-0.89), respectively (for expansion of gene symbols, use search tool at www.genenames.org). CONCLUSION: Copy number variations in regions encompassing important PCa genes were predictive of cancer significance and have the potential to identify men with LRP PCa by needle biopsy who have IHRP PCa in their prostate gland.
Assuntos
Biomarcadores Tumorais/genética , DNA de Neoplasias/genética , Estadiamento de Neoplasias , Próstata/patologia , Neoplasias da Próstata/genética , Idoso , Biomarcadores Tumorais/metabolismo , Biópsia por Agulha , Variações do Número de Cópias de DNA , Progressão da Doença , Seguimentos , Humanos , Hibridização in Situ Fluorescente , Masculino , Pessoa de Meia-Idade , Próstata/metabolismo , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/metabolismo , Curva ROC , Estudos Retrospectivos , Fatores de RiscoRESUMO
Mate-pair sequencing (MPseq), using long-insert, paired-end genomic libraries, is a powerful next-generation sequencing-based approach for the detection of genomic structural variants. SVAtools is a set of algorithms to detect both chromosomal rearrangements and large (>10 kb) copy number variants (CNVs) in genome-wide MPseq data. SVAtools can also predict gene disruptions and gene fusions, and characterize the genomic structure of complex rearrangements. To illustrate the power of SVAtools' junction detection methods to provide comprehensive molecular karyotypes, MPseq data were compared against a set of samples previously characterized by traditional cytogenetic methods. Karyotype, FISH and chromosomal microarray (CMA), performed for 29 patients in a clinical laboratory setting, collectively revealed 285 breakpoints in 87 rearrangements. The junction detection methods of SVAtools detected 87% of these breakpoints compared to 48%, 42% and 57% for karyotype, FISH and CMA respectively. Breakpoint resolution was also reported to 1 kb or less and additional genomic rearrangement complexities not appreciable by standard cytogenetic techniques were revealed. For example, 63% of CNVs detected by CMA were shown by SVAtools' junction detection to occur secondary to a rearrangement other than a simple deletion or tandem duplication. SVAtools with MPseq provides comprehensive and accurate whole-genome junction detection with improved breakpoint resolution, compared to karyotype, FISH, and CMA combined. This approach to molecular karyotyping offers considerable diagnostic potential for the simultaneous detection of both novel and recurrent genomic rearrangements in hereditary and neoplastic disorders.
Assuntos
Fusão Gênica/genética , Genoma Humano/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Aberrações Cromossômicas , HumanosRESUMO
TMPRSS2-ERG gene fusions occur in over 50% of prostate cancers, but their impact on clinical outcomes is not well understood. Retention of interstitial genes between TMPRSS2 and ERG has been reported to influence tumor progression in an animal model. In this study, we analyzed the status of TMPRSS2-ERG fusion genes and interstitial genes in tumors from a large cohort of men treated surgically for prostate cancer, associating alterations with biochemical progression. Through whole-genome mate pair sequencing, we mapped and classified rearrangements driving ETS family gene fusions in 133 cases of very low-, low-, intermediate-, and high-risk prostate cancer from radical prostatectomy specimens. TMPRSS2-ERG gene fusions were observed in 44% of cases, and over 90% of these fusions occurred in ERG exons 3 or 4. ERG fusions retaining interstitial sequences occurred more frequently in very low-risk tumors. These tumors also frequently displayed ERG gene fusions involving alternative 5'-partners to TMPRSS2, specifically SLC45A3 and NDRG1 and other ETS family genes, which retained interstitial TMPRSS2/ERG sequences. Lastly, tumors displaying TMPRSS2-ERG fusions that retained interstitial genes were less likely to be associated with biochemical recurrence (P = 0.028). Our results point to more favorable clinical outcomes in patients with ETS family fusion-positive prostate cancers, which retain potential tumor-suppressor genes in the interstitial regions between TMPRSS2 and ERG Identifying these patients at biopsy might improve patient management, particularly with regard to active surveillance. Cancer Res; 77(22); 6157-67. ©2017 AACR.