RESUMEN
Recommendations for laboratories to report incidental findings from genomic tests have stimulated interest in such results. In order to investigate the criteria and processes for assigning the pathogenicity of specific variants and to estimate the frequency of such incidental findings in patients of European and African ancestry, we classified potentially actionable pathogenic single-nucleotide variants (SNVs) in all 4300 European- and 2203 African-ancestry participants sequenced by the NHLBI Exome Sequencing Project (ESP). We considered 112 gene-disease pairs selected by an expert panel as associated with medically actionable genetic disorders that may be undiagnosed in adults. The resulting classifications were compared to classifications from other clinical and research genetic testing laboratories, as well as with in silico pathogenicity scores. Among European-ancestry participants, 30 of 4300 (0.7%) had a pathogenic SNV and six (0.1%) had a disruptive variant that was expected to be pathogenic, whereas 52 (1.2%) had likely pathogenic SNVs. For African-ancestry participants, six of 2203 (0.3%) had a pathogenic SNV and six (0.3%) had an expected pathogenic disruptive variant, whereas 13 (0.6%) had likely pathogenic SNVs. Genomic Evolutionary Rate Profiling mammalian conservation score and the Combined Annotation Dependent Depletion summary score of conservation, substitution, regulation, and other evidence were compared across pathogenicity assignments and appear to have utility in variant classification. This work provides a refined estimate of the burden of adult onset, medically actionable incidental findings expected from exome sequencing, highlights challenges in variant classification, and demonstrates the need for a better curated variant interpretation knowledge base.
Asunto(s)
Exoma , Genómica , Hallazgos Incidentales , Adulto , Población Negra/genética , Femenino , Frecuencia de los Genes , Genes Dominantes , Estudios de Asociación Genética , Pruebas Genéticas , Genoma Humano , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Fenotipo , Polimorfismo de Nucleótido Simple , Población Blanca/genéticaRESUMEN
Recurrent deletions have been associated with numerous diseases and genomic disorders. Few, however, have been resolved at the molecular level because their breakpoints often occur in highly copy-number-polymorphic duplicated sequences. We present an approach that uses a combination of somatic cell hybrids, array comparative genomic hybridization, and the specificity of next-generation sequencing to determine breakpoints that occur within segmental duplications. Applying our technique to the 17q21.31 microdeletion syndrome, we used genome sequencing to determine copy-number-variant breakpoints in three deletion-bearing individuals with molecular resolution. For two cases, we observed breakpoints consistent with nonallelic homologous recombination involving only H2 chromosomal haplotypes, as expected. Molecular resolution revealed that the breakpoints occurred at different locations within a 145 kbp segment of >99% identity and disrupt KANSL1 (previously known as KANSL1). In the remaining case, we found that unequal crossover occurred interchromosomally between the H1 and H2 haplotypes and that this event was mediated by a homologous sequence that was once again missing from the human reference. Interestingly, the breakpoints mapped preferentially to gaps in the current reference genome assembly, which we resolved in this study. Our method provides a strategy for the identification of breakpoints within complex regions of the genome harboring high-identity and copy-number-polymorphic segmental duplication. The approach should become particularly useful as high-quality alternate reference sequences become available and genome sequencing of individuals' DNA becomes more routine.
Asunto(s)
Puntos de Rotura del Cromosoma , Cromosomas Humanos Par 17/genética , Análisis de Secuencia de ADN/métodos , Secuencia de Bases , Deleción Cromosómica , Hibridación Genómica Comparativa/métodos , Variaciones en el Número de Copia de ADN , Haplotipos , Recombinación Homóloga , Humanos , Datos de Secuencia Molecular , Duplicaciones Segmentarias en el Genoma , Síndrome de Smith-MagenisRESUMEN
While copy number variation (CNV) is an active area of research, de novo mutation rates within human populations are not well characterized. By focusing on large (>100 kbp) events, we estimate the rate of de novo CNV formation in humans by analyzing 4394 transmissions from human pedigrees with and without neurocognitive disease. We show that a significant limitation in directly measuring genome-wide CNV mutation is accessing DNA derived from primary tissues as opposed to cell lines. We conservatively estimated the genome-wide CNV mutation rate using single nucleotide polymorphism (SNP) microarrays to analyze whole-blood derived DNA from asthmatic trios, a collection in which we observed no elevation in the prevalence of large CNVs. At a resolution of â¼30 kb, nine de novo CNVs were observed from 772 transmissions, corresponding to a mutation rate of µ = 1.2 × 10(-2) CNVs per genome per transmission (µ = 6.5 × 10(-3) for CNVs >500 kb). Combined with previous estimates of CNV prevalence and assuming a model of mutation-selection balance, we estimate significant purifying selection for large (>500 kb) events at the genome-wide level to be s = 0.16. Supporting this, we identify de novo CNVs in 717 multiplex autism pedigrees from the AGRE collection and observe a fourfold enrichment (P = 1.4 × 10(-3)) for de novo CNVs in cases of multiplex autism versus unaffected siblings, suggesting that many de novo CNV mutations contribute a subtle, but significant risk for autism. We observe no parental bias in the origin or transmission of CNVs among any of the cohorts studied.
Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Frecuencia de los Genes , Selección Genética , Adolescente , Asma/genética , Trastorno Autístico/genética , Secuencia de Bases , Niño , Preescolar , Femenino , Estudio de Asociación del Genoma Completo/métodos , Humanos , Masculino , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Linaje , Polimorfismo de Nucleótido SimpleRESUMEN
PURPOSE: To determine the role of CD49d for response to Bruton's tyrosine kinase inhibitors (BTKi) in patients with chronic lymphocytic leukemia (CLL). PATIENTS AND METHODS: In patients treated with acalabrutinib (n = 48), CD49d expression, VLA-4 integrin activation, and tumor transcriptomes of CLL cells were assessed. Clinical responses to BTKis were investigated in acalabrutinib- (n = 48; NCT02337829) and ibrutinib-treated (n = 73; NCT01500733) patients. RESULTS: In patients treated with acalabrutinib, treatment-induced lymphocytosis was comparable for both subgroups but resolved more rapidly for CD49d+ cases. Acalabrutinib inhibited constitutive VLA-4 activation but was insufficient to block BCR and CXCR4-mediated inside-out activation. Transcriptomes of CD49d+ and CD49d- cases were compared using RNA sequencing at baseline and at 1 and 6 months on treatment. Gene set enrichment analysis revealed increased constitutive NF-κB and JAK-STAT signaling, enhanced survival, adhesion, and migratory capacity in CD49d+ over CD49d- CLL that was maintained during therapy. In the combined cohorts of 121 BTKi-treated patients, 48 (39.7%) progressed on treatment with BTK and/or PLCG2 mutations detected in 87% of CLL progressions. Consistent with a recent report, homogeneous and bimodal CD49d-positive cases (the latter having concurrent CD49d+ and CD49d- CLL subpopulations, irrespective of the traditional 30% cutoff value) had a shorter time to progression of 6.6 years, whereas 90% of cases homogenously CD49d- were estimated progression-free at 8 years (P = 0.0004). CONCLUSIONS: CD49d/VLA-4 emerges as a microenvironmental factor that contributes to BTKi resistance in CLL. The prognostic value of CD49d is improved by considering bimodal CD49d expression. See related commentary by Tissino et al., p. 3560.
Asunto(s)
Integrina alfa4beta1 , Leucemia Linfocítica Crónica de Células B , Humanos , Leucemia Linfocítica Crónica de Células B/tratamiento farmacológico , Leucemia Linfocítica Crónica de Células B/genética , Leucemia Linfocítica Crónica de Células B/patología , Supervivencia sin Progresión , Agammaglobulinemia Tirosina Quinasa/antagonistas & inhibidores , Relevancia Clínica , Inhibidores de Proteínas Quinasas/farmacologíaRESUMEN
Copy number variants (CNVs) contribute to human genetic and phenotypic diversity. However, the distribution of larger CNVs in the general population remains largely unexplored. We identify large variants in approximately 2500 individuals by using Illumina SNP data, with an emphasis on "hotspots" prone to recurrent mutations. We find variants larger than 500 kb in 5%-10% of individuals and variants greater than 1 Mb in 1%-2%. In contrast to previous studies, we find limited evidence for stratification of CNVs in geographically distinct human populations. Importantly, our sample size permits a robust distinction between truly rare and polymorphic but low-frequency copy number variation. We find that a significant fraction of individual CNVs larger than 100 kb are rare and that both gene density and size are strongly anticorrelated with allele frequency. Thus, although large CNVs commonly exist in normal individuals, which suggests that size alone can not be used as a predictor of pathogenicity, such variation is generally deleterious. Considering these observations, we combine our data with published CNVs from more than 12,000 individuals contrasting control and neurological disease collections. This analysis identifies known disease loci and highlights additional CNVs (e.g., 3q29, 16p12, and 15q25.2) for further investigation. This study provides one of the first analyses of large, rare (0.1%-1%) CNVs in the general population, with insights relevant to future analyses of genetic disease.
Asunto(s)
Enfermedades Genéticas Congénitas/genética , Variación Genética , Genética de Población , Dosificación de Gen , Duplicación de Gen , Genoma Humano , Genotipo , Geografía , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo Genético , Polimorfismo de Nucleótido Simple , Eliminación de SecuenciaRESUMEN
BACKGROUND: Duplications and deletions in the human genome can cause disease or predispose persons to disease. Advances in technologies to detect these changes allow for the routine identification of submicroscopic imbalances in large numbers of patients. METHODS: We tested for the presence of microdeletions and microduplications at a specific region of chromosome 1q21.1 in two groups of patients with unexplained mental retardation, autism, or congenital anomalies and in unaffected persons. RESULTS: We identified 25 persons with a recurrent 1.35-Mb deletion within 1q21.1 from screening 5218 patients. The microdeletions had arisen de novo in eight patients, were inherited from a mildly affected parent in three patients, were inherited from an apparently unaffected parent in six patients, and were of unknown inheritance in eight patients. The deletion was absent in a series of 4737 control persons (P=1.1x10(-7)). We found considerable variability in the level of phenotypic expression of the microdeletion; phenotypes included mild-to-moderate mental retardation, microcephaly, cardiac abnormalities, and cataracts. The reciprocal duplication was enriched in nine children with mental retardation or autism spectrum disorder and other variable features (P=0.02). We identified three deletions and three duplications of the 1q21.1 region in an independent sample of 788 patients with mental retardation and congenital anomalies. CONCLUSIONS: We have identified recurrent molecular lesions that elude syndromic classification and whose disease manifestations must be considered in a broader context of development as opposed to being assigned to a specific disease. Clinical diagnosis in patients with these lesions may be most readily achieved on the basis of genotype rather than phenotype.
Asunto(s)
Trastorno Autístico/genética , Aberraciones Cromosómicas , Cromosomas Humanos Par 1/genética , Anomalías Congénitas/genética , Discapacidad Intelectual/genética , Catarata/congénito , Catarata/genética , Niño , Deleción Cromosómica , Femenino , Duplicación de Gen , Reordenamiento Génico , Variación Genética , Cardiopatías Congénitas/genética , Humanos , Masculino , Microcefalia/genética , Fenotipo , Recombinación GenéticaRESUMEN
[This corrects the article DOI: 10.1371/journal.pone.0025598.].
RESUMEN
The most common recurrent copy-number variants associated with autism, developmental delay and epilepsy are flanked by segmental duplications. Complete genetic characterization of these events is challenging because their breakpoints often occur within high-identity, copy-number polymorphic paralogous sequences that cannot be specifically assayed using hybridization-based methods. Here we provide a protocol for breakpoint resolution with sequence-level precision. Massively parallel sequencing is performed on libraries generated from haplotype-resolved chromosomes, genomic DNA or molecular inversion probe (MIP)-captured breakpoint-informative regions harboring paralog-distinguishing variants. Quantification of sequencing depth over informative sites enables breakpoint localization, typically within several kilobases to tens of kilobases. Depending on the approach used, the sequencing platform, and the accuracy and completeness of the reference genome sequence, this protocol takes from a few days to several months to complete. Once established for a specific genomic disorder, it is possible to process thousands of DNA samples within as little as 3-4 weeks.
Asunto(s)
Puntos de Rotura del Cromosoma , Genoma/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Duplicaciones Segmentarias en el Genoma/genética , Biología Computacional/métodos , Biblioteca Genómica , HumanosRESUMEN
Complement factor H shows very strong association with Age-related Macular Degeneration (AMD), and recent data suggest that multiple causal variants are associated with disease. To refine the location of the disease associated variants, we characterized in detail the structural variation at CFH and its paralogs, including two copy number polymorphisms (CNP), CNP147 and CNP148, and several rare deletions and duplications. Examination of 34 AMD-enriched extended families (Nâ=â293) and AMD cases (White Nâ=â4210 Indianâ=â134; Malayâ=â140) and controls (White Nâ=â3229; Indianâ=â117; Malayâ=â2390) demonstrated that deletion CNP148 was protective against AMD, independent of SNPs at CFH. Regression analysis of seven common haplotypes showed three haplotypes, H1, H6 and H7, as conferring risk for AMD development. Being the most common haplotype H1 confers the greatest risk by increasing the odds of AMD by 2.75-fold (95% CIâ=â[2.51, 3.01]; pâ=â8.31×10(-109)); Caucasian (H6) and Indian-specific (H7) recombinant haplotypes increase the odds of AMD by 1.85-fold (pâ=â3.52×10(-9)) and by 15.57-fold (Pâ=â0.007), respectively. We identified a 32-kb region downstream of Y402H (rs1061170), shared by all three risk haplotypes, suggesting that this region may be critical for AMD development. Further analysis showed that two SNPs within the 32 kb block, rs1329428 and rs203687, optimally explain disease association. rs1329428 resides in 20 kb unique sequence block, but rs203687 resides in a 12 kb block that is 89% similar to a noncoding region contained in ΔCNP148. We conclude that causal variation in this region potentially encompasses both regulatory effects at single markers and copy number.
Asunto(s)
Emparejamiento Base/genética , Factor H de Complemento/genética , Predisposición Genética a la Enfermedad , Degeneración Macular/genética , Polimorfismo de Nucleótido Simple/genética , Secuencia de Bases , Estudios de Cohortes , Variaciones en el Número de Copia de ADN/genética , Haplotipos/genética , Humanos , Desequilibrio de Ligamiento/genética , Datos de Secuencia Molecular , Familia de Multigenes/genética , Mutación/genética , Estándares de Referencia , Reproducibilidad de los Resultados , Factores de RiesgoRESUMEN
We report the identification of a recurrent, 520-kb 16p12.1 microdeletion associated with childhood developmental delay. The microdeletion was detected in 20 of 11,873 cases compared with 2 of 8,540 controls (P = 0.0009, OR = 7.2) and replicated in a second series of 22 of 9,254 cases compared with 6 of 6,299 controls (P = 0.028, OR = 2.5). Most deletions were inherited, with carrier parents likely to manifest neuropsychiatric phenotypes compared to non-carrier parents (P = 0.037, OR = 6). Probands were more likely to carry an additional large copy-number variant when compared to matched controls (10 of 42 cases, P = 5.7 x 10(-5), OR = 6.6). The clinical features of individuals with two mutations were distinct from and/or more severe than those of individuals carrying only the co-occurring mutation. Our data support a two-hit model in which the 16p12.1 microdeletion both predisposes to neuropsychiatric phenotypes as a single event and exacerbates neurodevelopmental phenotypes in association with other large deletions or duplications. Analysis of other microdeletions with variable expressivity indicates that this two-hit model might be more generally applicable to neuropsychiatric disease.
Asunto(s)
Deleción Cromosómica , Cromosomas Humanos Par 16 , Discapacidades del Desarrollo/genética , Modelos Genéticos , Adulto , Estudios de Casos y Controles , Niño , Preescolar , Cromosomas Humanos Par 16/genética , Hibridación Genómica Comparativa/métodos , Familia , Frecuencia de los Genes , Humanos , Lactante , Análisis de Secuencia por Matrices de Oligonucleótidos , Linaje , Fenotipo , Polimorfismo de Nucleótido Simple , Recurrencia , Índice de Severidad de la EnfermedadRESUMEN
Copy-number variants (CNVs) occur frequently within the human genome, and may be associated with many human phenotypes. If disease association studies of CNVs are to be performed routinely, it is essential that the copy-number status be accurately genotyped. We systematically assessed the dynamic range response of an oligonucleotide microarray platform to accurately predict copy-number in a set of seven patients who had previously been shown to carry between 1 and 6 copies of an approximately 4 Mb region of 15q12.2-q13.1. We identify probe uniqueness, probe length, uniformity of probe melting temperature, overlap with SNPs and common repeats (particularly Alu elements) and guanine homopolymer content as parameters that significantly affect probe performance. Further, we prove the influence of these criteria on array performance by using these parameters to prospectively filter data from a second array design covering an independent genomic region and observing significant improvements in data quality. The informed selection of probes which have superior performance characteristics allows the prospective design of oligonucleotide arrays which show increased sensitivity and specificity compared with current designs. Although based on the analysis of data from comparative genomic hybridization experiments, we anticipate that our results are relevant to the design of improved oligonucleotide arrays for high-throughput copy-number genotyping of complex regions of the human genome.