Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 39
Filtrar
1.
Sci Adv ; 10(5): eadk8598, 2024 Feb 02.
Artigo em Inglês | MEDLINE | ID: mdl-38295174

RESUMO

Here, we characterize the DNA methylation phenotypes of bone marrow cells from mice with hematopoietic deficiency of Dnmt3a or Dnmt3b (or both enzymes) or expressing the dominant-negative Dnmt3aR878H mutation [R882H in humans; the most common DNMT3A mutation found in acute myeloid leukemia (AML)]. Using these cells as substrates, we defined DNA remethylation after overexpressing wild-type (WT) DNMT3A1, DNMT3B1, DNMT3B3 (an inactive splice isoform of DNMT3B), or DNMT3L (a catalytically inactive "chaperone" for DNMT3A and DNMT3B in early embryogenesis). Overexpression of DNMT3A for 2 weeks reverses the hypomethylation phenotype of Dnmt3a-deficient cells or cells expressing the R878H mutation. Overexpression of DNMT3L (which is minimally expressed in AML cells) also corrects the hypomethylation phenotype of Dnmt3aR878H/+ marrow, probably by augmenting the activity of WT DNMT3A encoded by the residual WT allele. DNMT3L reactivation may represent a previously unidentified approach for restoring DNMT3A activity in hematopoietic cells with reduced DNMT3A function.


Assuntos
DNA (Citosina-5-)-Metiltransferases , Leucemia Mieloide Aguda , Humanos , Camundongos , Animais , DNA (Citosina-5-)-Metiltransferases/genética , DNA Metiltransferase 3A , DNA , Mutação , Metilação de DNA , Leucemia Mieloide Aguda/genética
2.
Blood Adv ; 7(16): 4586-4598, 2023 08 22.
Artigo em Inglês | MEDLINE | ID: mdl-37339484

RESUMO

TP53-mutated myeloid malignancies are associated with complex cytogenetics and extensive structural variants, which complicates detailed genomic analysis by conventional clinical techniques. We performed whole-genome sequencing (WGS) of 42 acute myeloid leukemia (AML)/myelodysplastic syndromes (MDS) cases with paired normal tissue to better characterize the genomic landscape of TP53-mutated AML/MDS. WGS accurately determines TP53 allele status, a key prognostic factor, resulting in the reclassification of 12% of cases from monoallelic to multihit. Although aneuploidy and chromothripsis are shared with most TP53-mutated cancers, the specific chromosome abnormalities are distinct to each cancer type, suggesting a dependence on the tissue of origin. ETV6 expression is reduced in nearly all cases of TP53-mutated AML/MDS, either through gene deletion or presumed epigenetic silencing. Within the AML cohort, mutations of NF1 are highly enriched, with deletions of 1 copy of NF1 present in 45% of cases and biallelic mutations in 17%. Telomere content is increased in TP53-mutated AMLs compared with other AML subtypes, and abnormal telomeric sequences were detected in the interstitial regions of chromosomes. These data highlight the unique features of TP53-mutated myeloid malignancies, including the high frequency of chromothripsis and structural variation, the frequent involvement of unique genes (including NF1 and ETV6) as cooperating events, and evidence for altered telomere maintenance.


Assuntos
Cromotripsia , Leucemia Mieloide Aguda , Síndromes Mielodisplásicas , Transtornos Mieloproliferativos , Humanos , Mutação , Aberrações Cromossômicas , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/patologia , Transtornos Mieloproliferativos/genética , Síndromes Mielodisplásicas/genética , Síndromes Mielodisplásicas/patologia , Genômica , Proteína Supressora de Tumor p53/genética
3.
Nature ; 617(7960): 312-324, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-37165242

RESUMO

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.


Assuntos
Genoma Humano , Genômica , Humanos , Diploide , Genoma Humano/genética , Haplótipos/genética , Análise de Sequência de DNA , Genômica/normas , Padrões de Referência , Estudos de Coortes , Alelos , Variação Genética
4.
JCO Precis Oncol ; 7: e2200559, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-37079859

RESUMO

PURPOSE: Persistent molecular disease (PMD) after induction chemotherapy predicts relapse in AML. In this study, we used whole-exome sequencing (WES) and targeted error-corrected sequencing to assess the frequency and mutational patterns of PMD in 30 patients with AML. MATERIALS AND METHODS: The study cohort included 30 patients with adult AML younger than 65 years who were uniformly treated with standard induction chemotherapy. Tumor/normal WES was performed for all patients at presentation. PMD analysis was evaluated in bone marrow samples obtained during clinicopathologic remission using repeat WES and analysis of patient-specific mutations and error-corrected sequencing of 40 recurrently mutated AML genes (MyeloSeq). RESULTS: WES for patient-specific mutations detected PMD in 63% of patients (19/30) using a minimum variant allele fraction (VAF) of 2.5%. In comparison, MyeloSeq identified persistent mutations above 0.1% VAF in 77% of patients (23/30). PMD was usually present at relatively high levels (>2.5% VAFs), such that WES and MyeloSeq agreed for 73% of patients despite differences in detection limits. Mutations in DNMT3A, ASXL1, and TET2 (ie, DTA mutations) were persistent in 16 of 17 patients, but WES also detected non-DTA mutations in 14 of these patients, which for some patients distinguished residual AML cells from clonal hematopoiesis. Surprisingly, MyeloSeq detected additional variants not identified at presentation in 73% of patients that were consistent with new clonal cell populations after chemotherapy. CONCLUSION: PMD and clonal hematopoiesis are both common in patients with AML in first remission. These findings demonstrate the importance of baseline testing for accurate interpretation of mutation-based tumor monitoring assays for patients with AML and highlight the need for clinical trials to determine whether these complex mutation patterns correlate with clinical outcomes in AML.


Assuntos
Leucemia Mieloide Aguda , Humanos , Adulto , Leucemia Mieloide Aguda/genética , Exoma , Prognóstico , Recidiva Local de Neoplasia/genética , Análise de Sequência de DNA
5.
medRxiv ; 2023 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-36711871

RESUMO

TP53 -mutated myeloid malignancies are most frequently associated with complex cytogenetics. The presence of complex and extensive structural variants complicates detailed genomic analysis by conventional clinical techniques. We performed whole genome sequencing of 42 AML/MDS cases with paired normal tissue to characterize the genomic landscape of TP53 -mutated myeloid malignancies. The vast majority of cases had multi-hit involvement at the TP53 genetic locus (94%), as well as aneuploidy and chromothripsis. Chromosomal patterns of aneuploidy differed significantly from TP53 -mutated cancers arising in other tissues. Recurrent structural variants affected regions that include ETV6 on chr12p, RUNX1 on chr21, and NF1 on chr17q. Most notably for ETV6 , transcript expression was low in cases of TP53 -mutated myeloid malignancies both with and without structural rearrangements involving chromosome 12p. Telomeric content is increased in TP53 -mutated AML/MDS compared other AML subtypes, and telomeric content was detected adjacent to interstitial regions of chromosomes. The genomic landscape of TP53 -mutated myeloid malignancies reveals recurrent structural variants affecting key hematopoietic transcription factors and telomeric repeats that are generally not detected by panel sequencing or conventional cytogenetic analyses. Key Points: WGS comprehensively determines TP53 mutation status, resulting in the reclassification of 12% of cases from mono-allelic to multi-hit Chromothripsis is more frequent than previously appreciated, with a preference for specific chromosomes ETV6 is deleted in 45% of cases, with evidence for epigenetic suppression in non-deleted cases NF1 is mutated in 48% of cases, with multi-hit mutations in 17% of these cases TP53 -mutated AML/MDS is associated with altered telomere content compared with other AMLs.

6.
Cell ; 185(18): 3426-3440.e19, 2022 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-36055201

RESUMO

The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-genome sequencing (WGS) data consented for public distribution without access or use restrictions. The final, phase 3 release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low-coverage WGS. Here, we present a high-coverage 3,202-sample WGS 1kGP resource, which now includes 602 complete trios, sequenced to a depth of 30X using Illumina. We performed single-nucleotide variant (SNV) and short insertion and deletion (INDEL) discovery and generated a comprehensive set of structural variants (SVs) by integrating multiple analytic methods through a machine learning model. We show gains in sensitivity and precision of variant calls compared to phase 3, especially among rare SNVs as well as INDELs and SVs spanning frequency spectrum. We also generated an improved reference imputation panel, making variants discovered here accessible for association studies.


Assuntos
Genoma Humano , Sequenciamento Completo do Genoma , Feminino , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Mutação INDEL , Masculino , Polimorfismo de Nucleotídeo Único
7.
iScience ; 25(4): 104004, 2022 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-35313694

RESUMO

Mutations in the gene encoding DNA methyltransferase 3A (DNMT3A) are the most common cause of clonal hematopoiesis and are among the most common initiating events of acute myeloid leukemia (AML). Studies in germline and somatic Dnmt3a knockout mice have identified focal, canonical hypomethylation phenotypes in hematopoietic cells; however, the kinetics of methylation loss following acquired DNMT3A inactivation in hematopoietic cells is essentially unknown. Therefore, we evaluated a somatic, inducible model of hematopoietic Dnmt3a loss, and show that inactivation of Dnmt3a in murine hematopoietic cells results in a relatively slow loss of methylation at canonical sites throughout the genome; in contrast, remethylation of Dnmt3a deficient genomes in hematopoietic cells occurs much more quickly. This data suggests that slow methylation loss may contribute, at least in part, to the long latent period that characterizes clonal expansion and leukemia development in individuals with acquired DNMT3A mutations in hematopoietic stem cells.

8.
Nat Commun ; 12(1): 4549, 2021 07 27.
Artigo em Inglês | MEDLINE | ID: mdl-34315901

RESUMO

Germline pathogenic variants in DNMT3A were recently described in patients with overgrowth, obesity, behavioral, and learning difficulties (DNMT3A Overgrowth Syndrome/DOS). Somatic mutations in the DNMT3A gene are also the most common cause of clonal hematopoiesis, and can initiate acute myeloid leukemia (AML). Using whole genome bisulfite sequencing, we studied DNA methylation in peripheral blood cells of 11 DOS patients and found a focal, canonical hypomethylation phenotype, which is most severe with the dominant negative DNMT3AR882H mutation. A germline mouse model expressing the homologous Dnmt3aR878H mutation phenocopies most aspects of the human DOS syndrome, including the methylation phenotype and an increased incidence of spontaneous hematopoietic malignancies, suggesting that all aspects of this syndrome are caused by this mutation.


Assuntos
Anormalidades Múltiplas/genética , DNA (Citosina-5-)-Metiltransferases/genética , Epigênese Genética , Anormalidades Múltiplas/sangue , Adolescente , Adulto , Animais , Comportamento Animal , Peso Corporal/genética , Células da Medula Óssea/metabolismo , Criança , Pré-Escolar , Ilhas de CpG/genética , Metilação de DNA/genética , DNA Metiltransferase 3A , Feminino , Perfilação da Expressão Gênica , Mutação em Linhagem Germinativa/genética , Hematopoese/genética , Células-Tronco Hematopoéticas/metabolismo , Humanos , Lactente , Leucemia/genética , Leucemia/patologia , Masculino , Camundongos Endogâmicos C57BL , Obesidade/genética , Fenótipo , Síndrome , Transcrição Gênica
9.
Am J Hum Genet ; 108(4): 583-596, 2021 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-33798444

RESUMO

The contribution of genome structural variation (SV) to quantitative traits associated with cardiometabolic diseases remains largely unknown. Here, we present the results of a study examining genetic association between SVs and cardiometabolic traits in the Finnish population. We used sensitive methods to identify and genotype 129,166 high-confidence SVs from deep whole-genome sequencing (WGS) data of 4,848 individuals. We tested the 64,572 common and low-frequency SVs for association with 116 quantitative traits and tested candidate associations using exome sequencing and array genotype data from an additional 15,205 individuals. We discovered 31 genome-wide significant associations at 15 loci, including 2 loci at which SVs have strong phenotypic effects: (1) a deletion of the ALB promoter that is greatly enriched in the Finnish population and causes decreased serum albumin level in carriers (p = 1.47 × 10-54) and is also associated with increased levels of total cholesterol (p = 1.22 × 10-28) and 14 additional cholesterol-related traits, and (2) a multi-allelic copy number variant (CNV) at PDPR that is strongly associated with pyruvate (p = 4.81 × 10-21) and alanine (p = 6.14 × 10-12) levels and resides within a structurally complex genomic region that has accumulated many rearrangements over evolutionary time. We also confirmed six previously reported associations, including five led by stronger signals in single nucleotide variants (SNVs) and one linking recurrent HP gene deletion and cholesterol levels (p = 6.24 × 10-10), which was also found to be strongly associated with increased glycoprotein level (p = 3.53 × 10-35). Our study confirms that integrating SVs in trait-mapping studies will expand our knowledge of genetic factors underlying disease risk.


Assuntos
Doenças Cardiovasculares/genética , Variação Estrutural do Genoma/genética , Alelos , Colesterol/sangue , Variações do Número de Cópias de DNA/genética , Feminino , Finlândia , Genoma Humano/genética , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Proteínas Mitocondriais/genética , Regiões Promotoras Genéticas/genética , Piruvato Desidrogenase (Lipoamida)-Fosfatase/genética , Ácido Pirúvico/metabolismo , Albumina Sérica Humana/genética
10.
Science ; 372(6537)2021 04 02.
Artigo em Inglês | MEDLINE | ID: mdl-33632895

RESUMO

Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci. We identified 107,590 structural variants (SVs), of which 68% were not discovered with short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterized 130 of the most active mobile element source elements and found that 63% of all SVs arise through homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.


Assuntos
Variação Genética , Genoma Humano , Haplótipos , Feminino , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação INDEL , Sequências Repetitivas Dispersas , Masculino , Grupos Populacionais/genética , Locos de Características Quantitativas , Retroelementos , Análise de Sequência de DNA , Inversão de Sequência , Sequenciamento Completo do Genoma
11.
Nature ; 583(7814): 83-89, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32460305

RESUMO

A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline1 to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.


Assuntos
Variação Genética , Genoma Humano/genética , Sequenciamento Completo do Genoma , Alelos , Estudos de Casos e Controles , Epigênese Genética , Feminino , Dosagem de Genes/genética , Genética Populacional , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Anotação de Sequência Molecular , Locos de Características Quantitativas , Grupos Raciais/genética , Software
13.
Nature ; 572(7769): 323-328, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-31367044

RESUMO

Exome-sequencing studies have generally been underpowered to identify deleterious alleles with a large effect on complex traits as such alleles are mostly rare. Because the population of northern and eastern Finland has expanded considerably and in isolation following a series of bottlenecks, individuals of these populations have numerous deleterious alleles at a relatively high frequency. Here, using exome sequencing of nearly 20,000 individuals from these regions, we investigate the role of rare coding variants in clinically relevant quantitative cardiometabolic traits. Exome-wide association studies for 64 quantitative traits identified 26 newly associated deleterious alleles. Of these 26 alleles, 19 are either unique to or more than 20 times more frequent in Finnish individuals than in other Europeans and show geographical clustering comparable to Mendelian disease mutations that are characteristic of the Finnish population. We estimate that sequencing studies of populations without this unique history would require hundreds of thousands to millions of participants to achieve comparable association power.


Assuntos
Sequenciamento do Exoma , Estudos de Associação Genética/métodos , Predisposição Genética para Doença/genética , Variação Genética/genética , Locos de Características Quantitativas/genética , Alelos , HDL-Colesterol/genética , Análise por Conglomerados , Determinação de Ponto Final , Finlândia , Mapeamento Geográfico , Humanos , Herança Multifatorial/genética , Reprodutibilidade dos Testes
14.
Bioinformatics ; 35(22): 4782-4787, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-31218349

RESUMO

SUMMARY: Large-scale human genetics studies are now employing whole genome sequencing with the goal of conducting comprehensive trait mapping analyses of all forms of genome variation. However, methods for structural variation (SV) analysis have lagged far behind those for smaller scale variants, and there is an urgent need to develop more efficient tools that scale to the size of human populations. Here, we present a fast and highly scalable software toolkit (svtools) and cloud-based pipeline for assembling high quality SV maps-including deletions, duplications, mobile element insertions, inversions and other rearrangements-in many thousands of human genomes. We show that this pipeline achieves similar variant detection performance to established per-sample methods (e.g. LUMPY), while providing fast and affordable joint analysis at the scale of ≥100 000 genomes. These tools will help enable the next generation of human genetics studies. AVAILABILITY AND IMPLEMENTATION: svtools is implemented in Python and freely available (MIT) from https://github.com/hall-lab/svtools. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Genoma Humano , Software , Humanos , Deleção de Sequência , Sequenciamento Completo do Genoma
15.
Exp Mol Pathol ; 102(1): 156-161, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-28093192

RESUMO

Recurrent genomic mutations in uterine and non-uterine leiomyosarcomas have not been well established. Using a next generation sequencing (NGS) panel of common cancer-associated genes, 25 leiomyosarcomas arising from multiple sites were examined to explore genetic alterations, including single nucleotide variants (SNV), small insertions/deletions (indels), and copy number alterations (CNA). Sequencing showed 86 non-synonymous, coding region somatic variants within 151 gene targets in 21 cases, with a mean of 4.1 variants per case; 4 cases had no putative mutations in the panel of genes assayed. The most frequently altered genes were TP53 (36%), ATM and ATRX (16%), and EGFR and RB1 (12%). CNA were identified in 85% of cases, with the most frequent copy number losses observed in chromosomes 10 and 13 including PTEN and RB1; the most frequent gains were seen in chromosomes 7 and 17. Our data show that deletions in canonical cancer-related genes are common in leiomyosarcomas. Further, the spectrum of gene mutations observed shows that defects in DNA repair and chromosomal maintenance are central to the biology of leiomyosarcomas, and that activating mutations observed in other common cancer types are rare in leiomyosarcomas.


Assuntos
Predisposição Genética para Doença/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Leiomiossarcoma/genética , Mutação , Adolescente , Adulto , Idoso , Proteínas Mutadas de Ataxia Telangiectasia/genética , Variações do Número de Cópias de DNA , DNA Helicases/genética , Receptores ErbB/genética , Feminino , Humanos , Mutação INDEL , Leiomiossarcoma/patologia , Masculino , Pessoa de Meia-Idade , Proteínas Nucleares/genética , Polimorfismo de Nucleotídeo Único , Proteína do Retinoblastoma/genética , Proteína Supressora de Tumor p53/genética , Proteína Nuclear Ligada ao X , Adulto Jovem
16.
J Mol Diagn ; 19(1): 35-42, 2017 01.
Artigo em Inglês | MEDLINE | ID: mdl-27863262

RESUMO

Quality assurance for clinical next-generation sequencing (NGS)-based assays is difficult given the complex methods and the range of sequence variants such assays can detect. As the number and range of mutations detected by clinical NGS assays has increased, it is difficult to apply standard analyte-specific proficiency testing (PT). Most current proficiency testing challenges for NGS are methods-based PT surveys that use DNA from reference samples engineered to harbor specific mutations that test both sequence generation and bioinformatics analysis. These methods-based PTs are limited by the number and types of mutations that can be physically introduced into a single DNA sample. In silico proficiency testing, which evaluates only the bioinformatics component of NGS assays, is a recently introduced PT method that allows for evaluation of numerous mutations spanning a range of variant classes. In silico PT data sets can be generated from simulated or actual sequencing data and are used to test alignment through variant detection and annotation steps. In silico PT has several advantages over the use of physical samples, including greater flexibility in tested variants, the ability to design laboratory-specific challenges, and lower costs. Herein, we review the use of in silico PT as an alternative to traditional methods-based PT as it is evolving in oncology applications and discuss how the approach is applicable more broadly.


Assuntos
Análise Mutacional de DNA/normas , Sequenciamento de Nucleotídeos em Larga Escala/normas , Ensaio de Proficiência Laboratorial/métodos , Sequência de Bases , Biologia Computacional , Simulação por Computador , Frequência do Gene , Humanos , Padrões de Referência
17.
Bioinformatics ; 33(7): 1083-1085, 2017 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-28031184

RESUMO

Summary: Here we present SVScore, a tool for in silico structural variation (SV) impact prediction. SVScore aggregates per-base single nucleotide polymorphism (SNP) pathogenicity scores across relevant genomic intervals for each SV in a manner that considers variant type, gene features and positional uncertainty. We show that the allele frequency spectrum of high-scoring SVs is strongly skewed toward lower frequencies, suggesting that they are under purifying selection, and that SVScore identifies deleterious variants more effectively than alternative methods. Notably, our results also suggest that duplications are under surprisingly strong selection relative to deletions, and that there are a similar number of strongly pathogenic SVs and SNPs in the human population. Availability and Implementation: SVScore is implemented in Perl and available freely at {{ http://www.github.com/lganel/SVScore }} for use under the MIT license. Contact: ihall@wustl.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Variação Estrutural do Genoma , Software , Frequência do Gene , Genômica/métodos , Humanos , Polimorfismo de Nucleotídeo Único , Deleção de Sequência
18.
Arch Pathol Lab Med ; 140(10): 1085-91, 2016 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-27388684

RESUMO

CONTEXT: -Most current proficiency testing challenges for next-generation sequencing assays are methods-based proficiency testing surveys that use DNA from characterized reference samples to test both the wet-bench and bioinformatics/dry-bench aspects of the tests. Methods-based proficiency testing surveys are limited by the number and types of mutations that either are naturally present or can be introduced into a single DNA sample. OBJECTIVE: -To address these limitations by exploring a model of in silico proficiency testing in which sequence data from a single well-characterized specimen are manipulated electronically. DESIGN: -DNA from the College of American Pathologists reference genome was enriched using the Illumina TruSeq and Life Technologies AmpliSeq panels and sequenced on the MiSeq and Ion Torrent platforms, respectively. The resulting data were mutagenized in silico and 26 variants, including single-nucleotide variants, deletions, and dinucleotide substitutions, were added at variant allele fractions (VAFs) from 10% to 50%. Participating clinical laboratories downloaded these files and analyzed them using their clinical bioinformatics pipelines. RESULTS: -Laboratories using the AmpliSeq/Ion Torrent and/or the TruSeq/MiSeq participated in the 2 surveys. On average, laboratories identified 24.6 of 26 variants (95%) overall and 21.4 of 22 variants (97%) with VAFs greater than 15%. No false-positive calls were reported. The most frequently missed variants were single-nucleotide variants with VAFs less than 15%. Across both challenges, reported VAF concordance was excellent, with less than 1% median absolute difference between the simulated VAF and mean reported VAF. CONCLUSIONS: -The results indicate that in silico proficiency testing is a feasible approach for methods-based proficiency testing, and demonstrate that the sensitivity and specificity of current next-generation sequencing bioinformatics across clinical laboratories are high.


Assuntos
Simulação por Computador , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Ensaio de Proficiência Laboratorial/métodos , Patologia Clínica/métodos , Alelos , DNA/química , DNA/genética , Estudos de Viabilidade , Frequência do Gene , Testes Genéticos/métodos , Genoma Humano/genética , Humanos , Mutação , Polimorfismo de Nucleotídeo Único , Reprodutibilidade dos Testes
19.
BMC Geriatr ; 16: 80, 2016 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-27060904

RESUMO

BACKGROUND: The Long Life Family Study (LLFS) is an international study to identify the genetic components of various healthy aging phenotypes. We hypothesized that pedigree-specific rare variants at longevity-associated genes could have a similar functional impact on healthy phenotypes. METHODS: We performed custom hybridization capture sequencing to identify the functional variants in 464 candidate genes for longevity or the major diseases of aging in 615 pedigrees (4,953 individuals) from the LLFS, using a multiplexed, custom hybridization capture. Variants were analyzed individually or as a group across an entire gene for association to aging phenotypes using family based tests. RESULTS: We found significant associations to three genes and nine single variants. Most notably, we found a novel variant significantly associated with exceptional survival in the 3' UTR OBFC1 in 13 individuals from six pedigrees. OBFC1 (chromosome 10) is involved in telomere maintenance, and falls within a linkage peak recently reported from an analysis of telomere length in LLFS families. Two different algorithms for single gene associations identified three genes with an enrichment of variation that was significantly associated with three phenotypes (GSK3B with the Healthy Aging Index, NOTCH1 with diastolic blood pressure and TP53 with serum HDL). CONCLUSIONS: Sequencing analysis of family-based associations for age-related phenotypes can identify rare or novel variants.


Assuntos
Estudos de Associação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Longevidade/genética , Linhagem , Fenótipo , Idoso , Feminino , Testes Genéticos , Variação Genética/genética , Humanos , Masculino
20.
Am J Clin Pathol ; 144(4): 667-74, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26386089

RESUMO

OBJECTIVES: To evaluate the extent of human-to-human specimen contamination in clinical next-generation sequencing (NGS) data. METHODS: Using haplotype analysis to detect specimen admixture, with orthogonal validation by short tandem repeat analysis, we determined the rate of clinically significant (>5%) DNA contamination in clinical NGS data from 296 consecutive cases. Haplotype analysis was performed using read haplotypes at common, closely spaced single-nucleotide polymorphisms in low linkage disequilibrium in the population, which were present in regions targeted by the clinical assay. Percent admixture was estimated based on frequencies of the read haplotypes at loci that showed evidence for contamination. RESULTS: We identified nine (3%) cases with at least 5% DNA admixture. Three cases were bone marrow transplant patients known to be chimeric. Six admixed cases were incidents of contamination, and the rate of contamination was strongly correlated with DNA yield from the tissue specimen. CONCLUSIONS: Human-human specimen contamination occurs in clinical NGS testing. Tools for detecting contamination in NGS sequence data should be integrated into clinical bioinformatics pipelines, especially as laboratories trend toward using smaller amounts of input DNA and reporting lower frequency variants. This study provides one estimate of the rate of clinically significant human-human specimen contamination in clinical NGS testing.


Assuntos
Contaminação por DNA , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neoplasias/genética , Patologia Molecular/normas , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...