Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 39
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Blood ; 142(8): 711-723, 2023 08 24.
Artículo en Inglés | MEDLINE | ID: mdl-37216686

RESUMEN

Intrachromosomal amplification of chromosome 21 defines a subtype of high-risk childhood acute lymphoblastic leukemia (iAMP21-ALL) characterized by copy number changes and complex rearrangements of chromosome 21. The genomic basis of iAMP21-ALL and the pathogenic role of the region of amplification of chromosome 21 to leukemogenesis remains incompletely understood. In this study, using integrated whole genome and transcriptome sequencing of 124 patients with iAMP21-ALL, including rare cases arising in the context of constitutional chromosomal aberrations, we identified subgroups of iAMP21-ALL based on the patterns of copy number alteration and structural variation. This large data set enabled formal delineation of a 7.8 Mb common region of amplification harboring 71 genes, 43 of which were differentially expressed compared with non-iAMP21-ALL ones, including multiple genes implicated in the pathogenesis of acute leukemia (CHAF1B, DYRK1A, ERG, HMGN1, and RUNX1). Using multimodal single-cell genomic profiling, including single-cell whole genome sequencing of 2 cases, we documented clonal heterogeneity and genomic evolution, demonstrating that the acquisition of the iAMP21 chromosome is an early event that may undergo progressive amplification during disease ontogeny. We show that UV-mutational signatures and high mutation load are characteristic secondary genetic features. Although the genomic alterations of chromosome 21 are variable, these integrated genomic analyses and demonstration of an extended common minimal region of amplification broaden the definition of iAMP21-ALL for more precise diagnosis using cytogenetic or genomic methods to inform clinical management.


Asunto(s)
Cromosomas Humanos Par 21 , Leucemia-Linfoma Linfoblástico de Células Precursoras , Humanos , Niño , Cromosomas Humanos Par 21/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Aberraciones Cromosómicas , Citogenética , Genómica , Factor 1 de Ensamblaje de la Cromatina/genética
2.
Pharmacogenomics J ; 21(2): 251-261, 2021 04.
Artículo en Inglés | MEDLINE | ID: mdl-33462347

RESUMEN

Responsible for the metabolism of ~21% of clinically used drugs, CYP2D6 is a critical component of personalized medicine initiatives. Genotyping CYP2D6 is challenging due to sequence similarity with its pseudogene paralog CYP2D7 and a high number and variety of common structural variants (SVs). Here we describe a novel bioinformatics method, Cyrius, that accurately genotypes CYP2D6 using whole-genome sequencing (WGS) data. We show that Cyrius has superior performance (96.5% concordance with truth genotypes) compared to existing methods (84-86.8%). After implementing the improvements identified from the comparison against the truth data, Cyrius's accuracy has since been improved to 99.3%. Using Cyrius, we built a haplotype frequency database from 2504 ethnically diverse samples and estimate that SV-containing star alleles are more frequent than previously reported. Cyrius will be an important tool to incorporate pharmacogenomics in WGS-based precision medicine initiatives.


Asunto(s)
Citocromo P-450 CYP2D6/genética , Técnicas de Genotipaje/métodos , Alelos , Biología Computacional/métodos , Etnicidad/genética , Genotipo , Haplotipos/genética , Humanos , Polimorfismo Genético/genética , Secuenciación Completa del Genoma/métodos
3.
Genome Res ; 27(1): 157-164, 2017 01.
Artículo en Inglés | MEDLINE | ID: mdl-27903644

RESUMEN

Improvement of variant calling in next-generation sequence data requires a comprehensive, genome-wide catalog of high-confidence variants called in a set of genomes for use as a benchmark. We generated deep, whole-genome sequence data of 17 individuals in a three-generation pedigree and called variants in each genome using a range of currently available algorithms. We used haplotype transmission information to create a phased "Platinum" variant catalog of 4.7 million single-nucleotide variants (SNVs) plus 0.7 million small (1-50 bp) insertions and deletions (indels) that are consistent with the pattern of inheritance in the parents and 11 children of this pedigree. Platinum genotypes are highly concordant with the current catalog of the National Institute of Standards and Technology for both SNVs (>99.99%) and indels (99.92%) and add a validated truth catalog that has 26% more SNVs and 45% more indels. Analysis of 334,652 SNVs that were consistent between informatics pipelines yet inconsistent with haplotype transmission ("nonplatinum") revealed that the majority of these variants are de novo and cell-line mutations or reside within previously unidentified duplications and deletions. The reference materials from this study are a resource for objective assessment of the accuracy of variant calls throughout genomes.


Asunto(s)
Genoma Humano/genética , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Algoritmos , Bases de Datos Genéticas , Exoma/genética , Genotipo , Humanos , Mutación INDEL/genética , Linaje , Polimorfismo de Nucleótido Simple , Programas Informáticos
4.
Genome Res ; 27(11): 1895-1903, 2017 11.
Artículo en Inglés | MEDLINE | ID: mdl-28887402

RESUMEN

Identifying large expansions of short tandem repeats (STRs), such as those that cause amyotrophic lateral sclerosis (ALS) and fragile X syndrome, is challenging for short-read whole-genome sequencing (WGS) data. A solution to this problem is an important step toward integrating WGS into precision medicine. We developed a software tool called ExpansionHunter that, using PCR-free WGS short-read data, can genotype repeats at the locus of interest, even if the expanded repeat is larger than the read length. We applied our algorithm to WGS data from 3001 ALS patients who have been tested for the presence of the C9orf72 repeat expansion with repeat-primed PCR (RP-PCR). Compared against this truth data, ExpansionHunter correctly classified all (212/212, 95% CI [0.98, 1.00]) of the expanded samples as either expansions (208) or potential expansions (4). Additionally, 99.9% (2786/2789, 95% CI [0.997, 1.00]) of the wild-type samples were correctly classified as wild type by this method with the remaining three samples identified as possible expansions. We further applied our algorithm to a set of 152 samples in which every sample had one of eight different pathogenic repeat expansions, including those associated with fragile X syndrome, Friedreich's ataxia, and Huntington's disease, and correctly flagged all but one of the known repeat expansions. Thus, ExpansionHunter can be used to accurately detect known pathogenic repeat expansions and provides researchers with a tool that can be used to identify new pathogenic repeat expansions.


Asunto(s)
Esclerosis Amiotrófica Lateral/genética , Expansión de las Repeticiones de ADN , Secuenciación Completa del Genoma/métodos , Algoritmos , Proteína C9orf72/genética , Bases de Datos Genéticas , Humanos , Medicina de Precisión , Sensibilidad y Especificidad , Programas Informáticos
5.
Bioinformatics ; 35(22): 4754-4756, 2019 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-31134279

RESUMEN

SUMMARY: We describe a novel computational method for genotyping repeats using sequence graphs. This method addresses the long-standing need to accurately genotype medically important loci containing repeats adjacent to other variants or imperfect DNA repeats such as polyalanine repeats. Here we introduce a new version of our repeat genotyping software, ExpansionHunter, that uses this method to perform targeted genotyping of a broad class of such loci. AVAILABILITY AND IMPLEMENTATION: ExpansionHunter is implemented in C++ and is available under the Apache License Version 2.0. The source code, documentation, and Linux/macOS binaries are available at https://github.com/Illumina/ExpansionHunter/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Repeticiones de Microsatélite , Programas Informáticos , Genotipo
6.
Genet Med ; 22(5): 945-953, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-32066871

RESUMEN

PURPOSE: Spinal muscular atrophy (SMA), caused by loss of the SMN1 gene, is a leading cause of early childhood death. Due to the near identical sequences of SMN1 and SMN2, analysis of this region is challenging. Population-wide SMA screening to quantify the SMN1 copy number (CN) is recommended by the American College of Medical Genetics and Genomics. METHODS: We developed a method that accurately identifies the CN of SMN1 and SMN2 using genome sequencing (GS) data by analyzing read depth and eight informative reference genome differences between SMN1/2. RESULTS: We characterized SMN1/2 in 12,747 genomes, identified 1568 samples with SMN1 gains or losses and 6615 samples with SMN2 gains or losses, and calculated a pan-ethnic carrier frequency of 2%, consistent with previous studies. Additionally, 99.8% of our SMN1 and 99.7% of SMN2 CN calls agreed with orthogonal methods, with a recall of 100% for SMA and 97.8% for carriers, and a precision of 100% for both SMA and carriers. CONCLUSION: This SMN copy-number caller can be used to identify both carrier and affected status of SMA, enabling SMA testing to be offered as a comprehensive test in neonatal care and an accurate carrier screening tool in GS sequencing projects.


Asunto(s)
Atrofia Muscular Espinal , Secuencia de Bases , Niño , Preescolar , Humanos , Atrofia Muscular Espinal/diagnóstico , Atrofia Muscular Espinal/genética , Proteína 1 para la Supervivencia de la Neurona Motora/genética
7.
Genet Med ; 21(5): 1121-1130, 2019 05.
Artículo en Inglés | MEDLINE | ID: mdl-30293986

RESUMEN

PURPOSE: Current diagnostic testing for genetic disorders involves serial use of specialized assays spanning multiple technologies. In principle, genome sequencing (GS) can detect all genomic pathogenic variant types on a single platform. Here we evaluate copy-number variant (CNV) calling as part of a clinically accredited GS test. METHODS: We performed analytical validation of CNV calling on 17 reference samples, compared the sensitivity of GS-based variants with those from a clinical microarray, and set a bound on precision using orthogonal technologies. We developed a protocol for family-based analysis of GS-based CNV calls, and deployed this across a clinical cohort of 79 rare and undiagnosed cases. RESULTS: We found that CNV calls from GS are at least as sensitive as those from microarrays, while only creating a modest increase in the number of variants interpreted (~10 CNVs per case). We identified clinically significant CNVs in 15% of the first 79 cases analyzed, all of which were confirmed by an orthogonal approach. The pipeline also enabled discovery of a uniparental disomy (UPD) and a 50% mosaic trisomy 14. Directed analysis of select CNVs enabled breakpoint level resolution of genomic rearrangements and phasing of de novo CNVs. CONCLUSION: Robust identification of CNVs by GS is possible within a clinical testing environment.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Enfermedades Raras/genética , Enfermedades no Diagnosticadas/genética , Adolescente , Niño , Preescolar , Mapeo Cromosómico/métodos , Estudios de Cohortes , Femenino , Pruebas Genéticas/métodos , Genoma Humano , Genómica/métodos , Humanos , Lactante , Masculino , Enfermedades Raras/diagnóstico , Enfermedades no Diagnosticadas/diagnóstico , Secuenciación Completa del Genoma/métodos , Adulto Joven
8.
Genet Med ; 20(10): 1196-1205, 2018 10.
Artículo en Inglés | MEDLINE | ID: mdl-29388947

RESUMEN

PURPOSE: Fresh-frozen (FF) tissue is the optimal source of DNA for whole-genome sequencing (WGS) of cancer patients. However, it is not always available, limiting the widespread application of WGS in clinical practice. We explored the viability of using formalin-fixed, paraffin-embedded (FFPE) tissues, available routinely for cancer patients, as a source of DNA for clinical WGS. METHODS: We conducted a prospective study using DNAs from matched FF, FFPE, and peripheral blood germ-line specimens collected from 52 cancer patients (156 samples) following routine diagnostic protocols. We compared somatic variants detected in FFPE and matching FF samples. RESULTS: We found the single-nucleotide variant agreement reached 71% across the genome and somatic copy-number alterations (CNAs) detection from FFPE samples was suboptimal (0.44 median correlation with FF) due to nonuniform coverage. CNA detection was improved significantly with lower reverse crosslinking temperature in FFPE DNA extraction (80 °C or 65 °C depending on the methods). Our final data showed somatic variant detection from FFPE for clinical decision making is possible. We detected 98% of clinically actionable variants (including 30/31 CNAs). CONCLUSION: We present the first prospective WGS study of cancer patients using FFPE specimens collected in a routine clinical environment proving WGS can be applied in the clinic.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Genoma Humano/genética , Neoplasias/genética , Secuenciación Completa del Genoma/métodos , Toma de Decisiones , Femenino , Humanos , Masculino , Neoplasias/sangre , Neoplasias/patología , Adhesión en Parafina , Polimorfismo de Nucleótido Simple/genética
10.
Nature ; 463(7278): 191-6, 2010 Jan 14.
Artículo en Inglés | MEDLINE | ID: mdl-20016485

RESUMEN

All cancers carry somatic mutations. A subset of these somatic alterations, termed driver mutations, confer selective growth advantage and are implicated in cancer development, whereas the remainder are passengers. Here we have sequenced the genomes of a malignant melanoma and a lymphoblastoid cell line from the same person, providing the first comprehensive catalogue of somatic mutations from an individual cancer. The catalogue provides remarkable insights into the forces that have shaped this cancer genome. The dominant mutational signature reflects DNA damage due to ultraviolet light exposure, a known risk factor for malignant melanoma, whereas the uneven distribution of mutations across the genome, with a lower prevalence in gene footprints, indicates that DNA repair has been preferentially deployed towards transcribed regions. The results illustrate the power of a cancer genome sequence to reveal traces of the DNA damage, repair, mutation and selection processes that were operative years before the cancer became symptomatic.


Asunto(s)
Genes Relacionados con las Neoplasias/genética , Genoma Humano/genética , Mutación/genética , Neoplasias/genética , Adulto , Línea Celular Tumoral , Daño del ADN/genética , Análisis Mutacional de ADN , Reparación del ADN/genética , Dosificación de Gen/genética , Humanos , Pérdida de Heterocigocidad/genética , Masculino , Melanoma/etiología , Melanoma/genética , MicroARNs/genética , Mutagénesis Insercional/genética , Neoplasias/etiología , Polimorfismo de Nucleótido Simple/genética , Medicina de Precisión , Eliminación de Secuencia/genética , Rayos Ultravioleta
11.
Nature ; 456(7218): 53-9, 2008 Nov 06.
Artículo en Inglés | MEDLINE | ID: mdl-18987734

RESUMEN

DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally used long (400-800 base pair) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re-sequencing, whereby shorter reads are compared to a reference to identify intraspecies genetic variation. Here we report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high-quality sequence. We demonstrate application of this approach to human genome sequencing on flow-sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from >30x average depth of paired 35-base reads. We characterize four million single-nucleotide polymorphisms and four hundred thousand structural variants, many of which were previously unknown. Our approach is effective for accurate, rapid and economical whole-genome re-sequencing and many other biomedical applications.


Asunto(s)
Genoma Humano/genética , Genómica/métodos , Análisis de Secuencia de ADN/métodos , Cromosomas Humanos X/genética , Secuencia de Consenso/genética , Genómica/economía , Genotipo , Humanos , Masculino , Nigeria , Polimorfismo de Nucleótido Simple/genética , Sensibilidad y Especificidad , Análisis de Secuencia de ADN/economía
12.
Haematologica ; 98(9): 1383-7, 2013 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23716552

RESUMEN

The congenital dyserythropoietic anemias are a heterogeneous group of rare disorders primarily affecting erythropoiesis with characteristic morphological abnormalities and a block in erythroid maturation. Mutations in the CDAN1 gene, which encodes Codanin-1, underlie the majority of congenital dyserythropoietic anemia type I cases. However, no likely pathogenic CDAN1 mutation has been detected in approximately 20% of cases, suggesting the presence of at least one other locus. We used whole genome sequencing and segregation analysis to identify a homozygous T to A transversion (c.533T>A), predicted to lead to a p.L178Q missense substitution in C15ORF41, a gene of unknown function, in a consanguineous pedigree of Middle-Eastern origin. Sequencing C15ORF41 in other CDAN1 mutation-negative congenital dyserythropoietic anemia type I pedigrees identified a homozygous transition (c.281A>G), predicted to lead to a p.Y94C substitution, in two further pedigrees of SouthEast Asian origin. The haplotype surrounding the c.281A>G change suggests a founder effect for this mutation in Pakistan. Detailed sequence similarity searches indicate that C15ORF41 encodes a novel restriction endonuclease that is a member of the Holliday junction resolvase family of proteins.


Asunto(s)
Anemia Diseritropoyética Congénita/diagnóstico , Anemia Diseritropoyética Congénita/genética , Glicoproteínas/genética , Homocigoto , Mutación Missense/genética , Endonucleasas/química , Endonucleasas/genética , Femenino , Glicoproteínas/química , Humanos , Masculino , Proteínas Nucleares , Linaje , Valor Predictivo de las Pruebas , Estructura Secundaria de Proteína , Estructura Terciaria de Proteína
13.
Cell Genom ; 3(2): 100258, 2023 Feb 08.
Artículo en Inglés | MEDLINE | ID: mdl-36819666

RESUMEN

Current standards in clinical genetics recognize the need to establish the validity of gene-disease relationships as a first step in the interpretation of sequence variants. We describe our experience incorporating the ClinGen Gene-Disease Clinical Validity framework in our interpretation and reporting workflow for a clinical genome sequencing (cGS) test for individuals with rare and undiagnosed genetic diseases. This "reactive" gene curation is completed upon identification of candidate variants during active case analysis and within the test turn-around time by focusing on the most impactful evidence and taking advantage of the broad applicability of the framework to cover a wide range of disease areas. We demonstrate that reactive gene curation can be successfully implemented in support of cGS in a clinical laboratory environment, enabling robust clinical decision making and allowing all variants to be fully and appropriately considered and their clinical significance confidently interpreted.

14.
Leukemia ; 37(3): 518-528, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36658389

RESUMEN

Childhood B-cell acute lymphoblastic leukaemia (B-ALL) is characterised by recurrent genetic abnormalities that drive risk-directed treatment strategies. Using current techniques, accurate detection of such aberrations can be challenging, due to the rapidly expanding list of key genetic abnormalities. Whole genome sequencing (WGS) has the potential to improve genetic testing, but requires comprehensive validation. We performed WGS on 210 childhood B-ALL samples annotated with clinical and genetic data. We devised a molecular classification system to subtype these patients based on identification of key genetic changes in tumour-normal and tumour-only analyses. This approach detected 294 subtype-defining genetic abnormalities in 96% (202/210) patients. Novel genetic variants, including fusions involving genes in the MAP kinase pathway, were identified. WGS results were concordant with standard-of-care methods and whole transcriptome sequencing (WTS). We expanded the catalogue of genetic profiles that reliably classify PAX5alt and ETV6::RUNX1-like subtypes. Our novel bioinformatic pipeline improved detection of DUX4 rearrangements (DUX4-r): a good-risk B-ALL subtype with high survival rates. Overall, we have validated that WGS provides a standalone, reliable genetic test to detect all subtype-defining genetic abnormalities in B-ALL, accurately classifying patients for the risk-directed treatment stratification, while simultaneously performing as a research tool to identify novel disease biomarkers.


Asunto(s)
Leucemia-Linfoma Linfoblástico de Células Precursoras B , Leucemia-Linfoma Linfoblástico de Células Precursoras , Humanos , Leucemia-Linfoma Linfoblástico de Células Precursoras/tratamiento farmacológico , Leucemia-Linfoma Linfoblástico de Células Precursoras B/diagnóstico , Leucemia-Linfoma Linfoblástico de Células Precursoras B/genética , Biología Computacional , Pruebas Genéticas , Secuenciación Completa del Genoma
15.
Leukemia ; 37(3): 529-538, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36550215

RESUMEN

Incorporating genetics into risk-stratification for treatment of childhood B-progenitor acute lymphoblastic leukaemia (B-ALL) has contributed significantly to improved survival. In about 30% B-ALL (B-other-ALL) without well-established chromosomal changes, new genetic subtypes have recently emerged, yet their true prognostic relevance largely remains unclear. We integrated next generation sequencing (NGS): whole genome sequencing (WGS) (n = 157) and bespoke targeted NGS (t-NGS) (n = 175) (overlap n = 36), with existing genetic annotation in a representative cohort of 351 B-other-ALL patients from the childhood ALL trail, UKALL2003. PAX5alt was most frequently observed (n = 91), whereas PAX5 P80R mutations (n = 11) defined a distinct PAX5 subtype. DUX4-r subtype (n = 80) was defined by DUX4 rearrangements and/or ERG deletions. These patients had a low relapse rate and excellent survival. ETV6::RUNX1-like subtype (n = 21) was characterised by multiple abnormalities of ETV6 and IKZF1, with no reported relapses or deaths, indicating their excellent prognosis in this trial. An inferior outcome for patients with ABL-class fusions (n = 25) was confirmed. Integration of NGS into genomic profiling of B-other-ALL within a single childhood ALL trial, UKALL2003, has shown the added clinical value of NGS-based approaches, through improved accuracy in detection and classification into the range of risk stratifying genetic subtypes, while validating their prognostic significance.


Asunto(s)
Leucemia-Linfoma Linfoblástico de Células Precursoras B , Leucemia-Linfoma Linfoblástico de Células Precursoras , Humanos , Ensayos Clínicos como Asunto , Marcadores Genéticos , Genómica , Recurrencia Local de Neoplasia , Leucemia-Linfoma Linfoblástico de Células Precursoras B/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Pronóstico , Niño
16.
Curr Opin Genet Dev ; 16(6): 545-52, 2006 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-17055251

RESUMEN

DNA sequencing can be used to gain important information on genes, genetic variation and gene function for biological and medical studies. The growing collection of publicly available reference genome sequences will underpin a new era of whole genome re-sequencing, but sequencing costs need to fall and throughput needs to rise by several orders of magnitude. Novel technologies are being developed to meet this need by generating massive amounts of sequence that can be aligned to the reference sequence. The challenge is to maintain the high standards of accuracy and completeness that are hallmarks of the previous genome projects. One or more new sequencing technologies are expected to become the mainstay of future research, and to make DNA sequencing centre stage as a routine tool in genetic research in the coming years.


Asunto(s)
Genómica/métodos , Análisis de Secuencia de ADN/métodos , Genoma Humano , Humanos , Análisis de Secuencia de ADN/economía
17.
Curr Opin Genet Dev ; 16(3): 213-8, 2006 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-16650760

RESUMEN

The sequences of both of the human sex chromosomes and of a substantial part of the chimpanzee Y chromosome have now been determined, and most of the protein-coding genes have been identified. The X chromosome codes for more than 800 proteins but the Y chromosome for only approximately 60, illustrating their very different evolutionary histories since their origin from an autosomal pair approximately 300 million years ago and explaining their differential importance in disease. These sequences have provided the basis for understanding normal patterns of variation, such as the distribution of SNPs, and patterns of linkage disequilibrium. In addition, they have been useful for identifying variants associated with simple Mendelian disorders such as microphthalmia or mental retardation, and more complex disorders such as osteoporosis.


Asunto(s)
Cromosomas Humanos X/genética , Cromosomas Humanos Y/genética , Animales , Secuencia de Bases , Enfermedades Genéticas Congénitas/genética , Variación Genética/genética , Genoma Humano/genética , Humanos
18.
Nature ; 429(6990): 440-5, 2004 May 27.
Artículo en Inglés | MEDLINE | ID: mdl-15164068

RESUMEN

We have the human genome sequence. It is freely available, accurate and nearly complete. But is the genome ready for medicine? The new resource is already changing genetic research strategies to find information of medical value. Now we need high-quality annotation of all the functionally important sequences and the variations within them that contribute to health and disease. To achieve this, we need more genome sequences, systematic experimental analyses, and extensive information on human phenotypes. Flexible and user-friendly access to well-annotated genomes will create an environment for innovation, and the potential for unlimited use of sequencing in biomedical research and practice.


Asunto(s)
Genética Médica/tendencias , Genoma Humano , Genómica/tendencias , Medicina/tendencias , Variación Genética , Humanos
19.
Genome Biol ; 21(1): 102, 2020 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-32345345

RESUMEN

Repeat expansions are responsible for over 40 monogenic disorders, and undoubtedly more pathogenic repeat expansions remain to be discovered. Existing methods for detecting repeat expansions in short-read sequencing data require predefined repeat catalogs. Recent discoveries emphasize the need for methods that do not require pre-specified candidate repeats. To address this need, we introduce ExpansionHunter Denovo, an efficient catalog-free method for genome-wide repeat expansion detection. Analysis of real and simulated data shows that our method can identify large expansions of 41 out of 44 pathogenic repeats, including nine recently reported non-reference repeat expansions not discoverable via existing methods.


Asunto(s)
Expansión de las Repeticiones de ADN , Programas Informáticos , Estudios de Casos y Controles , Síndrome del Cromosoma X Frágil/genética , Ataxia de Friedreich/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Enfermedad de Huntington/genética , Repeticiones de Microsatélite , Distrofia Miotónica/genética , Secuenciación Completa del Genoma
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA