Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Más filtros

Banco de datos
Tipo de estudio
País/Región como asunto
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
Nature ; 548(7665): 87-91, 2017 08 03.
Artículo en Inglés | MEDLINE | ID: mdl-28746312

RESUMEN

Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark.


Asunto(s)
Variación Genética/genética , Genética de Población/normas , Genoma Humano/genética , Genómica/normas , Análisis de Secuencia de ADN/normas , Adulto , Alelos , Niño , Cromosomas Humanos Y/genética , Dinamarca , Femenino , Haplotipos/genética , Humanos , Complejo Mayor de Histocompatibilidad/genética , Masculino , Edad Materna , Tasa de Mutación , Edad Paterna , Mutación Puntual/genética , Estándares de Referencia
2.
Genome Res ; 27(9): 1597-1607, 2017 09.
Artículo en Inglés | MEDLINE | ID: mdl-28774965

RESUMEN

Genes in the major histocompatibility complex (MHC, also known as HLA) play a critical role in the immune response and variation within the extended 4-Mb region shows association with major risks of many diseases. Yet, deciphering the underlying causes of these associations is difficult because the MHC is the most polymorphic region of the genome with a complex linkage disequilibrium structure. Here, we reconstruct full MHC haplotypes from de novo assembled trios without relying on a reference genome and perform evolutionary analyses. We report 100 full MHC haplotypes and call a large set of structural variants in the regions for future use in imputation with GWAS data. We also present the first complete analysis of the recombination landscape in the entire region and show how balancing selection at classical genes have linked effects on the frequency of variants throughout the region.


Asunto(s)
Variación Genética/genética , Genética de Población , Desequilibrio de Ligamiento/genética , Complejo Mayor de Histocompatibilidad/genética , Alelos , Mapeo Cromosómico , Dinamarca , Haplotipos/genética , Humanos , Polimorfismo de Nucleótido Simple/genética
3.
Nat Commun ; 6: 5969, 2015 Jan 19.
Artículo en Inglés | MEDLINE | ID: mdl-25597990

RESUMEN

Building a population-specific catalogue of single nucleotide variants (SNVs), indels and structural variants (SVs) with frequencies, termed a national pan-genome, is critical for further advancing clinical and public health genetics in large cohorts. Here we report a Danish pan-genome obtained from sequencing 10 trios to high depth (50 × ). We report 536k novel SNVs and 283k novel short indels from mapping approaches and develop a population-wide de novo assembly approach to identify 132k novel indels larger than 10 nucleotides with low false discovery rates. We identify a higher proportion of indels and SVs than previous efforts showing the merits of high coverage and de novo assembly approaches. In addition, we use trio information to identify de novo mutations and use a probabilistic method to provide direct estimates of 1.27e-8 and 1.5e-9 per nucleotide per generation for SNVs and indels, respectively.


Asunto(s)
Genoma Humano/genética , Algoritmos , Humanos , Tasa de Mutación , Polimorfismo de Nucleótido Simple/genética , Análisis de Secuencia de ADN/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA