Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Nature ; 611(7936): 519-531, 2022 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-36261518

RESUMO

The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.


Assuntos
Mapeamento Cromossômico , Diploide , Genoma Humano , Genômica , Humanos , Mapeamento Cromossômico/normas , Genoma Humano/genética , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas , Padrões de Referência , Genômica/métodos , Genômica/normas , Cromossomos Humanos/genética , Variação Genética/genética
2.
Nat Commun ; 11(1): 5040, 2020 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-33028839

RESUMO

Bringing together cancer genomes from different projects increases power and allows the investigation of pan-cancer, molecular mechanisms. However, working with whole genomes sequenced over several years in different sequencing centres requires a framework to compare the quality of these sequences. We used the Pan-Cancer Analysis of Whole Genomes cohort as a test case to construct such a framework. This cohort contains whole cancer genomes of 2832 donors from 18 sequencing centres. We developed a non-redundant set of five quality control (QC) measurements to establish a star rating system. These QC measures reflect known differences in sequencing protocol and provide a guide to downstream analyses and allow for exclusion of samples of poor quality. We have found that this is an effective framework of quality measures. The implementation of the framework is available at: https://dockstore.org/containers/quay.io/jwerner_dkfz/pancanqc:1.2.2 .


Assuntos
Genoma Humano/genética , Genômica/normas , Neoplasias/genética , Controle de Qualidade , Mapeamento Cromossômico/normas , Cromossomos Humanos/genética , Análise Mutacional de DNA/normas , Feminino , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Humanos , Masculino , Mutação , Software , Sequenciamento Completo do Genoma/normas
3.
Nat Genet ; 42(9): 745-50, 2010 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-20729854

RESUMO

There is a complex relationship between the evolution of segmental duplications and rearrangements associated with human disease. We performed a detailed analysis of one region on chromosome 16p12.1 associated with neurocognitive disease and identified one of the largest structural inconsistencies in the human reference assembly. Various genomic analyses show that all examined humans are homozygously inverted relative to the reference genome for a 1.1-Mb region on 16p12.1. We determined that this assembly discrepancy stems from two common structural configurations with worldwide frequencies of 17.6% (S1) and 82.4% (S2). This polymorphism arose from the rapid integration of segmental duplications, precipitating two local inversions within the human lineage over the last 10 million years. The two human haplotypes differ by 333 kb of additional duplicated sequence present in S2 but not in S1. Notably, we show that the S2 configuration harbors directly oriented duplications, specifically predisposing this chromosome to disease-associated rearrangement.


Assuntos
Deleção Cromossômica , Transtornos Cromossômicos/genética , Cromossomos Humanos Par 16 , Polimorfismo Genético , Animais , Linhagem Celular Tumoral , Mapeamento Cromossômico/normas , Cromossomos Humanos Par 16/química , Cromossomos Humanos Par 16/genética , Hibridização Genômica Comparativa , Dosagem de Genes , Predisposição Genética para Doença , Genética Populacional , Humanos , Dados de Sequência Molecular , Análise de Sequência com Séries de Oligonucleotídeos , Primatas/genética , Projetos de Pesquisa , Risco
4.
Genome Res ; 14(5): 789-801, 2004 May.
Artigo em Inglês | MEDLINE | ID: mdl-15123579

RESUMO

Limited comparative studies suggest that the human genome is particularly enriched for recent segmental duplications. The extent of segmental duplications in other mammalian genomes is unknown and confounded by methodological differences in genome assembly. Here, we present a detailed analysis of recent duplication content within the mouse genome using a whole-genome assembly comparison method and a novel assembly independent method, designed to take advantage of the reduced allelic variation of the C57BL/6J strain. We conservatively estimate that approximately 57% of all highly identical segmental duplications (>or=90%) were misassembled or collapsed within the working draft WGS assembly. The WGS approach often leaves duplications fragmented and unassigned to a chromosome when compared with the clone-ordered-based approach. Our preliminary analysis suggests that 1.7%-2.0% of the mouse genome is part of recent large segmental duplications (about half of what is observed for the human genome). We have constructed a mouse segmental duplication database to aid in the characterization of these regions and their integration into the final mouse genome assembly. This work suggests significant biological differences in the architecture of recent segmental duplications between human and mouse. In addition, our unique method provides the means for improving whole-genome shotgun sequence assembly of mouse and future mammalian genomes.


Assuntos
Duplicação Gênica , Genoma , Análise de Sequência de DNA/métodos , Animais , Composição de Bases/genética , Células da Medula Óssea/química , Células da Medula Óssea/metabolismo , Núcleo Celular/genética , Mapeamento Cromossômico/métodos , Mapeamento Cromossômico/normas , Mapeamento Cromossômico/estatística & dados numéricos , Biologia Computacional/métodos , DNA/genética , Bases de Dados Genéticas , Genes , Genoma Humano , Humanos , Hibridização in Situ Fluorescente/métodos , Camundongos , Camundongos Endogâmicos C57BL , Alinhamento de Sequência/métodos
5.
Genome Res ; 14(1): 179-87, 2004 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-14672980

RESUMO

Array-based comparative genomic hybridization (aCGH) is a recently developed tool for genome-wide determination of DNA copy number alterations. This technology has tremendous potential for disease-gene discovery in cancer and developmental disorders as well as numerous other applications. However, widespread utilization of a CGH has been limited by the lack of well characterized, high-resolution clone sets optimized for consistent performance in aCGH assays and specifically designed analytic software. We have assembled a set of approximately 4100 publicly available human bacterial artificial chromosome (BAC) clones evenly spaced at approximately 1-Mb resolution across the genome, which includes direct coverage of approximately 400 known cancer genes. This aCGH-optimized clone set was compiled from five existing sets, experimentally refined, and supplemented for higher resolution and enhancing mapping capabilities. This clone set is associated with a public online resource containing detailed clone mapping data, protocols for the construction and use of arrays, and a suite of analytical software tools designed specifically for aCGH analysis. These resources should greatly facilitate the use of aCGH in gene discovery.


Assuntos
Cromossomos Artificiais Bacterianos/genética , Clonagem Molecular/métodos , Genes Neoplásicos/genética , Genoma Humano , Hibridização de Ácido Nucleico/métodos , Linhagem Celular Tumoral , Mapeamento Cromossômico/normas , DNA de Neoplasias/análise , Dosagem de Genes , Humanos , Internet/tendências , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Alinhamento de Sequência/métodos , Software
6.
Genet Epidemiol ; 21(1): 40-52, 2001 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-11443733

RESUMO

The weighted pairwise correlation (WPC) method is a simple and powerful model-free method of linkage analysis that has the advantages of being applicable to binary, ordered categorical, quantitative, or censored traits, and to consider all pairs of relatives in large pedigrees. The originally implemented approach was limited to the use of the identical by state (IBS) information, and we recently extended the WPC method to incorporate the identical by descent (IBD) information for two-point linkage analysis. Here, we develop a multipoint WPC method suitable for pedigrees of arbitrary size and large number of markers. The multipoint IBD estimation procedure for relative pairs is based on the efficient regression approach developed for pedigrees implemented in SOLAR. A robust and fast Monte-Carlo procedure is used to determine reliable P values. Application of the method to the 214 pedigrees from the Breast Cancer Linkage Consortium provided for the Genetic Analysis Workshop (GAW) 9 shows that multipoint WPC statistic values were not far from two-point maximum lod-score values obtained by the classical parametric linkage method and were higher than multipoint variance component analysis lod-scores obtained with SOLAR. The multipoint WPC method is also used to analyze the familial Collaborative Study of the Genetics of Alcoholism data on alcoholism released for GAW11. It allows a better specification of the linkage results previously obtained within the chromosome 4 region.


Assuntos
Alcoolismo/genética , Neoplasias da Mama/genética , Mapeamento Cromossômico/métodos , Interpretação Estatística de Dados , Marcadores Genéticos/genética , Modelos Genéticos , Método de Monte Carlo , Análise de Regressão , Alcoolismo/epidemiologia , Viés , Neoplasias da Mama/epidemiologia , Mapeamento Cromossômico/normas , Feminino , Frequência do Gene/genética , Heterozigoto , Humanos , Escore Lod , Análise por Pareamento , Análise Multivariada , Linhagem
7.
J Cell Biol ; 149(5): 1039-52, 2000 May 29.
Artigo em Inglês | MEDLINE | ID: mdl-10831608

RESUMO

It has been proposed that cytoplasmic peptide:N-glycanase (PNGase) may be involved in the proteasome-dependent quality control machinery used to degrade newly synthesized glycoproteins that do not correctly fold in the ER. However, a lack of information about the structure of the enzyme has limited our ability to obtain insight into its precise biological function. A PNGase-defective mutant (png1-1) was identified by screening a collection of mutagenized strains for the absence of PNGase activity in cell extracts. The PNG1 gene was mapped to the left arm of chromosome XVI by genetic approaches and its open reading frame was identified. PNG1 encodes a soluble protein that, when expressed in Escherichia coli, exhibited PNGase activity. PNG1 may be required for efficient proteasome-mediated degradation of a misfolded glycoprotein. Subcellular localization studies indicate that Png1p is present in the nucleus as well as the cytosol. Sequencing of expressed sequence tag clones revealed that Png1p is highly conserved in a wide variety of eukaryotes including mammals, suggesting that the enzyme has an important function.


Assuntos
Amidoidrolases/genética , Amidoidrolases/metabolismo , Sequência Conservada , Amidoidrolases/análise , Carboxipeptidases/genética , Carboxipeptidases/metabolismo , Catepsina A , Núcleo Celular/enzimologia , Mapeamento Cromossômico/normas , Cromossomos Fúngicos , Clonagem Molecular , Cisteína Endopeptidases/genética , Cisteína Endopeptidases/metabolismo , Citosol/enzimologia , Deleção de Genes , Dados de Sequência Molecular , Complexos Multienzimáticos/genética , Complexos Multienzimáticos/metabolismo , Mutação/genética , Peptídeo-N4-(N-acetil-beta-glucosaminil) Asparagina Amidase , Complexo de Endopeptidases do Proteassoma , Controle de Qualidade , Saccharomyces cerevisiae , Homologia de Sequência de Aminoácidos , Solubilidade , Temperatura
11.
Lancet ; 340(8831): 1330-3, 1992 Nov 28.
Artigo em Inglês | MEDLINE | ID: mdl-1360045

RESUMO

It is now clear that mutations of at least two genetic loci can lead to autosomal dominant polycystic kidney disease (ADPKD). We have compared the clinical features of ADPKD caused by mutations at the PKD1 locus (linked to the alpha-globin complex on chromosome 16) with those of disease not linked to the locus (non-PKD1). We identified 18 families (285 affected members) with mutations at PKD1 and 5 families (49 affected individuals) in which involvement of this locus could be dismissed. Non-PKD1 patients lived longer than PKD1 patients (median survival 71.5 vs 56.0 years), had a lower risk of progressing to renal failure (odds ratio 0.35, 95% CI 0.13-0.92), were less likely to have hypertension (odds ratio adjusted for age and family of origin 0.29, 0.11-0.80), were diagnosed at an older age (median 69.1 vs 44.8 years), and had fewer renal cysts at the time of diagnosis. Although most of the PKD1 families were ascertained through clinics treating patients with renal impairment, no non-PKD1 family was identified through this source. Non-PKD1 ADPKD has a much milder phenotype than that linked to PKD1. Partly as a result of this difference in severity, the reported prevalence of this genotype is probably an underestimate.


Assuntos
Mapeamento Cromossômico/normas , Genótipo , Mutação , Fenótipo , Rim Policístico Autossômico Dominante/genética , Adolescente , Adulto , Fatores Etários , Criança , Pré-Escolar , Diagnóstico Diferencial , Estudos de Avaliação como Assunto , Humanos , Lactente , Meiose , Pessoa de Meia-Idade , Linhagem , Rim Policístico Autossômico Dominante/diagnóstico por imagem , Rim Policístico Autossômico Dominante/epidemiologia , Polimorfismo Genético , Prevalência , Recombinação Genética , Sensibilidade e Especificidade , Taxa de Sobrevida , Ultrassonografia , Vitória/epidemiologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA