Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
1.
Genome Res ; 21(10): 1672-85, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-21813624

RESUMO

Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, "Max Planck One" (MP1), by fosmid pool-based next generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ~1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying "haploid landscapes," which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for "phase-sensitive" personal genomics. MP1's annotated haploid genomes are available as a public resource.


Assuntos
Genoma Humano , Haplótipos , Feminino , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação INDEL , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA
2.
Nucleic Acids Res ; 40(5): 2041-53, 2012 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-22102577

RESUMO

Determining the underlying haplotypes of individual human genomes is an essential, but currently difficult, step toward a complete understanding of genome function. Fosmid pool-based next-generation sequencing allows genome-wide generation of 40-kb haploid DNA segments, which can be phased into contiguous molecular haplotypes computationally by Single Individual Haplotyping (SIH). Many SIH algorithms have been proposed, but the accuracy of such methods has been difficult to assess due to the lack of real benchmark data. To address this problem, we generated whole genome fosmid sequence data from a HapMap trio child, NA12878, for which reliable haplotypes have already been produced. We assembled haplotypes using eight algorithms for SIH and carried out direct comparisons of their accuracy, completeness and efficiency. Our comparisons indicate that fosmid-based haplotyping can deliver highly accurate results even at low coverage and that our SIH algorithm, ReFHap, is able to efficiently produce high-quality haplotypes. We expanded the haplotypes for NA12878 by combining the current haplotypes with our fosmid-based haplotypes, producing near-to-complete new gold-standard haplotypes containing almost 98% of heterozygous SNPs. This improvement includes notable fractions of disease-related and GWA SNPs. Integrated with other molecular biological data sets, this phase information will advance the emerging field of diploid genomics.


Assuntos
Genoma Humano , Projeto HapMap , Haplótipos , Análise de Sequência de DNA , Algoritmos , Genômica/normas , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA/normas
3.
Mech Dev ; 121(7-8): 903-13, 2004 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-15210195

RESUMO

In order to realize the full potential of the medaka as a model system for developmental biology and genetics, characterized genomic resources need to be established, culminating in the sequence of the medaka genome. To facilitate the map-based cloning of genes underlying induced mutations and to provide templates for clone-based genomic sequencing, we have created a first-generation physical map of the medaka genome in bacterial artificial chromosome (BAC) clones. In particular, we exploited the synteny to the closely related genome of the pufferfish, Takifugu rubripes, by marker content mapping. As a first step, we clustered 103,144 public medaka EST sequences to obtain a set of 21,121 non-redundant sequence entities. Avoiding oversampling of gene-dense regions, 11,254 of EST clusters were successfully matched against the draft sequence of the fugu genome, and 2363 genes were selected for the BAC map project. We designed 35mer oligonucleotide probes from the selected genes and hybridized them against 64,500 BAC clones of strains Cab and Hd-rR, representing 14-fold coverage of the medaka genome. Our data set is further supplemented with 437 results generated from PCR-amplified inserts of medaka cDNA clones and BAC end-fragment markers. Our current, edited, first generation medaka BAC map consists of 902 map segments that cover about 74% of the medaka genome. The map contains 2721 markers. Of these, 2534 are from expressed sequences, equivalent to a non-redundant set of 2328 loci. The 934 markers (724 different) are anchored to the medaka genetic map. Thus, genetic map assignments provide immediate access to underlying clones and contigs, simplifying molecular access to candidate gene regions and their characterization.


Assuntos
Cromossomos Artificiais Bacterianos , Genoma , Oryzias/genética , Mapeamento Físico do Cromossomo , Animais , Biologia Computacional , Sondas de DNA , Marcadores Genéticos , Oryzias/embriologia
4.
Nat Commun ; 5: 5569, 2014 Nov 26.
Artigo em Inglês | MEDLINE | ID: mdl-25424553

RESUMO

To fully understand human biology and link genotype to phenotype, the phase of DNA variants must be known. Here we present a comprehensive analysis of haplotype-resolved genomes to assess the nature and variation of haplotypes and their pairs, diplotypes, in European population samples. We use a set of 14 haplotype-resolved genomes generated by fosmid clone-based sequencing, complemented and expanded by up to 372 statistically resolved genomes from the 1000 Genomes Project. We find immense diversity of both haploid and diploid gene forms, up to 4.1 and 3.9 million corresponding to 249 and 235 per gene on average. Less than 15% of autosomal genes have a predominant form. We describe a 'common diplotypic proteome', a set of 4,269 genes encoding two different proteins in over 30% of genomes. We show moreover an abundance of cis configurations of mutations in the 386 genomes with an average cis/trans ratio of 60:40, and distinguishable classes of cis- versus trans-abundant genes. This work identifies key features characterizing the diplotypic nature of human genomes and provides a conceptual and analytical framework, rich resources and novel hypotheses on the functional importance of diploidy.


Assuntos
Diploide , Genoma Humano , Proteínas/genética , Estudos de Coortes , Variação Genética , Genética Populacional , Alemanha , Haplótipos , Humanos , Mutação , Polimorfismo de Nucleotídeo Único , População Branca/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA