Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 28
Filtrar
1.
Cell ; 149(4): 912-22, 2012 May 11.
Artigo em Inglês | MEDLINE | ID: mdl-22559943

RESUMO

Gene duplication is an important source of phenotypic change and adaptive evolution. We leverage a haploid hydatidiform mole to identify highly identical sequences missing from the reference genome, confirming that the cortical development gene Slit-Robo Rho GTPase-activating protein 2 (SRGAP2) duplicated three times exclusively in humans. We show that the promoter and first nine exons of SRGAP2 duplicated from 1q32.1 (SRGAP2A) to 1q21.1 (SRGAP2B) ∼3.4 million years ago (mya). Two larger duplications later copied SRGAP2B to chromosome 1p12 (SRGAP2C) and to proximal 1q21.1 (SRGAP2D) ∼2.4 and ∼1 mya, respectively. Sequence and expression analyses show that SRGAP2C is the most likely duplicate to encode a functional protein and is among the most fixed human-specific duplicate genes. Our data suggest a mechanism where incomplete duplication created a novel gene function-antagonizing parental SRGAP2 function-immediately "at birth" 2-3 mya, which is a time corresponding to the transition from Australopithecus to Homo and the beginning of neocortex expansion.


Assuntos
Evolução Molecular , Proteínas Ativadoras de GTPase/genética , Primatas/genética , Duplicações Segmentares Genômicas , Animais , Variações do Número de Cópias de DNA , Feminino , Genética Médica , Humanos , Mola Hidatiforme/genética , Hibridização in Situ Fluorescente , Mamíferos/genética , Dados de Sequência Molecular , Gravidez
3.
Genome Res ; 26(11): 1453-1467, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27803192

RESUMO

Recurrent rearrangements of Chromosome 8p23.1 are associated with congenital heart defects and developmental delay. The complexity of this region has led to inconsistencies in the current reference assembly, confounding studies of genetic variation. Using comparative sequence-based approaches, we generated a high-quality 6.3-Mbp alternate reference assembly of an inverted Chromosome 8p23.1 haplotype. Comparison with nonhuman primates reveals a 746-kbp duplicative transposition and two separate inversion events that arose in the last million years of human evolution. The breakpoints associated with these rearrangements map to an ape-specific interchromosomal core duplicon that clusters at sites of evolutionary inversion (P = 7.8 × 10-5). Refinement of microdeletion breakpoints identifies a subgroup of patients that map to the same interchromosomal core involved in the evolutionary formation of the duplication blocks. Our results define a higher-order genomic instability element that has shaped the structure of specific chromosomes during primate evolution contributing to rearrangements associated with inversion and disease.


Assuntos
Evolução Molecular , Predisposição Genética para Doença , Instabilidade Genômica , Duplicações Segmentares Genômicas , Animais , Pontos de Quebra do Cromossomo , Deleção Cromossômica , Cromossomos Humanos Par 8/genética , Humanos , Primatas/genética
4.
Nature ; 483(7388): 169-75, 2012 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-22398555

RESUMO

Gorillas are humans' closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human-chimpanzee and human-chimpanzee-gorilla speciation events at approximately 6 and 10 million years ago. In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution.


Assuntos
Evolução Molecular , Especiação Genética , Genoma/genética , Gorilla gorilla/genética , Animais , Feminino , Regulação da Expressão Gênica , Variação Genética/genética , Genômica , Humanos , Macaca mulatta/genética , Dados de Sequência Molecular , Pan troglodytes/genética , Filogenia , Pongo/genética , Proteínas/genética , Alinhamento de Sequência , Especificidade da Espécie , Transcrição Gênica
5.
Genome Res ; 24(4): 688-96, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24418700

RESUMO

Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger shotgun sequencing of clone inserts, however, has now been largely abandoned, leaving most of these regions unresolved in newer genome assemblies generated primarily by next-generation sequencing hybrid approaches. Here we show that it is possible to resolve regions that are complex in a genome-wide context but simple in isolation for a fraction of the time and cost of traditional methods using long-read single molecule, real-time (SMRT) sequencing and assembly technology from Pacific Biosciences (PacBio). We sequenced and assembled BAC clones corresponding to a 1.3-Mbp complex region of chromosome 17q21.31, demonstrating 99.994% identity to Sanger assemblies of the same clones. We targeted 44 differences using Illumina sequencing and find that PacBio and Sanger assemblies share a comparable number of validated variants, albeit with different sequence context biases. Finally, we targeted a poorly assembled 766-kbp duplicated region of the chimpanzee genome and resolved the structure and organization for a fraction of the cost and time of traditional finishing approaches. Our data suggest a straightforward path for upgrading genomes to a higher quality finished state.


Assuntos
Cromossomos Humanos Par 17/genética , Genoma Bacteriano/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Animais , Cromossomos Artificiais Bacterianos/genética , Humanos , Camundongos , Dados de Sequência Molecular , Pan troglodytes/genética
6.
Nature ; 469(7331): 529-33, 2011 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-21270892

RESUMO

'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.


Assuntos
Variação Genética , Genoma/genética , Pongo abelii/genética , Pongo pygmaeus/genética , Animais , Centrômero/genética , Cerebrosídeos/metabolismo , Cromossomos , Evolução Molecular , Feminino , Rearranjo Gênico/genética , Especiação Genética , Genética Populacional , Humanos , Masculino , Filogenia , Densidade Demográfica , Dinâmica Populacional , Especificidade da Espécie
7.
Nat Genet ; 40(9): 1076-83, 2008 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-19165922

RESUMO

Using comparative sequencing approaches, we investigated the evolutionary history of the European-enriched 17q21.31 MAPT inversion polymorphism. We present a detailed, BAC-based sequence assembly of the inverted human H2 haplotype and compare it to the sequence structure and genetic variation of the corresponding 1.5-Mb region for the noninverted H1 human haplotype and that of chimpanzee and orangutan. We found that inversion of the MAPT region is similarly polymorphic in other great ape species, and we present evidence that the inversions occurred independently in chimpanzees and humans. In humans, the inversion breakpoints correspond to core duplications with the LRRC37 gene family. Our analysis favors the H2 configuration and sequence haplotype as the likely great ape and human ancestral state, with inversion recurrences during primate evolution. We show that the H2 architecture has evolved more extensive sequence homology, perhaps explaining its tendency to undergo microdeletion associated with mental retardation in European populations.


Assuntos
Inversão Cromossômica , Cromossomos Humanos Par 17 , Evolução Molecular , Polimorfismo Genético , Proteínas tau/genética , Animais , Sequência de Bases , Duplicação Gênica , Humanos , Modelos Biológicos , Dados de Sequência Molecular , Pan troglodytes/genética , Filogenia , Pongo pygmaeus/genética , Análise de Sequência de DNA
8.
Am J Hum Genet ; 92(4): 530-46, 2013 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-23541343

RESUMO

The immunoglobulin heavy-chain locus (IGH) encodes variable (IGHV), diversity (IGHD), joining (IGHJ), and constant (IGHC) genes and is responsible for antibody heavy-chain biosynthesis, which is vital to the adaptive immune response. Programmed V-(D)-J somatic rearrangement and the complex duplicated nature of the locus have impeded attempts to reconcile its genomic organization based on traditional B-lymphocyte derived genetic material. As a result, sequence descriptions of germline variation within IGHV are lacking, haplotype inference using traditional linkage disequilibrium methods has been difficult, and the human genome reference assembly is missing several expressed IGHV genes. By using a hydatidiform mole BAC clone resource, we present the most complete haplotype of IGHV, IGHD, and IGHJ gene regions derived from a single chromosome, representing an alternate assembly of ∼1 Mbp of high-quality finished sequence. From this we add 101 kbp of previously uncharacterized sequence, including functional IGHV genes, and characterize four large germline copy-number variants (CNVs). In addition to this germline reference, we identify and characterize eight CNV-containing haplotypes from a panel of nine diploid genomes of diverse ethnic origin, discovering previously unmapped IGHV genes and an additional 121 kbp of insertion sequence. We genotype four of these CNVs by using PCR in 425 individuals from nine human populations. We find that all four are highly polymorphic and show considerable evidence of stratification (Fst = 0.3-0.5), with the greatest differences observed between African and Asian populations. These CNVs exhibit weak linkage disequilibrium with SNPs from two commercial arrays in most of the populations tested.


Assuntos
Variações do Número de Cópias de DNA/genética , Fusão Gênica/genética , Genes de Cadeia Pesada de Imunoglobulina , Haplótipos/genética , Mola Hidatiforme/genética , Cadeias Pesadas de Imunoglobulinas/genética , Região Variável de Imunoglobulina/genética , Alelos , Cromossomos Artificiais Bacterianos , Feminino , Genética Populacional , Genótipo , Humanos , Dados de Sequência Molecular , Gravidez , Análise de Sequência de DNA , Recombinação V(D)J
9.
Am J Hum Genet ; 92(1): 28-40, 2013 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-23261299

RESUMO

Reduced FCGR3B copy number is associated with increased risk of systemic lupus erythematosus (SLE). The five FCGR2/FCGR3 genes are arranged across two highly paralogous genomic segments on chromosome 1q23. Previous studies have suggested mechanisms for structural rearrangements at the FCGR2/FCGR3 locus and have proposed mechanisms whereby altered FCGR3B copy number predisposes to autoimmunity, but the high degree of sequence similarity between paralogous segments has prevented precise definition of the molecular events and their functional consequences. To pursue the genomic pathology associated with FCGR3B copy-number variation, we integrated sequencing data from fosmid and bacterial artificial chromosome clones and sequence-captured DNA from FCGR3B-deleted genomes to establish a detailed map of allelic and paralogous sequence variation across the FCGR2/FCGR3 locus. This analysis identified two highly paralogous 24.5 kb blocks within the FCGR2C/FCGR3B/FCGR2B locus that are devoid of nonpolymorphic paralogous sequence variations and that define the limits of the genomic regions in which nonallelic homologous recombination leads to FCGR2C/FCGR3B copy-number variation. Further, the data showed evidence of swapping of haplotype blocks between these highly paralogous blocks that most likely arose from sequential ancestral recombination events across the region. Functionally, we found by flow cytometry, immunoblotting and cDNA sequencing that individuals with FCGR3B-deleted alleles show ectopic presence of FcγRIIb on natural killer (NK) cells. We conclude that FCGR3B deletion juxtaposes the 5'-regulatory sequences of FCGR2C with the coding sequence of FCGR2B, creating a chimeric gene that results in an ectopic accumulation of FcγRIIb on NK cells and provides an explanation for SLE risk associated with reduced FCGR3B gene copy number.


Assuntos
Variações do Número de Cópias de DNA , Lúpus Eritematoso Sistêmico/genética , Receptores de IgG/genética , Mapeamento Cromossômico , Proteínas Ligadas por GPI/genética , Deleção de Genes , Predisposição Genética para Doença , Humanos , Células Matadoras Naturais/metabolismo , Polimorfismo de Nucleotídeo Único
10.
Nature ; 463(7280): 536-9, 2010 Jan 28.
Artigo em Inglês | MEDLINE | ID: mdl-20072128

RESUMO

The human Y chromosome began to evolve from an autosome hundreds of millions of years ago, acquiring a sex-determining function and undergoing a series of inversions that suppressed crossing over with the X chromosome. Little is known about the recent evolution of the Y chromosome because only the human Y chromosome has been fully sequenced. Prevailing theories hold that Y chromosomes evolve by gene loss, the pace of which slows over time, eventually leading to a paucity of genes, and stasis. These theories have been buttressed by partial sequence data from newly emergent plant and animal Y chromosomes, but they have not been tested in older, highly evolved Y chromosomes such as that of humans. Here we finished sequencing of the male-specific region of the Y chromosome (MSY) in our closest living relative, the chimpanzee, achieving levels of accuracy and completion previously reached for the human MSY. By comparing the MSYs of the two species we show that they differ radically in sequence structure and gene content, indicating rapid evolution during the past 6 million years. The chimpanzee MSY contains twice as many massive palindromes as the human MSY, yet it has lost large fractions of the MSY protein-coding genes and gene families present in the last common ancestor. We suggest that the extraordinary divergence of the chimpanzee and human MSYs was driven by four synergistic factors: the prominent role of the MSY in sperm production, 'genetic hitchhiking' effects in the absence of meiotic crossing over, frequent ectopic recombination within the MSY, and species differences in mating behaviour. Although genetic decay may be the principal dynamic in the evolution of newly emergent Y chromosomes, wholesale renovation is the paramount theme in the continuing evolution of chimpanzee, human and perhaps other older MSYs.


Assuntos
Cromossomos Humanos Y/genética , Genes/genética , Conformação de Ácido Nucleico , Pan troglodytes/genética , Cromossomo Y/genética , Animais , Cromossomos Humanos Par 21/genética , DNA/química , DNA/genética , Humanos , Masculino , Dados de Sequência Molecular , Homologia de Sequência do Ácido Nucleico
11.
Genes Immun ; 16(1): 24-34, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25338678

RESUMO

Germline variation at immunoglobulin (IG) loci is critical for pathogen-mediated immunity, but establishing complete haplotype sequences in these regions has been problematic because of complex sequence architecture and diploid source DNA. We sequenced BAC clones from the effectively haploid human hydatidiform mole cell line, CHM1htert, across the light chain IG loci, kappa (IGK) and lambda (IGL), creating single haplotype representations of these regions. The IGL haplotype generated here is 1.25 Mb of contiguous sequence, including four novel IGLV alleles, one novel IGLC allele, and an 11.9-kb insertion. The CH17 IGK haplotype consists of two 644 kb proximal and 466 kb distal contigs separated by a large gap of unknown size; these assemblies added 49 kb of unique sequence extending into this gap. Our analysis also resulted in the characterization of seven novel IGKV alleles and a 16.7-kb region exhibiting signatures of interlocus sequence exchange between distal and proximal IGKV gene clusters. Genetic diversity in IGK/IGL was compared with that of the IG heavy chain (IGH) locus within the same haploid genome, revealing threefold (IGK) and sixfold (IGL) higher diversity in the IGH locus, potentially associated with increased levels of segmental duplication and the telomeric location of IGH.


Assuntos
Genes de Cadeia Leve de Imunoglobulina , Mola Hidatiforme/genética , Linhagem Celular Tumoral , Cromossomos Artificiais Bacterianos , Feminino , Genes de Cadeia Pesada de Imunoglobulina , Humanos , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Gravidez
12.
Am J Hum Genet ; 90(4): 599-613, 2012 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-22482802

RESUMO

Recurrent deletions have been associated with numerous diseases and genomic disorders. Few, however, have been resolved at the molecular level because their breakpoints often occur in highly copy-number-polymorphic duplicated sequences. We present an approach that uses a combination of somatic cell hybrids, array comparative genomic hybridization, and the specificity of next-generation sequencing to determine breakpoints that occur within segmental duplications. Applying our technique to the 17q21.31 microdeletion syndrome, we used genome sequencing to determine copy-number-variant breakpoints in three deletion-bearing individuals with molecular resolution. For two cases, we observed breakpoints consistent with nonallelic homologous recombination involving only H2 chromosomal haplotypes, as expected. Molecular resolution revealed that the breakpoints occurred at different locations within a 145 kbp segment of >99% identity and disrupt KANSL1 (previously known as KANSL1). In the remaining case, we found that unequal crossover occurred interchromosomally between the H1 and H2 haplotypes and that this event was mediated by a homologous sequence that was once again missing from the human reference. Interestingly, the breakpoints mapped preferentially to gaps in the current reference genome assembly, which we resolved in this study. Our method provides a strategy for the identification of breakpoints within complex regions of the genome harboring high-identity and copy-number-polymorphic segmental duplication. The approach should become particularly useful as high-quality alternate reference sequences become available and genome sequencing of individuals' DNA becomes more routine.


Assuntos
Pontos de Quebra do Cromossomo , Cromossomos Humanos Par 17/genética , Análise de Sequência de DNA/métodos , Sequência de Bases , Deleção Cromossômica , Hibridização Genômica Comparativa/métodos , Variações do Número de Cópias de DNA , Haplótipos , Recombinação Homóloga , Humanos , Dados de Sequência Molecular , Duplicações Segmentares Genômicas , Síndrome de Smith-Magenis
14.
Genome Res ; 22(6): 1036-49, 2012 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-22419167

RESUMO

Chimpanzee and gorilla chromosomes differ from human chromosomes by the presence of large blocks of subterminal heterochromatin thought to be composed primarily of arrays of tandem satellite sequence. We explore their sequence composition and organization and show a complex organization composed of specific sets of segmental duplications that have hyperexpanded in concert with the formation of subterminal satellites. These regions are highly copy number polymorphic between and within species, and copy number differences involving hundreds of copies can be accurately estimated by assaying read-depth of next-generation sequencing data sets. Phylogenetic and comparative genomic analyses suggest that the structures have arisen largely independently in the two lineages with the exception of a few seed sequences present in the common ancestor of humans and African apes. We propose a model where an ancestral human-chimpanzee pericentric inversion and the ancestral chromosome 2 fusion both predisposed and protected the chimpanzee and human genomes, respectively, to the formation of subtelomeric heterochromatin. Our findings highlight the complex interplay between duplicated sequences and chromosomal rearrangements that rapidly alter the cytogenetic landscape in a short period of evolutionary time.


Assuntos
Cromossomos Humanos Par 2 , Evolução Molecular , Heterocromatina/genética , Hominidae/genética , Modelos Genéticos , Sequência de Aminoácidos , Animais , Análise Citogenética , DNA Satélite , Duplicação Gênica , Gorilla gorilla/genética , Humanos , Dados de Sequência Molecular , Pan troglodytes/genética , Filogenia , Telômero/genética
15.
Nature ; 457(7231): 877-81, 2009 Feb 12.
Artigo em Inglês | MEDLINE | ID: mdl-19212409

RESUMO

It is generally accepted that the extent of phenotypic change between human and great apes is dissonant with the rate of molecular change. Between these two groups, proteins are virtually identical, cytogenetically there are few rearrangements that distinguish ape-human chromosomes, and rates of single-base-pair change and retrotransposon activity have slowed particularly within hominid lineages when compared to rodents or monkeys. Studies of gene family evolution indicate that gene loss and gain are enriched within the primate lineage. Here, we perform a systematic analysis of duplication content of four primate genomes (macaque, orang-utan, chimpanzee and human) in an effort to understand the pattern and rates of genomic duplication during hominid evolution. We find that the ancestral branch leading to human and African great apes shows the most significant increase in duplication activity both in terms of base pairs and in terms of events. This duplication acceleration within the ancestral species is significant when compared to lineage-specific rate estimates even after accounting for copy-number polymorphism and homoplasy. We discover striking examples of recurrent and independent gene-containing duplications within the gorilla and chimpanzee that are absent in the human lineage. Our results suggest that the evolutionary properties of copy-number mutation differ significantly from other forms of genetic mutation and, in contrast to the hominid slowdown of single-base-pair mutations, there has been a genomic burst of duplication activity at this period during human evolution.


Assuntos
Catarrinos/genética , Evolução Molecular , Duplicação Gênica , Genoma/genética , África , Animais , Catarrinos/classificação , Mapeamento Cromossômico , Humanos , Polimorfismo Genético , Reprodutibilidade dos Testes
16.
Genome Res ; 21(10): 1640-9, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-21685127

RESUMO

Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a western lowland gorilla and integrating these data into a physical and cytogenetic framework of structural variation. We discovered and validated over 7665 structural changes within the gorilla lineage, including sequence resolution of inversions, deletions, duplications, and mobile element insertions. A comparison with human and other ape genomes shows that the gorilla genome has been subjected to the highest rate of segmental duplication. We show that both the gorilla and chimpanzee genomes have experienced independent yet convergent patterns of structural mutation that have not occurred in humans, including the formation of subtelomeric heterochromatic caps, the hyperexpansion of segmental duplications, and bursts of retroviral integrations. Our analysis suggests that the chimpanzee and gorilla genomes are structurally more derived than either orangutan or human genomes.


Assuntos
Evolução Molecular , Variação Estrutural do Genoma , Gorilla gorilla/genética , Pan troglodytes/genética , Animais , Sequência de Bases , Mapeamento Cromossômico , Estruturas Cromossômicas , Hibridização Genômica Comparativa , Humanos , Hibridização in Situ Fluorescente , Cariótipo , Dados de Sequência Molecular , Duplicações Segmentares Genômicas , Análise de Sequência de DNA
17.
PLoS Genet ; 5(11): e1000715, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19936061

RESUMO

Maize is a major cereal crop and an important model system for basic biological research. Knowledge gained from maize research can also be used to genetically improve its grass relatives such as sorghum, wheat, and rice. The primary objective of the Maize Genome Sequencing Consortium (MGSC) was to generate a reference genome sequence that was integrated with both the physical and genetic maps. Using a previously published integrated genetic and physical map, combined with in-coming maize genomic sequence, new sequence-based genetic markers, and an optical map, we dynamically picked a minimum tiling path (MTP) of 16,910 bacterial artificial chromosome (BAC) and fosmid clones that were used by the MGSC to sequence the maize genome. The final MTP resulted in a significantly improved physical map that reduced the number of contigs from 721 to 435, incorporated a total of 8,315 mapped markers, and ordered and oriented the majority of FPC contigs. The new integrated physical and genetic map covered 2,120 Mb (93%) of the 2,300-Mb genome, of which 405 contigs were anchored to the genetic map, totaling 2,103.4 Mb (99.2% of the 2,120 Mb physical map). More importantly, 336 contigs, comprising 94.0% of the physical map ( approximately 1,993 Mb), were ordered and oriented. Finally we used all available physical, sequence, genetic, and optical data to generate a golden path (AGP) of chromosome-based pseudomolecules, herein referred to as the B73 Reference Genome Sequence version 1 (B73 RefGen_v1).


Assuntos
Genoma de Planta/genética , Zea mays/genética , Algoritmos , Sequência de Bases , Cromossomos Artificiais Bacterianos/genética , Cromossomos de Plantas/genética , Clonagem Molecular , Mapeamento de Sequências Contíguas , Marcadores Genéticos , Dados de Sequência Molecular , Fenômenos Ópticos , Mapeamento Físico do Cromossomo , Análise de Sequência de DNA , Homologia de Sequência do Ácido Nucleico
18.
PLoS Genet ; 5(11): e1000728, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19936048

RESUMO

Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences. Here, we sequenced and carefully annotated a contiguous 22 Mb region of maize chromosome 4 using an improved pseudomolecule for annotation. The sequence segment was comprehensively ordered, oriented, and confirmed using the maize optical map. Nearly 84% of the sequence is composed of transposable elements (TEs) that are mostly nested within each other, of which most families are low-copy. We identified 544 gene models using multiple levels of evidence, as well as five miRNA genes. Gene fragments, many captured by TEs, are prevalent within this region. Elimination of gene redundancy from a tetraploid maize ancestor that originated a few million years ago is responsible in this region for most disruptions of synteny with sorghum and rice. Consistent with other sub-genomic analyses in maize, small RNA mapping showed that many small RNAs match TEs and that most TEs match small RNAs. These results, performed on approximately 1% of the maize genome, demonstrate the feasibility of refining the B73 RefGen_v1 genome assembly by incorporating optical map, high-resolution genetic map, and comparative genomic data sets. Such improvements, along with those of gene and repeat annotation, will serve to promote future functional genomic and phylogenomic research in maize and other grasses.


Assuntos
Pareamento de Bases/genética , Genoma de Planta/genética , Zea mays/genética , Sequência de Bases , Cromossomos de Plantas/genética , Elementos de DNA Transponíveis/genética , Evolução Molecular , Duplicação Gênica , Rearranjo Gênico/genética , Genes de Plantas , Loci Gênicos/genética , Dados de Sequência Molecular , Mutação/genética , Fases de Leitura Aberta/genética , Oryza/genética , Mapeamento Físico do Cromossomo , RNA de Plantas/genética , Homologia de Sequência do Ácido Nucleico , Sorghum/genética , Sintenia/genética
19.
Nature ; 434(7034): 724-31, 2005 Apr 07.
Artigo em Inglês | MEDLINE | ID: mdl-15815621

RESUMO

Human chromosome 2 is unique to the human lineage in being the product of a head-to-head fusion of two intermediate-sized ancestral chromosomes. Chromosome 4 has received attention primarily related to the search for the Huntington's disease gene, but also for genes associated with Wolf-Hirschhorn syndrome, polycystic kidney disease and a form of muscular dystrophy. Here we present approximately 237 million base pairs of sequence for chromosome 2, and 186 million base pairs for chromosome 4, representing more than 99.6% of their euchromatic sequences. Our initial analyses have identified 1,346 protein-coding genes and 1,239 pseudogenes on chromosome 2, and 796 protein-coding genes and 778 pseudogenes on chromosome 4. Extensive analyses confirm the underlying construction of the sequence, and expand our understanding of the structure and evolution of mammalian chromosomes, including gene deserts, segmental duplications and highly variant regions.


Assuntos
Cromossomos Humanos Par 2/genética , Cromossomos Humanos Par 4/genética , Animais , Composição de Bases , Sequência de Bases , Centrômero/genética , Sequência Conservada/genética , Ilhas de CpG/genética , Eucromatina/genética , Etiquetas de Sequências Expressas , Duplicação Gênica , Variação Genética/genética , Genômica , Humanos , Dados de Sequência Molecular , Mapeamento Físico do Cromossomo , Polimorfismo Genético/genética , Primatas/genética , Proteínas/genética , Pseudogenes/genética , RNA Mensageiro/análise , RNA Mensageiro/genética , RNA não Traduzido/análise , RNA não Traduzido/genética , Recombinação Genética/genética , Análise de Sequência de DNA
20.
Nature ; 432(7018): 761-4, 2004 Dec 09.
Artigo em Inglês | MEDLINE | ID: mdl-15592415

RESUMO

Strategies for assembling large, complex genomes have evolved to include a combination of whole-genome shotgun sequencing and hierarchal map-assisted sequencing. Whole-genome maps of all types can aid genome assemblies, generally starting with low-resolution cytogenetic maps and ending with the highest resolution of sequence. Fingerprint clone maps are based upon complete restriction enzyme digests of clones representative of the target genome, and ultimately comprise a near-contiguous path of clones across the genome. Such clone-based maps are used to validate sequence assembly order, supply long-range linking information for assembled sequences, anchor sequences to the genetic map and provide templates for closing gaps. Fingerprint maps are also a critical resource for subsequent functional genomic studies, because they provide a redundant and ordered sampling of the genome with clones. In an accompanying paper we describe the draft genome sequence of the chicken, Gallus gallus, the first species sequenced that is both a model organism and a global food source. Here we present a clone-based physical map of the chicken genome at 20-fold coverage, containing 260 contigs of overlapping clones. This map represents approximately 91% of the chicken genome and enables identification of chicken clones aligned to positions in other sequenced genomes.


Assuntos
Galinhas/genética , Genoma , Genômica , Mapeamento Físico do Cromossomo , Animais , Cromossomos Artificiais Bacterianos/genética , Clonagem Molecular , Mapeamento de Sequências Contíguas , Impressões Digitais de DNA , Ligação Genética/genética , Sitios de Sequências Rotuladas
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa