Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
Nature ; 453(7191): 56-64, 2008 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-18451855

RESUMO

Genetic variation among individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single nucleotide changes. Here we explore variation on an intermediate scale--particularly insertions, deletions and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1,695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number between individuals. Complete sequencing of 261 structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence map of human structural variation--a standard for genotyping platforms and a prelude to future individual genome sequencing projects.


Assuntos
Variação Genética/genética , Genoma Humano/genética , Mapeamento Físico do Cromossomo , Análise de Sequência de DNA , Inversão Cromossômica/genética , Eucromatina/genética , Deleção de Genes , Geografia , Haplótipos , Humanos , Mutagênese Insercional/genética , Polimorfismo de Nucleotídeo Único/genética , Grupos Raciais/genética , Reprodutibilidade dos Testes
2.
Nat Genet ; 37(7): 727-32, 2005 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-15895083

RESUMO

Inversions, deletions and insertions are important mediators of disease and disease susceptibility. We systematically compared the human genome reference sequence with a second genome (represented by fosmid paired-end sequences) to detect intermediate-sized structural variants >8 kb in length. We identified 297 sites of structural variation: 139 insertions, 102 deletions and 56 inversion breakpoints. Using combined literature, sequence and experimental analyses, we validated 112 of the structural variants, including several that are of biomedical relevance. These data provide a fine-scale structural variation map of the human genome and the requisite sequence precision for subsequent genetic studies of human disease.


Assuntos
Genoma Humano , Instabilidade Genômica , Mutação , Polimorfismo Genético , Pareamento de Bases , Linhagem Celular Tumoral , Biologia Computacional , Humanos , Análise de Sequência com Séries de Oligonucleotídeos , Valores de Referência , Análise de Sequência de DNA
3.
Nature ; 431(7011): 927-30, 2004 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-15496912

RESUMO

Complex eukaryotic genomes are now being sequenced at an accelerated pace primarily using whole-genome shotgun (WGS) sequence assembly approaches. WGS assembly was initially criticized because of its perceived inability to resolve repeat structures within genomes. Here, we quantify the effect of WGS sequence assembly on large, highly similar repeats by comparison of the segmental duplication content of two different human genome assemblies. Our analysis shows that large (> 15 kilobases) and highly identical (> 97%) duplications are not adequately resolved by WGS assembly. This leads to significant reduction in genome length and the loss of genes embedded within duplications. Comparable analyses of mouse genome assemblies confirm that strict WGS sequence assembly will oversimplify our understanding of mammalian genome structure and evolution; a hybrid strategy using a targeted clone-by-clone approach to resolve duplications is proposed.


Assuntos
Duplicação Gênica , Genoma Humano , Genômica/métodos , Mapeamento Físico do Cromossomo/métodos , Análise de Sequência de DNA/métodos , Animais , Cromossomos Humanos/genética , Biologia Computacional/métodos , Genes Duplicados/genética , Humanos , Camundongos , Sensibilidade e Especificidade , Alinhamento de Sequência
4.
PLoS Genet ; 3(4): e63, 2007 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-17447845

RESUMO

The APOBEC3 gene family plays a role in innate cellular immunity inhibiting retroviral infection, hepatitis B virus propagation, and the retrotransposition of endogenous elements. We present a detailed sequence and population genetic analysis of a 29.5-kb common human deletion polymorphism that removes the APOBEC3B gene. We developed a PCR-based genotyping assay, characterized 1,277 human diversity samples, and found that the frequency of the deletion allele varies significantly among major continental groups (global FST = 0.2843). The deletion is rare in Africans and Europeans (frequency of 0.9% and 6%), more common in East Asians and Amerindians (36.9% and 57.7%), and almost fixed in Oceanic populations (92.9%). Despite a worldwide frequency of 22.5%, analysis of data from the International HapMap Project reveals that no single existing tag single nucleotide polymorphism may serve as a surrogate for the deletion variant, emphasizing that without careful analysis its phenotypic impact may be overlooked in association studies. Application of haplotype-based tests for selection revealed potential pitfalls in the direct application of existing methods to the analysis of genomic structural variation. These data emphasize the importance of directly genotyping structural variation in association studies and of accurately resolving variant breakpoints before proceeding with more detailed population-genetic analysis.


Assuntos
Citidina Desaminase/genética , Deleção de Genes , Genética Populacional , Polimorfismo Genético , Frequência do Gene , Genótipo , Geografia , Homozigoto , Humanos , Desequilíbrio de Ligação , Antígenos de Histocompatibilidade Menor , Dados de Sequência Molecular
5.
Nucleic Acids Res ; 33(Web Server issue): W295-8, 2005 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-15980474

RESUMO

MaM is a software tool that processes and manipulates multiple alignments of genomic sequence. MaM computes the exact location of common repeat elements, exons and unique regions within aligned genomics sequences using a variety of user identified programs, databases and/or tables. The program can extract subalignments, corresponding to these various regions of DNA to be analyzed independently or in conjunction with other elements of genomic DNA. Graphical displays further allow an assessment of sequence variation throughout these different regions of the aligned sequence, providing separate displays for their repeat, non-repeat and coding portions of genomic DNA. The program should facilitate the phylogenetic analysis and processing of different portions of genomic sequence as part of large-scale sequencing efforts. MaM source code is freely available for non-commercial use at http://compbio.cs.sfu.ca/MAM.htm; and the web interface WebMaM is hosted at http://atgc.lirmm.fr/mam.


Assuntos
Genômica/métodos , Alinhamento de Sequência/métodos , Software , Éxons , Internet , Filogenia , Sequências Repetitivas de Ácido Nucleico
6.
J Comput Biol ; 11(5): 933-44, 2004.
Artigo em Inglês | MEDLINE | ID: mdl-15700410

RESUMO

Human DNA consists of a large number of tandem repeat sequences. Such sequences are usually called satellites, with the primary example being the centromeric alpha-satellite DNA. The basic repeat unit of the alpha-satellite DNA is a 171 bp monomer. Arbitrary monomer pairs usually have considerable sequence divergence (20-40%). However, with the exception of peripheral alpha-satellite DNA, monomers can be grouped into blocks of k-monomers (4 < or = k < or = 20) between which the divergence rate is much smaller (e.g., 5%). Perhaps the simplest and best understood mechanism for tandem repeat array evolution is unequal crossover. Although it is possible that alpha-satellite sequences developed as a result of subsequent unequal crossovers only, no formal computational framework seems to have been developed to verify this possibility. In this paper, we develop such a framework and report on experiments which imply that pericentromeric alpha-satellite segments (which are devoid of higher order structure) are evolutionarily distinct from the higher order repeat segments. It is likely that the higher order repeats developed independently in distinct regions of the genome and were carried into their current locations through an unknown mechanism of transposition.


Assuntos
Biologia Computacional , Troca Genética , DNA Satélite , Evolução Molecular , Algoritmos , Interpretação Estatística de Dados , Filogenia
7.
Genome Inform ; 13: 93-102, 2002.
Artigo em Inglês | MEDLINE | ID: mdl-14571378

RESUMO

Human DNA consists of a large number of tandem repeat sequences. Such sequences are usually called satellites, with the primary example being the centromeric alpha-satellite DNA. The basic repeat unit of the alpha-satellite DNA is a 171 bp monomer. However, with the exception of peripheral alpha-satellite DNA, monomers can be grouped into blocks of k-monomers (4 < k < 20) between which the divergence rate is much smaller (e.g. 5%). Perhaps the simplest and best understood mechanism for tandem repeat array evolution is the unequal crossover. Although it is possible that the alpha-satellite sequence developed as a result of subsequent unequal crossovers only, no formal computational framework seems to have been developed to verify this possibility. In this paper we develop such a framework and perform experiments which seem to indicate that pericentromeric alpha-satellite segments (which are devoid of higher-order structure) are evolutionarily distinct from the higher-order repeat segments. It is likely that the higher order repeats developed independently in distinct regions of the genome and were carried into their current locations through an unknown mechanism of transposition.


Assuntos
Algoritmos , Biologia Computacional/métodos , Troca Genética , DNA Satélite , Evolução Molecular , Humanos , Filogenia
8.
Nat Genet ; 40(1): 96-101, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18157130

RESUMO

The human genome sequence has been finished to very high standards; however, more than 340 gaps remained when the finished genome was published by the International Human Genome Sequencing Consortium in 2004. Using fosmid resources generated from multiple individuals, we targeted gaps in the euchromatic part of the human genome. Here we report 2,488,842 bp of previously unknown euchromatic sequence, 363,114 bp of which close 26 of 250 euchromatic gaps, or 10%, including two remaining euchromatic gaps on chromosome 19. Eight (30.7%) of the closed gaps were found to be polymorphic. These sequences allow complete annotation of several human genes as well as the assignment of mRNAs. The gap sequences are 2.3-fold enriched in segmentally duplicated sequences compared to the whole genome. Our analysis confirms that not all gaps within 'finished' genomes are recalcitrant to subcloning and suggests that the paired-end-sequenced fosmid libraries could prove to be a rich resource for completion of the human euchromatic genome.


Assuntos
Cromossomos Humanos Par 19 , Genoma Humano , Sequência de Bases , Clonagem Molecular , Eucromatina , Biblioteca Gênica , Vetores Genéticos , Projeto Genoma Humano , Humanos , Dados de Sequência Molecular , Polimorfismo Genético
9.
Genome Res ; 17(2): 249-57, 2007 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-17185643

RESUMO

The gibbon karyotype is known to be extensively rearranged when compared to the human and to the ancestral primate karyotype. By combining a bioinformatics (paired-end sequence analysis) approach and a molecular cytogenetics approach, we have refined the synteny block arrangement of the white-cheeked gibbon (Nomascus leucogenys, NLE) with respect to the human genome. We provide the first detailed clone framework map of the gibbon genome and refine the location of 86 evolutionary breakpoints to <1 Mb resolution. An additional 12 breakpoints, mapping primarily to centromeric and telomeric regions, were mapped to approximately 5 Mb resolution. Our combined FISH and BES analysis indicates that we have effectively subcloned 49 of these breakpoints within NLE gibbon BAC clones, mapped to a median resolution of 79.7 kb. Interestingly, many of the intervals associated with translocations were gene-rich, including some genes associated with normal skeletal development. Comparisons of NLE breakpoints with those of other gibbon species reveal variability in the position, suggesting that chromosomal rearrangement has been a longstanding property of this particular ape lineage. Our data emphasize the synergistic effect of combining computational genomics and cytogenetics and provide a framework for ultimate sequence and assembly of the gibbon genome.


Assuntos
Hylobates/genética , Animais , Evolução Biológica , Mapeamento Cromossômico , Biologia Computacional , Rearranjo Gênico , Humanos , Hibridização in Situ Fluorescente , Cariotipagem , Primatas/genética , Especificidade da Espécie
10.
Genome Res ; 15(10): 1344-56, 2005 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-16169929

RESUMO

Structural changes (deletions, insertions, and inversions) between human and chimpanzee genomes have likely had a significant impact on lineage-specific evolution because of their potential for dramatic and irreversible mutation. The low-quality nature of the current chimpanzee genome assembly precludes the reliable identification of many of these differences. To circumvent this, we applied a method to optimally map chimpanzee fosmid paired-end sequences against the human genome to systematically identify sites of structural variation > or = 12 kb between the two species. Our analysis yielded a total of 651 putative sites of chimpanzee deletion (n = 293), insertions (n = 184), and rearrangements consistent with local inversions between the two genomes (n = 174). We validated a subset (19/23) of insertion and deletions using PCR and Southern blot assays, confirming the accuracy of our method. The events are distributed throughout the genome on all chromosomes but are highly correlated with sites of segmental duplication in human and chimpanzee. These structural variants encompass at least 24 Mb of DNA and overlap with > 245 genes. Seventeen of these genes contain exons missing in the chimpanzee genomic sequence and also show a significant reduction in gene expression in chimpanzee. Compared with the pioneering work of Yunis, Prakash, Dutrillaux, and Lejeune, this analysis expands the number of potential rearrangements between chimpanzees and humans 50-fold. Furthermore, this work prioritizes regions for further finishing in the chimpanzee genome and provides a resource for interrogating functional differences between humans and chimpanzees.


Assuntos
Genoma , Pan troglodytes/genética , Animais , Coleta de Dados , Humanos , Análise de Sequência com Séries de Oligonucleotídeos , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Deleção de Sequência
11.
Genome Res ; 14(4): 493-506, 2004 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-15059990

RESUMO

We assessed the content, structure, and distribution of segmental duplications (> or =90% sequence identity, > or =5 kb length) within the published version of the Rattus norvegicus genome assembly (v.3.1). The overall fraction of duplicated sequence within the rat assembly (2.92%) is greater than that of the mouse (1%-1.2%) but significantly less than that of human ( approximately 5%). Duplications were nonuniformly distributed, occurring predominantly as tandem and tightly clustered intrachromosomal duplications. Regions containing extensive interchromosomal duplications were observed, particularly within subtelomeric and pericentromeric regions. We identified 41 discrete genomic regions greater than 1 Mb in size, termed "duplication blocks." These appear to have been the target of extensive duplication over millions of years of evolution. Gene content within duplicated regions ( approximately 1%) was lower than expected based on the genome representation. Interestingly, sequence contigs lacking chromosome assignment ("the unplaced chromosome") showed a marked enrichment for segmental duplication (45% of 75.2 Mb), indicating that segmental duplications have been problematic for sequence and assembly of the rat genome. Further targeted efforts are required to resolve the organization and complexity of these regions.


Assuntos
Duplicação Gênica , Ratos Endogâmicos BN/genética , Animais , Composição de Bases/genética , Cromossomos/genética , Biologia Computacional/métodos , Biologia Computacional/estatística & dados numéricos , Mapeamento de Sequências Contíguas/métodos , Mapeamento de Sequências Contíguas/estatística & dados numéricos , Conversão Gênica/genética , Genes/genética , Genoma , Ratos
12.
Genome Res ; 13(3): 358-68, 2003 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-12618366

RESUMO

We performed a detailed analysis of both single-nucleotide and large insertion/deletion events based on large-scale comparison of 10.6 Mb of genomic sequence from lemur, baboon, and chimpanzee to human. Using a human genomic reference, optimal global alignments were constructed from large (>50-kb) genomic sequence clones. These alignments were examined for the pattern, frequency, and nature of mutational events. Whereas rates of single-nucleotide substitution remain relatively constant (1-2 x 10(-9) substitutions/site/year), rates of retrotransposition vary radically among different primate lineages. These differences have lead to a 15%-20% expansion of human genome size over the last 50 million years of primate evolution, 90% of it due to new retroposon insertions. Orthologous comparisons with the chimpanzee suggest that the human genome continues to significantly expand due to shifts in retrotransposition activity. Assuming that the primate genome sequence we have sampled is representative, we estimate that human euchromatin has expanded 30 Mb and 550 Mb compared to the primate genomes of chimpanzee and lemur, respectively.


Assuntos
Evolução Molecular , Variação Genética/genética , Genoma Humano , Genoma , Primatas/genética , Animais , Humanos , Lemur/genética , Papio/genética , Mutação Puntual/genética , Pongo pygmaeus/genética , Retroelementos/genética , Alinhamento de Sequência/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA