Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
1.
Cell ; 155(1): 27-38, 2013 Sep 26.
Artigo em Inglês | MEDLINE | ID: mdl-24074859

RESUMO

Genomics is a relatively new scientific discipline, having DNA sequencing as its core technology. As technology has improved the cost and scale of genome characterization over sequencing's 40-year history, the scope of inquiry has commensurately broadened. Massively parallel sequencing has proven revolutionary, shifting the paradigm of genomics to address biological questions at a genome-wide scale. Sequencing now empowers clinical diagnostics and other aspects of medical care, including disease risk, therapeutic identification, and prenatal testing. This Review explores the current state of genomics in the massively parallel sequencing era.


Assuntos
Genômica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Animais , Doença/genética , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala/história , História do Século XX , História do Século XXI , Humanos , Mutação , Análise de Sequência de DNA/história
2.
Nature ; 572(7769): 323-328, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-31367044

RESUMO

Exome-sequencing studies have generally been underpowered to identify deleterious alleles with a large effect on complex traits as such alleles are mostly rare. Because the population of northern and eastern Finland has expanded considerably and in isolation following a series of bottlenecks, individuals of these populations have numerous deleterious alleles at a relatively high frequency. Here, using exome sequencing of nearly 20,000 individuals from these regions, we investigate the role of rare coding variants in clinically relevant quantitative cardiometabolic traits. Exome-wide association studies for 64 quantitative traits identified 26 newly associated deleterious alleles. Of these 26 alleles, 19 are either unique to or more than 20 times more frequent in Finnish individuals than in other Europeans and show geographical clustering comparable to Mendelian disease mutations that are characteristic of the Finnish population. We estimate that sequencing studies of populations without this unique history would require hundreds of thousands to millions of participants to achieve comparable association power.


Assuntos
Sequenciamento do Exoma , Estudos de Associação Genética/métodos , Predisposição Genética para Doença/genética , Variação Genética/genética , Locos de Características Quantitativas/genética , Alelos , HDL-Colesterol/genética , Análise por Conglomerados , Determinação de Ponto Final , Finlândia , Mapeamento Geográfico , Humanos , Herança Multifatorial/genética , Reprodutibilidade dos Testes
4.
Genome Res ; 27(5): 677-685, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-27895111

RESUMO

In an effort to more fully understand the full spectrum of human genetic variation, we generated deep single-molecule, real-time (SMRT) sequencing data from two haploid human genomes. By using an assembly-based approach (SMRT-SV), we systematically assessed each genome independently for structural variants (SVs) and indels resolving the sequence structure of 461,553 genetic variants from 2 bp to 28 kbp in length. We find that >89% of these variants have been missed as part of analysis of the 1000 Genomes Project even after adjusting for more common variants (MAF > 1%). We estimate that this theoretical human diploid differs by as much as ∼16 Mbp with respect to the human reference, with long-read sequencing data providing a fivefold increase in sensitivity for genetic variants ranging in size from 7 bp to 1 kbp compared with short-read sequence data. Although a large fraction of genetic variants were not detected by short-read approaches, once the alternate allele is sequence-resolved, we show that 61% of SVs can be genotyped in short-read sequence data sets with high accuracy. Uncoupling discovery from genotyping thus allows for the majority of this missed common variation to be genotyped in the human population. Interestingly, when we repeat SV detection on a pseudodiploid genome constructed in silico by merging the two haploids, we find that ∼59% of the heterozygous SVs are no longer detected by SMRT-SV. These results indicate that haploid resolution of long-read sequencing data will significantly increase sensitivity of SV detection.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Genoma Humano , Variação Estrutural do Genoma , Haploidia , Análise de Sequência de DNA/métodos , Mapeamento de Sequências Contíguas/normas , Projeto Genoma Humano , Humanos , Análise de Sequência de DNA/normas
5.
Genome Res ; 27(5): 849-864, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-28396521

RESUMO

The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health.


Assuntos
Mapeamento de Sequências Contíguas/métodos , Genoma Humano , Genômica/métodos , Análise de Sequência de DNA/métodos , Software , Mapeamento de Sequências Contíguas/normas , Genômica/normas , Haploidia , Haplótipos , Humanos , Polimorfismo Genético , Padrões de Referência , Análise de Sequência de DNA/normas
6.
Am J Hum Genet ; 96(3): 397-411, 2015 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-25704602

RESUMO

Although genome-wide association studies (GWASs) for nonsyndromic orofacial clefts have identified multiple strongly associated regions, the causal variants are unknown. To address this, we selected 13 regions from GWASs and other studies, performed targeted sequencing in 1,409 Asian and European trios, and carried out a series of statistical and functional analyses. Within a cluster of strongly associated common variants near NOG, we found that one, rs227727, disrupts enhancer activity. We furthermore identified significant clusters of non-coding rare variants near NTN1 and NOG and found several rare coding variants likely to affect protein function, including four nonsense variants in ARHGAP29. We confirmed 48 de novo mutations and, based on best biological evidence available, chose two of these for functional assays. One mutation in PAX7 disrupted the DNA binding of the encoded transcription factor in an in vitro assay. The second, a non-coding mutation, disrupted the activity of a neural crest enhancer downstream of FGFR2 both in vitro and in vivo. This targeted sequencing study provides strong functional evidence implicating several specific variants as primary contributory risk alleles for nonsyndromic clefting in humans.


Assuntos
Encéfalo/anormalidades , Proteínas de Transporte/genética , Fenda Labial/genética , Fissura Palatina/genética , Fator de Transcrição PAX7/genética , Polimorfismo de Nucleotídeo Único , Receptor Tipo 2 de Fator de Crescimento de Fibroblastos/genética , Alelos , Sequência de Aminoácidos , Animais , Povo Asiático/genética , Proteínas de Transporte/metabolismo , Linhagem Celular , Células Epiteliais/metabolismo , Regulação da Expressão Gênica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Camundongos , Camundongos Endogâmicos C57BL , Dados de Sequência Molecular , Mutação de Sentido Incorreto , Fator de Transcrição PAX7/metabolismo , Receptor Tipo 2 de Fator de Crescimento de Fibroblastos/metabolismo , Análise de Sequência de DNA , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , População Branca/genética , Peixe-Zebra/genética
7.
Genome Res ; 24(12): 2066-76, 2014 12.
Artigo em Inglês | MEDLINE | ID: mdl-25373144

RESUMO

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly.


Assuntos
Genoma Humano , Haplótipos , Mola Hidatiforme/genética , Alelos , Mapeamento Cromossômico , Cromossomos Artificiais Bacterianos , Biologia Computacional/métodos , Feminino , Genômica/métodos , Heterozigoto , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Polimorfismo de Nucleotídeo Único , Gravidez , Sequências Repetitivas de Ácido Nucleico , Duplicações Segmentares Genômicas , Análise de Sequência de DNA
8.
Am J Hum Genet ; 92(1): 28-40, 2013 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-23261299

RESUMO

Reduced FCGR3B copy number is associated with increased risk of systemic lupus erythematosus (SLE). The five FCGR2/FCGR3 genes are arranged across two highly paralogous genomic segments on chromosome 1q23. Previous studies have suggested mechanisms for structural rearrangements at the FCGR2/FCGR3 locus and have proposed mechanisms whereby altered FCGR3B copy number predisposes to autoimmunity, but the high degree of sequence similarity between paralogous segments has prevented precise definition of the molecular events and their functional consequences. To pursue the genomic pathology associated with FCGR3B copy-number variation, we integrated sequencing data from fosmid and bacterial artificial chromosome clones and sequence-captured DNA from FCGR3B-deleted genomes to establish a detailed map of allelic and paralogous sequence variation across the FCGR2/FCGR3 locus. This analysis identified two highly paralogous 24.5 kb blocks within the FCGR2C/FCGR3B/FCGR2B locus that are devoid of nonpolymorphic paralogous sequence variations and that define the limits of the genomic regions in which nonallelic homologous recombination leads to FCGR2C/FCGR3B copy-number variation. Further, the data showed evidence of swapping of haplotype blocks between these highly paralogous blocks that most likely arose from sequential ancestral recombination events across the region. Functionally, we found by flow cytometry, immunoblotting and cDNA sequencing that individuals with FCGR3B-deleted alleles show ectopic presence of FcγRIIb on natural killer (NK) cells. We conclude that FCGR3B deletion juxtaposes the 5'-regulatory sequences of FCGR2C with the coding sequence of FCGR2B, creating a chimeric gene that results in an ectopic accumulation of FcγRIIb on NK cells and provides an explanation for SLE risk associated with reduced FCGR3B gene copy number.


Assuntos
Variações do Número de Cópias de DNA , Lúpus Eritematoso Sistêmico/genética , Receptores de IgG/genética , Mapeamento Cromossômico , Proteínas Ligadas por GPI/genética , Deleção de Genes , Predisposição Genética para Doença , Humanos , Células Matadoras Naturais/metabolismo , Polimorfismo de Nucleotídeo Único
10.
Genes Immun ; 16(1): 24-34, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25338678

RESUMO

Germline variation at immunoglobulin (IG) loci is critical for pathogen-mediated immunity, but establishing complete haplotype sequences in these regions has been problematic because of complex sequence architecture and diploid source DNA. We sequenced BAC clones from the effectively haploid human hydatidiform mole cell line, CHM1htert, across the light chain IG loci, kappa (IGK) and lambda (IGL), creating single haplotype representations of these regions. The IGL haplotype generated here is 1.25 Mb of contiguous sequence, including four novel IGLV alleles, one novel IGLC allele, and an 11.9-kb insertion. The CH17 IGK haplotype consists of two 644 kb proximal and 466 kb distal contigs separated by a large gap of unknown size; these assemblies added 49 kb of unique sequence extending into this gap. Our analysis also resulted in the characterization of seven novel IGKV alleles and a 16.7-kb region exhibiting signatures of interlocus sequence exchange between distal and proximal IGKV gene clusters. Genetic diversity in IGK/IGL was compared with that of the IG heavy chain (IGH) locus within the same haploid genome, revealing threefold (IGK) and sixfold (IGL) higher diversity in the IGH locus, potentially associated with increased levels of segmental duplication and the telomeric location of IGH.


Assuntos
Genes de Cadeia Leve de Imunoglobulina , Mola Hidatiforme/genética , Linhagem Celular Tumoral , Cromossomos Artificiais Bacterianos , Feminino , Genes de Cadeia Pesada de Imunoglobulina , Humanos , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Gravidez
11.
Am J Hum Genet ; 90(4): 599-613, 2012 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-22482802

RESUMO

Recurrent deletions have been associated with numerous diseases and genomic disorders. Few, however, have been resolved at the molecular level because their breakpoints often occur in highly copy-number-polymorphic duplicated sequences. We present an approach that uses a combination of somatic cell hybrids, array comparative genomic hybridization, and the specificity of next-generation sequencing to determine breakpoints that occur within segmental duplications. Applying our technique to the 17q21.31 microdeletion syndrome, we used genome sequencing to determine copy-number-variant breakpoints in three deletion-bearing individuals with molecular resolution. For two cases, we observed breakpoints consistent with nonallelic homologous recombination involving only H2 chromosomal haplotypes, as expected. Molecular resolution revealed that the breakpoints occurred at different locations within a 145 kbp segment of >99% identity and disrupt KANSL1 (previously known as KANSL1). In the remaining case, we found that unequal crossover occurred interchromosomally between the H1 and H2 haplotypes and that this event was mediated by a homologous sequence that was once again missing from the human reference. Interestingly, the breakpoints mapped preferentially to gaps in the current reference genome assembly, which we resolved in this study. Our method provides a strategy for the identification of breakpoints within complex regions of the genome harboring high-identity and copy-number-polymorphic segmental duplication. The approach should become particularly useful as high-quality alternate reference sequences become available and genome sequencing of individuals' DNA becomes more routine.


Assuntos
Pontos de Quebra do Cromossomo , Cromossomos Humanos Par 17/genética , Análise de Sequência de DNA/métodos , Sequência de Bases , Deleção Cromossômica , Hibridização Genômica Comparativa/métodos , Variações do Número de Cópias de DNA , Haplótipos , Recombinação Homóloga , Humanos , Dados de Sequência Molecular , Duplicações Segmentares Genômicas , Síndrome de Smith-Magenis
12.
BMC Bioinformatics ; 11: 471, 2010 Sep 20.
Artigo em Inglês | MEDLINE | ID: mdl-20854673

RESUMO

BACKGROUND: The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research. RESULTS: SeqAnt (Sequence Annotator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds. CONCLUSION: SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.


Assuntos
Genômica/métodos , Anotação de Sequência Molecular/métodos , Análise de Sequência de DNA/métodos , Software , Animais , Sequência de Bases , Bases de Dados Genéticas , Variação Genética , Humanos , Internet , Camundongos
13.
Proc Biol Sci ; 274(1621): 1921-9, 2007 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-17535798

RESUMO

Humans play little role in the epidemiology of Escherichia coli O157:H7, a commensal bacterium of cattle. Why then does E. coli O157:H7 code for virulence determinants, like the Shiga toxins (Stxs), responsible for the morbidity and mortality of colonized humans? One possibility is that the virulence of these bacteria to humans is coincidental and these virulence factors evolved for and are maintained for other roles they play in the ecology of these bacteria. Here, we test the hypothesis that the carriage of the Stx-encoding prophage of E. coli O157:H7 increases the rate of survival of E. coli in the presence of grazing protozoa, Tetrahymena pyriformis. In the presence but not the absence of Tetrahymena, the carriage of the Stx-encoding prophage considerably augments the fitness of E. coli K-12 as well as clinical isolates of E. coli O157 by increasing the rate of survival of the bacteria in the food vacuoles of these ciliates. Grazing protozoa in the environment or natural host are likely to play a significant role in the ecology and maintenance of the Stx-encoding prophage of E. coli O157:H7 and may well contribute to the evolution of the virulence of these bacteria to colonize humans.


Assuntos
Escherichia coli K12/virologia , Escherichia coli O157/virologia , Prófagos/genética , Toxina Shiga/genética , Tetrahymena pyriformis/fisiologia , Fatores de Virulência/fisiologia , Animais , Evolução Biológica , Escherichia coli K12/genética , Escherichia coli K12/fisiologia , Escherichia coli O157/genética , Escherichia coli O157/fisiologia , Cadeia Alimentar , Toxina Shiga/metabolismo , Tetrahymena pyriformis/microbiologia , Vacúolos/microbiologia , Fatores de Virulência/genética , Fatores de Virulência/metabolismo
14.
Sci Rep ; 5: 9124, 2015 Mar 16.
Artigo em Inglês | MEDLINE | ID: mdl-25773295

RESUMO

The contribution of genetic variants to sporadic amyotrophic lateral sclerosis (ALS) remains largely unknown. Either recessive or de novo variants could result in an apparently sporadic occurrence of ALS. In an attempt to find such variants we sequenced the exomes of 44 ALS-unaffected-parents trios. Rare and potentially damaging compound heterozygous variants were found in 27% of ALS patients, homozygous recessive variants in 14% and coding de novo variants in 27%. In 20% of patients more than one of the above variants was present. Genes with recessive variants were enriched in nucleotide binding capacity, ATPase activity, and the dynein heavy chain. Genes with de novo variants were enriched in transcription regulation and cell cycle processes. This trio study indicates that rare private recessive variants could be a mechanism underlying some case of sporadic ALS, and that de novo mutations are also likely to play a part in the disease.


Assuntos
Esclerose Lateral Amiotrófica/genética , Exoma , Genes Recessivos , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Adulto , Idade de Início , Idoso , Idoso de 80 Anos ou mais , Esclerose Lateral Amiotrófica/diagnóstico , Feminino , Genótipo , Humanos , Masculino , Pessoa de Meia-Idade , Mutação
15.
Genome Biol ; 16: 13, 2015 Jan 24.
Artigo em Inglês | MEDLINE | ID: mdl-25651527

RESUMO

The human genome reference assembly is crucial for aligning and analyzing sequence data, and for genome annotation, among other roles. However, the models and analysis assumptions that underlie the current assembly need revising to fully represent human sequence diversity. Improved analysis tools and updated data reporting formats are also required.


Assuntos
Biologia Computacional/métodos , Genoma Humano , Genômica/métodos , Bases de Dados Genéticas , Humanos , Software
16.
Nat Genet ; 46(12): 1293-302, 2014 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-25326701

RESUMO

Recurrent deletions of chromosome 15q13.3 associate with intellectual disability, schizophrenia, autism and epilepsy. To gain insight into the instability of this region, we sequenced it in affected individuals, normal individuals and nonhuman primates. We discovered five structural configurations of the human chromosome 15q13.3 region ranging in size from 2 to 3 Mb. These configurations arose recently (∼0.5-0.9 million years ago) as a result of human-specific expansions of segmental duplications and two independent inversion events. All inversion breakpoints map near GOLGA8 core duplicons-a ∼14-kb primate-specific chromosome 15 repeat that became organized into larger palindromic structures. GOLGA8-flanked palindromes also demarcate the breakpoints of recurrent 15q13.3 microdeletions, the expansion of chromosome 15 segmental duplications in the human lineage and independent structural changes in apes. The significant clustering (P = 0.002) of breakpoints provides mechanistic evidence for the role of this core duplicon and its palindromic architecture in promoting the evolutionary and disease-related instability of chromosome 15.


Assuntos
Transtornos Cromossômicos/genética , Deficiência Intelectual/genética , Sequências Repetitivas de Ácido Nucleico , Duplicações Segmentares Genômicas , Convulsões/genética , Animais , Evolução Biológica , Deleção Cromossômica , Cromossomos Artificiais Bacterianos , Cromossomos Humanos Par 15/genética , Análise por Conglomerados , Hibridização Genômica Comparativa , Dosagem de Genes , Genoma Humano , Humanos , Hibridização in Situ Fluorescente , Modelos Genéticos , Polimorfismo Genético , Primatas , Análise de Sequência de DNA
18.
Mol Autism ; 3(1): 8, 2012 Sep 28.
Artigo em Inglês | MEDLINE | ID: mdl-23020841

RESUMO

BACKGROUND: Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. METHODS: We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. RESULTS: We found seven rare variants at evolutionary conserved sites in our study population. Functional analyses of the three 3' UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. CONCLUSIONS: These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects.

19.
Nat Genet ; 44(8): 872-80, 2012 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-22751100

RESUMO

The 17q21.31 inversion polymorphism exists either as direct (H1) or inverted (H2) haplotypes with differential predispositions to disease and selection. We investigated its genetic diversity in 2,700 individuals, with an emphasis on African populations. We characterize eight structural haplotypes due to complex rearrangements that vary in size from 1.08-1.49 Mb and provide evidence for a 30-kb H1-H2 double recombination event. We show that recurrent partial duplications of the KANSL1 gene have occurred on both the H1 and H2 haplotypes and have risen to high frequency in European populations. We identify a likely ancestral H2 haplotype (H2') lacking these duplications that is enriched among African hunter-gatherer groups yet essentially absent from West African populations. Whereas H1 and H2 segmental duplications arose independently and before human migration out of Africa, they have reached high frequencies recently among Europeans, either because of extraordinary genetic drift or selective sweeps.


Assuntos
Inversão Cromossômica/genética , Cromossomos Humanos Par 17/genética , África , População Negra/genética , Evolução Molecular , Frequência do Gene , Haplótipos , Humanos , Hibridização in Situ Fluorescente , Desequilíbrio de Ligação , Filogenia , Polimorfismo Genético , Polimorfismo de Nucleotídeo Único
20.
Nat Methods ; 4(11): 907-9, 2007 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-17934469

RESUMO

We developed a general method, microarray-based genomic selection (MGS), capable of selecting and enriching targeted sequences from complex eukaryotic genomes without the repeat blocking steps necessary for bacterial artificial chromosome (BAC)-based genomic selection. We demonstrate that large human genomic regions, on the order of hundreds of kilobases, can be enriched and resequenced with resequencing arrays. MGS, when combined with a next-generation resequencing technology, can enable large-scale resequencing in single-investigator laboratories.


Assuntos
Genoma/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Análise de Sequência de DNA/métodos , Animais , Células Eucarióticas/metabolismo , Proteína do X Frágil da Deficiência Intelectual/genética , Biblioteca Gênica , Genoma Humano/genética , Humanos , Mutação , Proteínas Nucleares/genética , Hibridização de Ácido Nucleico , Oligodesoxirribonucleotídeos/genética , Reação em Cadeia da Polimerase , Reprodutibilidade dos Testes , Transativadores/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA