Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 21
Filter
1.
Cell ; 155(1): 27-38, 2013 Sep 26.
Article in English | MEDLINE | ID: mdl-24074859

ABSTRACT

Genomics is a relatively new scientific discipline, having DNA sequencing as its core technology. As technology has improved the cost and scale of genome characterization over sequencing's 40-year history, the scope of inquiry has commensurately broadened. Massively parallel sequencing has proven revolutionary, shifting the paradigm of genomics to address biological questions at a genome-wide scale. Sequencing now empowers clinical diagnostics and other aspects of medical care, including disease risk, therapeutic identification, and prenatal testing. This Review explores the current state of genomics in the massively parallel sequencing era.


Subject(s)
Genomics , High-Throughput Nucleotide Sequencing/methods , Sequence Analysis, DNA/methods , Animals , Disease/genetics , Genetic Variation , High-Throughput Nucleotide Sequencing/history , History, 20th Century , History, 21st Century , Humans , Mutation , Sequence Analysis, DNA/history
2.
Nature ; 572(7769): 323-328, 2019 08.
Article in English | MEDLINE | ID: mdl-31367044

ABSTRACT

Exome-sequencing studies have generally been underpowered to identify deleterious alleles with a large effect on complex traits as such alleles are mostly rare. Because the population of northern and eastern Finland has expanded considerably and in isolation following a series of bottlenecks, individuals of these populations have numerous deleterious alleles at a relatively high frequency. Here, using exome sequencing of nearly 20,000 individuals from these regions, we investigate the role of rare coding variants in clinically relevant quantitative cardiometabolic traits. Exome-wide association studies for 64 quantitative traits identified 26 newly associated deleterious alleles. Of these 26 alleles, 19 are either unique to or more than 20 times more frequent in Finnish individuals than in other Europeans and show geographical clustering comparable to Mendelian disease mutations that are characteristic of the Finnish population. We estimate that sequencing studies of populations without this unique history would require hundreds of thousands to millions of participants to achieve comparable association power.


Subject(s)
Exome Sequencing , Genetic Association Studies/methods , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Quantitative Trait Loci/genetics , Alleles , Cholesterol, HDL/genetics , Cluster Analysis , Endpoint Determination , Finland , Geographic Mapping , Humans , Multifactorial Inheritance/genetics , Reproducibility of Results
4.
Genome Res ; 27(5): 677-685, 2017 05.
Article in English | MEDLINE | ID: mdl-27895111

ABSTRACT

In an effort to more fully understand the full spectrum of human genetic variation, we generated deep single-molecule, real-time (SMRT) sequencing data from two haploid human genomes. By using an assembly-based approach (SMRT-SV), we systematically assessed each genome independently for structural variants (SVs) and indels resolving the sequence structure of 461,553 genetic variants from 2 bp to 28 kbp in length. We find that >89% of these variants have been missed as part of analysis of the 1000 Genomes Project even after adjusting for more common variants (MAF > 1%). We estimate that this theoretical human diploid differs by as much as ∼16 Mbp with respect to the human reference, with long-read sequencing data providing a fivefold increase in sensitivity for genetic variants ranging in size from 7 bp to 1 kbp compared with short-read sequence data. Although a large fraction of genetic variants were not detected by short-read approaches, once the alternate allele is sequence-resolved, we show that 61% of SVs can be genotyped in short-read sequence data sets with high accuracy. Uncoupling discovery from genotyping thus allows for the majority of this missed common variation to be genotyped in the human population. Interestingly, when we repeat SV detection on a pseudodiploid genome constructed in silico by merging the two haploids, we find that ∼59% of the heterozygous SVs are no longer detected by SMRT-SV. These results indicate that haploid resolution of long-read sequencing data will significantly increase sensitivity of SV detection.


Subject(s)
Contig Mapping/methods , Genome, Human , Genomic Structural Variation , Haploidy , Sequence Analysis, DNA/methods , Contig Mapping/standards , Human Genome Project , Humans , Sequence Analysis, DNA/standards
5.
Genome Res ; 27(5): 849-864, 2017 05.
Article in English | MEDLINE | ID: mdl-28396521

ABSTRACT

The human reference genome assembly plays a central role in nearly all aspects of today's basic and clinical research. GRCh38 is the first coordinate-changing assembly update since 2009; it reflects the resolution of roughly 1000 issues and encompasses modifications ranging from thousands of single base changes to megabase-scale path reorganizations, gap closures, and localization of previously orphaned sequences. We developed a new approach to sequence generation for targeted base updates and used data from new genome mapping technologies and single haplotype resources to identify and resolve larger assembly issues. For the first time, the reference assembly contains sequence-based representations for the centromeres. We also expanded the number of alternate loci to create a reference that provides a more robust representation of human population variation. We demonstrate that the updates render the reference an improved annotation substrate, alter read alignments in unchanged regions, and impact variant interpretation at clinically relevant loci. We additionally evaluated a collection of new de novo long-read haploid assemblies and conclude that although the new assemblies compare favorably to the reference with respect to continuity, error rate, and gene completeness, the reference still provides the best representation for complex genomic regions and coding sequences. We assert that the collected updates in GRCh38 make the newer assembly a more robust substrate for comprehensive analyses that will promote our understanding of human biology and advance our efforts to improve health.


Subject(s)
Contig Mapping/methods , Genome, Human , Genomics/methods , Sequence Analysis, DNA/methods , Software , Contig Mapping/standards , Genomics/standards , Haploidy , Haplotypes , Humans , Polymorphism, Genetic , Reference Standards , Sequence Analysis, DNA/standards
6.
Am J Hum Genet ; 96(3): 397-411, 2015 Mar 05.
Article in English | MEDLINE | ID: mdl-25704602

ABSTRACT

Although genome-wide association studies (GWASs) for nonsyndromic orofacial clefts have identified multiple strongly associated regions, the causal variants are unknown. To address this, we selected 13 regions from GWASs and other studies, performed targeted sequencing in 1,409 Asian and European trios, and carried out a series of statistical and functional analyses. Within a cluster of strongly associated common variants near NOG, we found that one, rs227727, disrupts enhancer activity. We furthermore identified significant clusters of non-coding rare variants near NTN1 and NOG and found several rare coding variants likely to affect protein function, including four nonsense variants in ARHGAP29. We confirmed 48 de novo mutations and, based on best biological evidence available, chose two of these for functional assays. One mutation in PAX7 disrupted the DNA binding of the encoded transcription factor in an in vitro assay. The second, a non-coding mutation, disrupted the activity of a neural crest enhancer downstream of FGFR2 both in vitro and in vivo. This targeted sequencing study provides strong functional evidence implicating several specific variants as primary contributory risk alleles for nonsyndromic clefting in humans.


Subject(s)
Brain/abnormalities , Carrier Proteins/genetics , Cleft Lip/genetics , Cleft Palate/genetics , PAX7 Transcription Factor/genetics , Polymorphism, Single Nucleotide , Receptor, Fibroblast Growth Factor, Type 2/genetics , Alleles , Amino Acid Sequence , Animals , Asian People/genetics , Carrier Proteins/metabolism , Cell Line , Epithelial Cells/metabolism , Gene Expression Regulation , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Mice , Mice, Inbred C57BL , Molecular Sequence Data , Mutation, Missense , PAX7 Transcription Factor/metabolism , Receptor, Fibroblast Growth Factor, Type 2/metabolism , Sequence Analysis, DNA , Transcription Factors/genetics , Transcription Factors/metabolism , White People/genetics , Zebrafish/genetics
7.
Genome Res ; 24(12): 2066-76, 2014 12.
Article in English | MEDLINE | ID: mdl-25373144

ABSTRACT

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly.


Subject(s)
Genome, Human , Haplotypes , Hydatidiform Mole/genetics , Alleles , Chromosome Mapping , Chromosomes, Artificial, Bacterial , Computational Biology/methods , Female , Genomics/methods , Heterozygote , High-Throughput Nucleotide Sequencing , Humans , Polymorphism, Single Nucleotide , Pregnancy , Repetitive Sequences, Nucleic Acid , Segmental Duplications, Genomic , Sequence Analysis, DNA
8.
Am J Hum Genet ; 92(1): 28-40, 2013 Jan 10.
Article in English | MEDLINE | ID: mdl-23261299

ABSTRACT

Reduced FCGR3B copy number is associated with increased risk of systemic lupus erythematosus (SLE). The five FCGR2/FCGR3 genes are arranged across two highly paralogous genomic segments on chromosome 1q23. Previous studies have suggested mechanisms for structural rearrangements at the FCGR2/FCGR3 locus and have proposed mechanisms whereby altered FCGR3B copy number predisposes to autoimmunity, but the high degree of sequence similarity between paralogous segments has prevented precise definition of the molecular events and their functional consequences. To pursue the genomic pathology associated with FCGR3B copy-number variation, we integrated sequencing data from fosmid and bacterial artificial chromosome clones and sequence-captured DNA from FCGR3B-deleted genomes to establish a detailed map of allelic and paralogous sequence variation across the FCGR2/FCGR3 locus. This analysis identified two highly paralogous 24.5 kb blocks within the FCGR2C/FCGR3B/FCGR2B locus that are devoid of nonpolymorphic paralogous sequence variations and that define the limits of the genomic regions in which nonallelic homologous recombination leads to FCGR2C/FCGR3B copy-number variation. Further, the data showed evidence of swapping of haplotype blocks between these highly paralogous blocks that most likely arose from sequential ancestral recombination events across the region. Functionally, we found by flow cytometry, immunoblotting and cDNA sequencing that individuals with FCGR3B-deleted alleles show ectopic presence of FcγRIIb on natural killer (NK) cells. We conclude that FCGR3B deletion juxtaposes the 5'-regulatory sequences of FCGR2C with the coding sequence of FCGR2B, creating a chimeric gene that results in an ectopic accumulation of FcγRIIb on NK cells and provides an explanation for SLE risk associated with reduced FCGR3B gene copy number.


Subject(s)
DNA Copy Number Variations , Lupus Erythematosus, Systemic/genetics , Receptors, IgG/genetics , Chromosome Mapping , GPI-Linked Proteins/genetics , Gene Deletion , Genetic Predisposition to Disease , Humans , Killer Cells, Natural/metabolism , Polymorphism, Single Nucleotide
10.
Genes Immun ; 16(1): 24-34, 2015.
Article in English | MEDLINE | ID: mdl-25338678

ABSTRACT

Germline variation at immunoglobulin (IG) loci is critical for pathogen-mediated immunity, but establishing complete haplotype sequences in these regions has been problematic because of complex sequence architecture and diploid source DNA. We sequenced BAC clones from the effectively haploid human hydatidiform mole cell line, CHM1htert, across the light chain IG loci, kappa (IGK) and lambda (IGL), creating single haplotype representations of these regions. The IGL haplotype generated here is 1.25 Mb of contiguous sequence, including four novel IGLV alleles, one novel IGLC allele, and an 11.9-kb insertion. The CH17 IGK haplotype consists of two 644 kb proximal and 466 kb distal contigs separated by a large gap of unknown size; these assemblies added 49 kb of unique sequence extending into this gap. Our analysis also resulted in the characterization of seven novel IGKV alleles and a 16.7-kb region exhibiting signatures of interlocus sequence exchange between distal and proximal IGKV gene clusters. Genetic diversity in IGK/IGL was compared with that of the IG heavy chain (IGH) locus within the same haploid genome, revealing threefold (IGK) and sixfold (IGL) higher diversity in the IGH locus, potentially associated with increased levels of segmental duplication and the telomeric location of IGH.


Subject(s)
Genes, Immunoglobulin Light Chain , Hydatidiform Mole/genetics , Cell Line, Tumor , Chromosomes, Artificial, Bacterial , Female , Genes, Immunoglobulin Heavy Chain , Humans , Molecular Sequence Data , Polymorphism, Single Nucleotide , Pregnancy
11.
Am J Hum Genet ; 90(4): 599-613, 2012 Apr 06.
Article in English | MEDLINE | ID: mdl-22482802

ABSTRACT

Recurrent deletions have been associated with numerous diseases and genomic disorders. Few, however, have been resolved at the molecular level because their breakpoints often occur in highly copy-number-polymorphic duplicated sequences. We present an approach that uses a combination of somatic cell hybrids, array comparative genomic hybridization, and the specificity of next-generation sequencing to determine breakpoints that occur within segmental duplications. Applying our technique to the 17q21.31 microdeletion syndrome, we used genome sequencing to determine copy-number-variant breakpoints in three deletion-bearing individuals with molecular resolution. For two cases, we observed breakpoints consistent with nonallelic homologous recombination involving only H2 chromosomal haplotypes, as expected. Molecular resolution revealed that the breakpoints occurred at different locations within a 145 kbp segment of >99% identity and disrupt KANSL1 (previously known as KANSL1). In the remaining case, we found that unequal crossover occurred interchromosomally between the H1 and H2 haplotypes and that this event was mediated by a homologous sequence that was once again missing from the human reference. Interestingly, the breakpoints mapped preferentially to gaps in the current reference genome assembly, which we resolved in this study. Our method provides a strategy for the identification of breakpoints within complex regions of the genome harboring high-identity and copy-number-polymorphic segmental duplication. The approach should become particularly useful as high-quality alternate reference sequences become available and genome sequencing of individuals' DNA becomes more routine.


Subject(s)
Chromosome Breakpoints , Chromosomes, Human, Pair 17/genetics , Sequence Analysis, DNA/methods , Base Sequence , Chromosome Deletion , Comparative Genomic Hybridization/methods , DNA Copy Number Variations , Haplotypes , Homologous Recombination , Humans , Molecular Sequence Data , Segmental Duplications, Genomic , Smith-Magenis Syndrome
12.
BMC Bioinformatics ; 11: 471, 2010 Sep 20.
Article in English | MEDLINE | ID: mdl-20854673

ABSTRACT

BACKGROUND: The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research. RESULTS: SeqAnt (Sequence Annotator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds. CONCLUSION: SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.


Subject(s)
Genomics/methods , Molecular Sequence Annotation/methods , Sequence Analysis, DNA/methods , Software , Animals , Base Sequence , Databases, Genetic , Genetic Variation , Humans , Internet , Mice
13.
Proc Biol Sci ; 274(1621): 1921-9, 2007 Aug 22.
Article in English | MEDLINE | ID: mdl-17535798

ABSTRACT

Humans play little role in the epidemiology of Escherichia coli O157:H7, a commensal bacterium of cattle. Why then does E. coli O157:H7 code for virulence determinants, like the Shiga toxins (Stxs), responsible for the morbidity and mortality of colonized humans? One possibility is that the virulence of these bacteria to humans is coincidental and these virulence factors evolved for and are maintained for other roles they play in the ecology of these bacteria. Here, we test the hypothesis that the carriage of the Stx-encoding prophage of E. coli O157:H7 increases the rate of survival of E. coli in the presence of grazing protozoa, Tetrahymena pyriformis. In the presence but not the absence of Tetrahymena, the carriage of the Stx-encoding prophage considerably augments the fitness of E. coli K-12 as well as clinical isolates of E. coli O157 by increasing the rate of survival of the bacteria in the food vacuoles of these ciliates. Grazing protozoa in the environment or natural host are likely to play a significant role in the ecology and maintenance of the Stx-encoding prophage of E. coli O157:H7 and may well contribute to the evolution of the virulence of these bacteria to colonize humans.


Subject(s)
Escherichia coli K12/virology , Escherichia coli O157/virology , Prophages/genetics , Shiga Toxin/genetics , Tetrahymena pyriformis/physiology , Virulence Factors/physiology , Animals , Biological Evolution , Escherichia coli K12/genetics , Escherichia coli K12/physiology , Escherichia coli O157/genetics , Escherichia coli O157/physiology , Food Chain , Shiga Toxin/metabolism , Tetrahymena pyriformis/microbiology , Vacuoles/microbiology , Virulence Factors/genetics , Virulence Factors/metabolism
14.
Sci Rep ; 5: 9124, 2015 Mar 16.
Article in English | MEDLINE | ID: mdl-25773295

ABSTRACT

The contribution of genetic variants to sporadic amyotrophic lateral sclerosis (ALS) remains largely unknown. Either recessive or de novo variants could result in an apparently sporadic occurrence of ALS. In an attempt to find such variants we sequenced the exomes of 44 ALS-unaffected-parents trios. Rare and potentially damaging compound heterozygous variants were found in 27% of ALS patients, homozygous recessive variants in 14% and coding de novo variants in 27%. In 20% of patients more than one of the above variants was present. Genes with recessive variants were enriched in nucleotide binding capacity, ATPase activity, and the dynein heavy chain. Genes with de novo variants were enriched in transcription regulation and cell cycle processes. This trio study indicates that rare private recessive variants could be a mechanism underlying some case of sporadic ALS, and that de novo mutations are also likely to play a part in the disease.


Subject(s)
Amyotrophic Lateral Sclerosis/genetics , Exome , Genes, Recessive , Genetic Variation , High-Throughput Nucleotide Sequencing , Adult , Age of Onset , Aged , Aged, 80 and over , Amyotrophic Lateral Sclerosis/diagnosis , Female , Genotype , Humans , Male , Middle Aged , Mutation
15.
Genome Biol ; 16: 13, 2015 Jan 24.
Article in English | MEDLINE | ID: mdl-25651527

ABSTRACT

The human genome reference assembly is crucial for aligning and analyzing sequence data, and for genome annotation, among other roles. However, the models and analysis assumptions that underlie the current assembly need revising to fully represent human sequence diversity. Improved analysis tools and updated data reporting formats are also required.


Subject(s)
Computational Biology/methods , Genome, Human , Genomics/methods , Databases, Genetic , Humans , Software
16.
Nat Genet ; 46(12): 1293-302, 2014 Dec.
Article in English | MEDLINE | ID: mdl-25326701

ABSTRACT

Recurrent deletions of chromosome 15q13.3 associate with intellectual disability, schizophrenia, autism and epilepsy. To gain insight into the instability of this region, we sequenced it in affected individuals, normal individuals and nonhuman primates. We discovered five structural configurations of the human chromosome 15q13.3 region ranging in size from 2 to 3 Mb. These configurations arose recently (∼0.5-0.9 million years ago) as a result of human-specific expansions of segmental duplications and two independent inversion events. All inversion breakpoints map near GOLGA8 core duplicons-a ∼14-kb primate-specific chromosome 15 repeat that became organized into larger palindromic structures. GOLGA8-flanked palindromes also demarcate the breakpoints of recurrent 15q13.3 microdeletions, the expansion of chromosome 15 segmental duplications in the human lineage and independent structural changes in apes. The significant clustering (P = 0.002) of breakpoints provides mechanistic evidence for the role of this core duplicon and its palindromic architecture in promoting the evolutionary and disease-related instability of chromosome 15.


Subject(s)
Chromosome Disorders/genetics , Intellectual Disability/genetics , Repetitive Sequences, Nucleic Acid , Segmental Duplications, Genomic , Seizures/genetics , Animals , Biological Evolution , Chromosome Deletion , Chromosomes, Artificial, Bacterial , Chromosomes, Human, Pair 15/genetics , Cluster Analysis , Comparative Genomic Hybridization , Gene Dosage , Genome, Human , Humans , In Situ Hybridization, Fluorescence , Models, Genetic , Polymorphism, Genetic , Primates , Sequence Analysis, DNA
18.
Mol Autism ; 3(1): 8, 2012 Sep 28.
Article in English | MEDLINE | ID: mdl-23020841

ABSTRACT

BACKGROUND: Autism spectrum disorder (ASD) is highly heritable, but the genetic risk factors for it remain largely unknown. Although structural variants with large effect sizes may explain up to 15% ASD, genome-wide association studies have failed to uncover common single nucleotide variants with large effects on phenotype. The focus within ASD genetics is now shifting to the examination of rare sequence variants of modest effect, which is most often achieved via exome selection and sequencing. This strategy has indeed identified some rare candidate variants; however, the approach does not capture the full spectrum of genetic variation that might contribute to the phenotype. METHODS: We surveyed two loci with known rare variants that contribute to ASD, the X-linked neuroligin genes by performing massively parallel Illumina sequencing of the coding and noncoding regions from these genes in males from families with multiplex autism. We annotated all variant sites and functionally tested a subset to identify other rare mutations contributing to ASD susceptibility. RESULTS: We found seven rare variants at evolutionary conserved sites in our study population. Functional analyses of the three 3' UTR variants did not show statistically significant effects on the expression of NLGN3 and NLGN4X. In addition, we identified two NLGN3 intronic variants located within conserved transcription factor binding sites that could potentially affect gene regulation. CONCLUSIONS: These data demonstrate the power of massively parallel, targeted sequencing studies of affected individuals for identifying rare, potentially disease-contributing variation. However, they also point out the challenges and limitations of current methods of direct functional testing of rare variants and the difficulties of identifying alleles with modest effects.

19.
Nat Genet ; 44(8): 872-80, 2012 Jul 01.
Article in English | MEDLINE | ID: mdl-22751100

ABSTRACT

The 17q21.31 inversion polymorphism exists either as direct (H1) or inverted (H2) haplotypes with differential predispositions to disease and selection. We investigated its genetic diversity in 2,700 individuals, with an emphasis on African populations. We characterize eight structural haplotypes due to complex rearrangements that vary in size from 1.08-1.49 Mb and provide evidence for a 30-kb H1-H2 double recombination event. We show that recurrent partial duplications of the KANSL1 gene have occurred on both the H1 and H2 haplotypes and have risen to high frequency in European populations. We identify a likely ancestral H2 haplotype (H2') lacking these duplications that is enriched among African hunter-gatherer groups yet essentially absent from West African populations. Whereas H1 and H2 segmental duplications arose independently and before human migration out of Africa, they have reached high frequencies recently among Europeans, either because of extraordinary genetic drift or selective sweeps.


Subject(s)
Chromosome Inversion/genetics , Chromosomes, Human, Pair 17/genetics , Africa , Black People/genetics , Evolution, Molecular , Gene Frequency , Haplotypes , Humans , In Situ Hybridization, Fluorescence , Linkage Disequilibrium , Phylogeny , Polymorphism, Genetic , Polymorphism, Single Nucleotide
20.
Nat Methods ; 4(11): 907-9, 2007 Nov.
Article in English | MEDLINE | ID: mdl-17934469

ABSTRACT

We developed a general method, microarray-based genomic selection (MGS), capable of selecting and enriching targeted sequences from complex eukaryotic genomes without the repeat blocking steps necessary for bacterial artificial chromosome (BAC)-based genomic selection. We demonstrate that large human genomic regions, on the order of hundreds of kilobases, can be enriched and resequenced with resequencing arrays. MGS, when combined with a next-generation resequencing technology, can enable large-scale resequencing in single-investigator laboratories.


Subject(s)
Genome/genetics , Oligonucleotide Array Sequence Analysis/methods , Sequence Analysis, DNA/methods , Animals , Eukaryotic Cells/metabolism , Fragile X Mental Retardation Protein/genetics , Gene Library , Genome, Human/genetics , Humans , Mutation , Nuclear Proteins/genetics , Nucleic Acid Hybridization , Oligodeoxyribonucleotides/genetics , Polymerase Chain Reaction , Reproducibility of Results , Trans-Activators/genetics
SELECTION OF CITATIONS
SEARCH DETAIL