Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
1.
Nature ; 463(7278): 184-90, 2010 Jan 14.
Article in English | MEDLINE | ID: mdl-20016488

ABSTRACT

Cancer is driven by mutation. Worldwide, tobacco smoking is the principal lifestyle exposure that causes cancer, exerting carcinogenicity through >60 chemicals that bind and mutate DNA. Using massively parallel sequencing technology, we sequenced a small-cell lung cancer cell line, NCI-H209, to explore the mutational burden associated with tobacco smoking. A total of 22,910 somatic substitutions were identified, including 134 in coding exons. Multiple mutation signatures testify to the cocktail of carcinogens in tobacco smoke and their proclivities for particular bases and surrounding sequence context. Effects of transcription-coupled repair and a second, more general, expression-linked repair pathway were evident. We identified a tandem duplication that duplicates exons 3-8 of CHD7 in frame, and another two lines carrying PVT1-CHD7 fusion genes, indicating that CHD7 may be recurrently rearranged in this disease. These findings illustrate the potential for next-generation sequencing to provide unprecedented insights into mutational processes, cellular repair pathways and gene networks associated with cancer.


Subject(s)
Lung Neoplasms/etiology , Lung Neoplasms/genetics , Mutation/genetics , Nicotiana/adverse effects , Small Cell Lung Carcinoma/etiology , Small Cell Lung Carcinoma/genetics , Smoking/adverse effects , Carcinogens/toxicity , Cell Line, Tumor , DNA Copy Number Variations/drug effects , DNA Copy Number Variations/genetics , DNA Damage/genetics , DNA Helicases/genetics , DNA Mutational Analysis , DNA Repair/genetics , DNA-Binding Proteins/genetics , Exons/genetics , Gene Expression Regulation, Neoplastic/drug effects , Genome, Human/drug effects , Genome, Human/genetics , Humans , Mutagenesis, Insertional/drug effects , Mutagenesis, Insertional/genetics , Mutation/drug effects , Promoter Regions, Genetic/genetics , Sequence Deletion/genetics
2.
Genome Res ; 19(9): 1527-41, 2009 Sep.
Article in English | MEDLINE | ID: mdl-19546169

ABSTRACT

We describe the genome sequencing of an anonymous individual of African origin using a novel ligation-based sequencing assay that enables a unique form of error correction that improves the raw accuracy of the aligned reads to >99.9%, allowing us to accurately call SNPs with as few as two reads per allele. We collected several billion mate-paired reads yielding approximately 18x haploid coverage of aligned sequence and close to 300x clone coverage. Over 98% of the reference genome is covered with at least one uniquely placed read, and 99.65% is spanned by at least one uniquely placed mate-paired clone. We identify over 3.8 million SNPs, 19% of which are novel. Mate-paired data are used to physically resolve haplotype phases of nearly two-thirds of the genotypes obtained and produce phased segments of up to 215 kb. We detect 226,529 intra-read indels, 5590 indels between mate-paired reads, 91 inversions, and four gene fusions. We use a novel approach for detecting indels between mate-paired reads that are smaller than the standard deviation of the insert size of the library and discover deletions in common with those detected with our intra-read approach. Dozens of mutations previously described in OMIM and hundreds of nonsynonymous single-nucleotide and structural variants in genes previously implicated in disease are identified in this individual. There is more genetic variation in the human genome still to be uncovered, and we provide guidance for future surveys in populations and cancer biopsies.


Subject(s)
Base Pairing , Computational Biology/methods , Genetic Variation , Genome, Human , Ligases , Sequence Analysis, DNA/methods , Africa , Base Sequence , Genomics , Genotype , Heterozygote , Homozygote , Humans , Polymorphism, Single Nucleotide , Reference Standards
3.
Nat Methods ; 5(2): 183-8, 2008 Feb.
Article in English | MEDLINE | ID: mdl-18204455

ABSTRACT

Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.


Subject(s)
Caenorhabditis elegans/genetics , Chromosome Mapping/methods , DNA Mutational Analysis/methods , Genetic Variation/genetics , Polymorphism, Single Nucleotide/genetics , Sequence Analysis, DNA/methods , Animals , Base Sequence , Molecular Sequence Data
4.
Hum Mutat ; 24(2): 178-84, 2004 Aug.
Article in English | MEDLINE | ID: mdl-15241800

ABSTRACT

The success rate of association studies can be improved by selecting better genetic markers for genotyping or by providing better leads for identifying pathogenic single nucleotide polymorphisms (SNPs) in the regions of linkage disequilibrium with positive disease associations. We have developed a novel algorithm to predict pathogenic single amino acid changes, either nonsynonymous SNPs (nsSNPs) or missense mutations, in conserved protein domains. Using a Bayesian framework, we found that the probability of a microbial missense mutation causing a significant change in phenotype depended on how much difference it made in several phylogenetic, biochemical, and structural features related to the single amino acid substitution. We tested our model on pathogenic allelic variants (missense mutations or nsSNPs) included in OMIM, and on the other nsSNPs in the same genes (from dbSNP) as the nonpathogenic variants. As a result, our model predicted pathogenic variants with a 10% false-positive rate. The high specificity of our prediction algorithm should make it valuable in genetic association studies aimed at identifying pathogenic SNPs.


Subject(s)
Bayes Theorem , Conserved Sequence/genetics , Peptides/genetics , Polymorphism, Single Nucleotide/genetics , Algorithms , Computational Biology/methods , Computational Biology/statistics & numerical data , Databases, Genetic , Humans , Linkage Disequilibrium/genetics , Models, Genetic , Mutation, Missense/genetics , Neural Networks, Computer , Predictive Value of Tests , Protein Structure, Tertiary/genetics , Transcription Factors/genetics
5.
Science ; 306(5705): 2255-7, 2004 Dec 24.
Article in English | MEDLINE | ID: mdl-15618518

ABSTRACT

In the developing brain, transcription factors (TFs) direct the formation of a diverse array of neurons and glia. We identifed 1445 putative TFs in the mouse genome. We used in situ hybridization to map the expression of over 1000 of these TFs and TF-coregulator genes in the brains of developing mice. We found that 349 of these genes showed restricted expression patterns that were adequate to describe the anatomical organization of the brain. We provide a comprehensive inventory of murine TFs and their expression patterns in a searchable brain atlas database.


Subject(s)
Brain/growth & development , Brain/metabolism , Gene Expression Profiling , Genome , Transcription Factors/genetics , Transcription Factors/metabolism , Animals , Animals, Newborn , Brain/anatomy & histology , Brain/embryology , Cloning, Molecular , Corpus Striatum/anatomy & histology , Corpus Striatum/embryology , Corpus Striatum/growth & development , Corpus Striatum/metabolism , DNA Primers , Databases, Factual , Hypothalamus/anatomy & histology , Hypothalamus/embryology , Hypothalamus/growth & development , Hypothalamus/metabolism , In Situ Hybridization , Mesencephalon/anatomy & histology , Mesencephalon/embryology , Mesencephalon/growth & development , Mesencephalon/metabolism , Mice , Neocortex/anatomy & histology , Neocortex/embryology , Neocortex/growth & development , Neocortex/metabolism , Polymerase Chain Reaction , Rhombencephalon/anatomy & histology , Rhombencephalon/embryology , Rhombencephalon/growth & development , Rhombencephalon/metabolism , Spinal Cord/anatomy & histology , Spinal Cord/embryology , Spinal Cord/growth & development , Spinal Cord/metabolism , Thalamus/anatomy & histology , Thalamus/embryology , Thalamus/growth & development , Thalamus/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL