Assuntos
Agricultura/métodos , Agricultura/tendências , Abastecimento de Alimentos/estatística & dados numéricos , Aclimatação/genética , Agricultura/economia , Biodiversidade , Bancos de Espécimes Biológicos , Cruzamento , Produtos Agrícolas/genética , Genes de Plantas , Humanos , Fenótipo , Sementes/genéticaRESUMO
We describe the details of a serial analysis of gene expression (SAGE) library construction and analysis platform that has enabled the generation of >298 high-quality SAGE libraries and >30 million SAGE tags primarily from sub-microgram amounts of total RNA purified from samples acquired by microdissection. Several RNA isolation methods were used to handle the diversity of samples processed, and various measures were applied to minimize ditag PCR carryover contamination. Modifications in the SAGE protocol resulted in improved cloning and DNA sequencing efficiencies. Bioinformatic measures to automatically assess DNA sequencing results were implemented to analyze the integrity of ditag structure, linker or cross-species ditag contamination, and yield of high-quality tags per sequence read. Our analysis of singleton tag errors resulted in a method for correcting such errors to statistically determine tag accuracy. From the libraries generated, we produced an essentially complete mapping of reliable 21-base-pair tags to the mouse reference genome sequence for a meta-library of approximately 5 million tags. Our analyses led us to reject the commonly held notion that duplicate ditags are artifacts. Rather than the usual practice of discarding such tags, we conclude that they should be retained to avoid introducing bias into the results and thereby maintain the quantitative nature of the data, which is a major theoretical advantage of SAGE as a tool for global transcriptional profiling.
Assuntos
Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Animais , Caenorhabditis elegans/genética , Linhagem Celular , Separação Celular , Bases de Dados de Ácidos Nucleicos , Células-Tronco Embrionárias/química , Citometria de Fluxo , Genoma , Humanos , Camundongos , Microdissecção , Análise de Sequência de DNA , Software , Peixe-Zebra/genéticaRESUMO
The cause of mental retardation in one-third to one-half of all affected individuals is unknown. Microscopically detectable chromosomal abnormalities are the most frequently recognized cause, but gain or loss of chromosomal segments that are too small to be seen by conventional cytogenetic analysis has been found to be another important cause. Array-based methods offer a practical means of performing a high-resolution survey of the entire genome for submicroscopic copy-number variants. We studied 100 children with idiopathic mental retardation and normal results of standard chromosomal analysis, by use of whole-genome sampling analysis with Affymetrix GeneChip Human Mapping 100K arrays. We found de novo deletions as small as 178 kb in eight cases, de novo duplications as small as 1.1 Mb in two cases, and unsuspected mosaic trisomy 9 in another case. This technology can detect at least twice as many potentially pathogenic de novo copy-number variants as conventional cytogenetic analysis can in people with mental retardation.
Assuntos
Aberrações Cromossômicas , Deficiência Intelectual/diagnóstico , Análise de Sequência com Séries de Oligonucleotídeos , Criança , Dosagem de Genes , Genoma Humano , Humanos , Deleção de SequênciaRESUMO
We analyzed 8.55 million LongSAGE tags generated from 72 libraries. Each LongSAGE library was prepared from a different mouse tissue. Analysis of the data revealed extensive overlap with existing gene data sets and evidence for the existence of approximately 24,000 previously undescribed genomic loci. The visual cortex, pancreas, mammary gland, preimplantation embryo, and placenta contain the largest number of differentially expressed transcripts, 25% of which are previously undescribed loci.