Results 1 - 15 of 15
1.
Nat Methods ; 7(10): 843-7, 2010 Oct.
Article in English | MEDLINE | ID: mdl-20835245

ABSTRACT

In alternative expression analysis by sequencing (ALEXA-seq), we developed a method to analyze massively parallel RNA sequence data to catalog transcripts and assess differential and alternative expression of known and predicted mRNA isoforms in cells and tissues. As proof of principle, we used the approach to compare fluorouracil-resistant and -nonresistant human colorectal cancer cell lines. We assessed the sensitivity and specificity of the approach by comparison to exon tiling and splicing microarrays and validated the results with reverse transcription-PCR, quantitative PCR and Sanger sequencing. We observed global disruption of splicing in fluorouracil-resistant cells characterized by expression of new mRNA isoforms resulting from exon skipping, alternative splice site usage and intron retention. Alternative expression annotation databases, source code, a data viewer and other resources to facilitate analysis are available at http://www.alexaplatform.org/alexa_seq/.


Subjects
Alternative Splicing , RNA, Messenger/genetics , Sequence Analysis, RNA/methods , Antimetabolites, Antineoplastic/pharmacology , Cell Line, Tumor , Colorectal Neoplasms/drug therapy , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , Databases, Genetic , Drug Resistance, Neoplasm/genetics , Expressed Sequence Tags , Fluorouracil/pharmacology , Gene Expression/drug effects , Gene Expression Profiling , Humans , Oligonucleotide Array Sequence Analysis , Protein Isoforms , Reverse Transcriptase Polymerase Chain Reaction , Sequence Alignment
2.
J Appl Lab Med ; 7(5): 1025-1036, 2022 09 01.
Article in English | MEDLINE | ID: mdl-35723286

ABSTRACT

BACKGROUND: To support the implementation of high-throughput pipelines suitable for SARS-CoV-2 sequencing and analysis in a clinical laboratory, we developed an automated sample preparation and analysis workflow. METHODS: We used the established ARTIC protocol with approximately 400 bp amplicons sequenced on Oxford Nanopore's MinION. Sequences were analyzed using Nextclade, assigning both a clade and quality score to each sample. RESULTS: A total of 2179 samples on twenty-five 96-well plates were sequenced. Plates of purified RNA were processed within 12 h, sequencing required up to 24 h, and analysis of each pooled plate required 1 h. The use of samples with known threshold cycle (Ct) values enabled normalization, acted as a quality control check, and revealed a strong correlation between sample Ct values and successful analysis, with 85% of samples with Ct < 30 achieving a "good" Nextclade score. Less abundant samples responded to enrichment with the fraction of Ct > 30 samples achieving a "good" classification rising by 60% after addition of a post-ARTIC PCR normalization. Serial dilutions of 3 variant of concern samples, diluted from approximately Ct = 16 to approximately Ct = 50, demonstrated successful sequencing to Ct = 37. The sample set contained a median of 24 mutations per sample and a total of 1281 unique mutations with reduced sequence read coverage noted in some regions of some samples. A total of 10 separate strains were observed in the sample set, including 3 variants of concern prevalent in British Columbia in the spring of 2021. CONCLUSIONS: We demonstrated a robust automated sequencing pipeline that takes advantage of input Ct values to improve reliability.


Subjects
COVID-19 , Nanopore Sequencing , Nanopores , COVID-19/diagnosis , COVID-19/epidemiology , Humans , Reproducibility of Results , SARS-CoV-2/genetics
3.
BMC Bioinformatics ; 8: 368, 2007 Oct 02.
Article in English | MEDLINE | ID: mdl-17910767

ABSTRACT

BACKGROUND: Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays. RESULTS: We evaluated four publicly available software packages for high throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regards to overall suitability for high-throughput 100 K SNP array data analysis, as well as effectiveness of normalization, scaling with various reference sets and feature extraction, as well as true and false positive rates of genomic copy number variant (CNV) detection. CONCLUSION: We observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity.


Subjects
Algorithms , Gene Dosage/genetics , Genetic Variation/genetics , Genomics/standards , Oligonucleotide Array Sequence Analysis/standards , Software Validation , Adult , Child , Genome, Human/genetics , Genomics/methods , Humans , Oligonucleotide Array Sequence Analysis/methods
5.
Nucleic Acids Res ; 30(11): 2460-8, 2002 Jun 01.
Article in English | MEDLINE | ID: mdl-12034834

ABSTRACT

We describe an efficient high-throughput method for accurate DNA sequencing of entire cDNA clones. Developed as part of our involvement in the Mammalian Gene Collection full-length cDNA sequencing initiative, the method has been used and refined in our laboratory since September 2000. Amenable to large scale projects, we have used the method to generate >7 Mb of accurate sequence from 3695 candidate full-length cDNAs. Sequencing is accomplished through the insertion of Mu transposon into cDNAs, followed by sequencing reactions primed with Mu-specific sequencing primers. Transposon insertion reactions are not performed with individual cDNAs but rather on pools of up to 96 clones. This pooling strategy reduces the number of transposon insertion sequencing libraries that would otherwise be required, reducing the costs and enhancing the efficiency of the transposon library construction procedure. Sequences generated using transposon-specific sequencing primers are assembled to yield the full-length cDNA sequence, with sequence editing and other sequence finishing activities performed as required to resolve sequence ambiguities. Although analysis of the many thousands (22 785) of sequenced Mu transposon insertion events revealed a weak sequence preference for Mu insertion, we observed insertion of the Mu transposon into 1015 of the possible 1024 5mer candidate insertion sites.


Subjects
Bacteriophage mu/genetics , DNA Transposable Elements/genetics , DNA, Complementary/genetics , Mutagenesis, Insertional/genetics , Recombination, Genetic/genetics , Sequence Analysis, DNA/methods , Base Composition , Cloning, Molecular , DNA Primers/genetics , Gene Library , Genetic Vectors/genetics , Monte Carlo Method , Physical Chromosome Mapping/methods , Sensitivity and Specificity , Sequence Analysis, DNA/economics , Substrate Specificity , Time Factors
6.
Curr Protoc Hum Genet ; Chapter 11: Unit 11.11.1-36, 2010 Apr.
Article in English | MEDLINE | ID: mdl-20373513

ABSTRACT

This unit provides a protocol for performing digital gene expression profiling on the Illumina Genome Analyzer sequencing platform. Tag sequencing (Tag-seq) is an implementation of the LongSAGE protocol on the Illumina sequencing platform that increases utility while reducing both the cost and time required to generate gene expression profiles. The ultra-high-throughput sequencing capability of the Illumina platform allows the cost-effective generation of libraries containing an average of 20 million tags, a 200-fold improvement over classical LongSAGE. Tag-seq has less sequence composition bias, leading to a better representation of AT-rich tag sequences, and allows a more accurate profiling of a subset of the transcriptome characterized by AT-rich genes expressed at levels below the threshold of detection of LongSAGE (Morrissy et al., 2009).


Subjects
Expressed Sequence Tags , Gene Expression Profiling/methods , Gene Library , Genomics/methods , RNA, Messenger/genetics , Sequence Analysis, DNA/methods , Polymerase Chain Reaction/methods
7.
Stem Cells ; 25(7): 1681-9, 2007 Jul.
Article in English | MEDLINE | ID: mdl-17412892

ABSTRACT

Transcriptome profiling offers a powerful approach to investigating developmental processes. Long serial analysis of gene expression (LongSAGE) is particularly attractive for this purpose because of its inherent quantitative features and independence of both hybridization variables and prior knowledge of transcript identity. Here, we describe the validation and initial application of a modified protocol for amplifying cDNA preparations from <10 ng of RNA (<10^3 cells) to allow representative LongSAGE libraries to be constructed from rare stem cell-enriched populations. Quantitative real-time polymerase chain reaction (Q-RT-PCR) analyses and comparison of tag frequencies in replicate LongSAGE libraries produced from amplified and nonamplified cDNA preparations demonstrated preservation of the relative levels of different transcripts originally present at widely varying levels. This PCR-LongSAGE protocol was then used to obtain a 200,000-tag library from the CD34+ subset of normal adult human bone marrow cells. Analysis of this library revealed many anticipated transcripts, as well as transcripts not previously known to be present in CD34+ hematopoietic cells. The latter included numerous novel tags that mapped to unique and conserved sites in the human genome but not previously identified as transcribed elements in human cells. Q-RT-PCR was used to demonstrate that 10 of these novel tags were expressed in cDNA pools and present in extracts of other sources of normal human CD34+ hematopoietic cells. These findings illustrate the power of LongSAGE to identify new transcripts in stem cell-enriched populations and indicate the potential of this approach to be extended to other sources of rare cells. Disclosure of potential conflicts of interest is found at the end of this article.


Subjects
Antigens, CD34/metabolism , Bone Marrow Cells/metabolism , Gene Expression Profiling/methods , Polymerase Chain Reaction/methods , Adult , Cell Separation , DNA, Complementary/genetics , Gene Library , Humans , RNA, Messenger/genetics , Reproducibility of Results
8.
Genome Res ; 17(1): 108-16, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17135571

ABSTRACT

We describe the details of a serial analysis of gene expression (SAGE) library construction and analysis platform that has enabled the generation of >298 high-quality SAGE libraries and >30 million SAGE tags primarily from sub-microgram amounts of total RNA purified from samples acquired by microdissection. Several RNA isolation methods were used to handle the diversity of samples processed, and various measures were applied to minimize ditag PCR carryover contamination. Modifications in the SAGE protocol resulted in improved cloning and DNA sequencing efficiencies. Bioinformatic measures to automatically assess DNA sequencing results were implemented to analyze the integrity of ditag structure, linker or cross-species ditag contamination, and yield of high-quality tags per sequence read. Our analysis of singleton tag errors resulted in a method for correcting such errors to statistically determine tag accuracy. From the libraries generated, we produced an essentially complete mapping of reliable 21-base-pair tags to the mouse reference genome sequence for a meta-library of approximately 5 million tags. Our analyses led us to reject the commonly held notion that duplicate ditags are artifacts. Rather than the usual practice of discarding such tags, we conclude that they should be retained to avoid introducing bias into the results and thereby maintain the quantitative nature of the data, which is a major theoretical advantage of SAGE as a tool for global transcriptional profiling.


Subjects
Gene Expression Profiling/methods , Gene Library , Animals , Caenorhabditis elegans/genetics , Cell Line , Cell Separation , Databases, Nucleic Acid , Embryonic Stem Cells/chemistry , Flow Cytometry , Genome , Humans , Mice , Microdissection , Sequence Analysis, DNA , Software , Zebrafish/genetics
9.
Genome Biol ; 8(6): R113, 2007.
Article in English | MEDLINE | ID: mdl-17570852

ABSTRACT

To facilitate discovery of novel human embryonic stem cell (ESC) transcripts, we generated 2.5 million LongSAGE tags from 9 human ESC lines. Analysis of this data revealed that ESCs express proportionately more RNA binding proteins compared with terminally differentiated cells, and identified novel ESC transcripts, at least one of which may represent a marker of the pluripotent state.


Subjects
Embryonic Stem Cells/metabolism , Gene Expression Profiling , Pluripotent Stem Cells/metabolism , Base Sequence , Cell Line , Humans , RNA-Binding Proteins/genetics , Sequence Alignment
10.
Plant J ; 50(6): 1063-78, 2007 Jun.
Article in English | MEDLINE | ID: mdl-17488239

ABSTRACT

As part of a larger project to sequence the Populus genome and generate genomic resources for this emerging model tree, we constructed a physical map of the Populus genome, representing one of the few such maps of an undomesticated, highly heterozygous plant species. The physical map, consisting of 2802 contigs, was constructed from fingerprinted bacterial artificial chromosome (BAC) clones. The map represents approximately 9.4-fold coverage of the Populus genome, which has been estimated from the genome sequence assembly to be 485 +/- 10 Mb in size. BAC ends were sequenced to assist long-range assembly of whole-genome shotgun sequence scaffolds and to anchor the physical map to the genome sequence. Simple sequence repeat-based markers were derived from the end sequences and used to initiate integration of the BAC and genetic maps. A total of 2411 physical map contigs, representing 97% of all clones assigned to contigs, were aligned to the sequence assembly (JGI Populus trichocarpa, version 1.0). These alignments represent a total coverage of 384 Mb (79%) of the entire poplar sequence assembly and 295 Mb (96%) of linkage group sequence assemblies. A striking result of the physical map contig alignments to the sequence assembly was the co-localization of multiple contigs across numerous regions of the 19 linkage groups. Targeted sequencing of BAC clones and genetic analysis in a small number of representative regions showed that these co-aligning contigs represent distinct haplotypes in the heterozygous individual sequenced, and revealed the nature of these haplotype sequence differences.


Subjects
Genome, Plant , Physical Chromosome Mapping , Populus/genetics , Chromosomes, Artificial, Bacterial , Haplotypes , Minisatellite Repeats , Polymorphism, Genetic , Sequence Alignment , Sequence Analysis, DNA
11.
Am J Hum Genet ; 79(3): 500-13, 2006 Sep.
Article in English | MEDLINE | ID: mdl-16909388

ABSTRACT

The cause of mental retardation in one-third to one-half of all affected individuals is unknown. Microscopically detectable chromosomal abnormalities are the most frequently recognized cause, but gain or loss of chromosomal segments that are too small to be seen by conventional cytogenetic analysis has been found to be another important cause. Array-based methods offer a practical means of performing a high-resolution survey of the entire genome for submicroscopic copy-number variants. We studied 100 children with idiopathic mental retardation and normal results of standard chromosomal analysis, by use of whole-genome sampling analysis with Affymetrix GeneChip Human Mapping 100K arrays. We found de novo deletions as small as 178 kb in eight cases, de novo duplications as small as 1.1 Mb in two cases, and unsuspected mosaic trisomy 9 in another case. This technology can detect at least twice as many potentially pathogenic de novo copy-number variants as conventional cytogenetic analysis can in people with mental retardation.


Subjects
Chromosome Aberrations , Intellectual Disability/diagnosis , Oligonucleotide Array Sequence Analysis , Child , Gene Dosage , Genome, Human , Humans , Sequence Deletion
12.
Proc Natl Acad Sci U S A ; 102(51): 18485-90, 2005 Dec 20.
Article in English | MEDLINE | ID: mdl-16352711

ABSTRACT

We analyzed 8.55 million LongSAGE tags generated from 72 libraries. Each LongSAGE library was prepared from a different mouse tissue. Analysis of the data revealed extensive overlap with existing gene data sets and evidence for the existence of approximately 24,000 previously undescribed genomic loci. The visual cortex, pancreas, mammary gland, preimplantation embryo, and placenta contain the largest number of differentially expressed transcripts, 25% of which are previously undescribed loci.


Subjects
Gene Expression Profiling , Gene Expression Regulation, Developmental/genetics , Mice, Inbred C57BL/genetics , Mice/genetics , Alternative Splicing/genetics , Animals , Multigene Family/genetics , RNA, Untranslated/genetics , Reproducibility of Results , Transcription, Genetic/genetics
13.
Genome Res ; 14(4): 766-79, 2004 Apr.
Article in English | MEDLINE | ID: mdl-15060021

ABSTRACT

As part of the effort to sequence the genome of Rattus norvegicus, we constructed a physical map comprised of fingerprinted bacterial artificial chromosome (BAC) clones from the CHORI-230 BAC library. These BAC clones provide approximately 13-fold redundant coverage of the genome and have been assembled into 376 fingerprint contigs. A yeast artificial chromosome (YAC) map was also constructed and aligned with the BAC map via fingerprinted BAC and P1 artificial chromosome clones (PACs) sharing interspersed repetitive sequence markers with the YAC-based physical map. We have annotated 95% of the fingerprint map clones in contigs with coordinates on the version 3.1 rat genome sequence assembly, using BAC-end sequences and in silico mapping methods. These coordinates have allowed anchoring 358 of the 376 fingerprint map contigs onto the sequence assembly. Of these, 324 contigs are anchored to rat genome sequences localized to chromosomes, and 34 contigs are anchored to unlocalized portions of the rat sequence assembly. The remaining 18 contigs, containing 54 clones, still require placement. The fingerprint map is a high-resolution integrative data resource that provides genome-ordered associations among BAC, YAC, and PAC clones and the assembled sequence of the rat genome.


Subjects
Chromosomes, Artificial, Bacterial/genetics , Chromosomes, Artificial, Yeast/genetics , Genome , Physical Chromosome Mapping/methods , Animals , Automation , Chromosomes/genetics , Cloning, Molecular/methods , Computational Biology/methods , Computational Biology/standards , Contig Mapping/methods , Contig Mapping/standards , DNA Fingerprinting/methods , DNA Fingerprinting/standards , Genetic Markers/genetics , Physical Chromosome Mapping/standards , Polymerase Chain Reaction/methods , Rats , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/standards
14.
Science ; 300(5624): 1399-404, 2003 May 30.
Article in English | MEDLINE | ID: mdl-12730501

ABSTRACT

We sequenced the 29,751-base genome of the severe acute respiratory syndrome (SARS)-associated coronavirus known as the Tor2 isolate. The genome sequence reveals that this coronavirus is only moderately related to other known coronaviruses, including two human coronaviruses, HCoV-OC43 and HCoV-229E. Phylogenetic analysis of the predicted viral proteins indicates that the virus does not closely resemble any of the three previously known groups of coronaviruses. The genome sequence will aid in the diagnosis of SARS virus infection in humans and potential animal hosts (using polymerase chain reaction and immunological tests), in the development of antivirals (including neutralizing antibodies), and in the identification of putative epitopes for vaccine development.


Subjects
Genome, Viral , RNA, Viral/genetics , Severe acute respiratory syndrome-related coronavirus/genetics , Viral Proteins/genetics , 3' Untranslated Regions , 5' Untranslated Regions , Animals , Base Sequence , Conserved Sequence , Coronavirus/classification , Coronavirus/genetics , Coronavirus M Proteins , Coronavirus Nucleocapsid Proteins , DNA, Complementary , Frameshifting, Ribosomal , Humans , Membrane Glycoproteins/chemistry , Membrane Glycoproteins/genetics , Nucleocapsid Proteins/chemistry , Nucleocapsid Proteins/genetics , Open Reading Frames , Phylogeny , RNA, Viral/isolation & purification , RNA-Dependent RNA Polymerase/chemistry , RNA-Dependent RNA Polymerase/genetics , Regulatory Sequences, Nucleic Acid , Severe acute respiratory syndrome-related coronavirus/classification , Severe acute respiratory syndrome-related coronavirus/isolation & purification , Sequence Analysis, DNA , Severe Acute Respiratory Syndrome/virology , Spike Glycoprotein, Coronavirus , Viral Envelope Proteins/chemistry , Viral Envelope Proteins/genetics , Viral Matrix Proteins/chemistry , Viral Matrix Proteins/genetics , Viral Proteins/chemistry
15.
Nature ; 418(6899): 743-50, 2002 Aug 15.
Article in English | MEDLINE | ID: mdl-12181558

ABSTRACT

A physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. We have constructed a physical map of the mouse genome that contains 296 contigs of overlapping bacterial clones and 16,992 unique markers. The mouse contigs were aligned to the human genome sequence on the basis of 51,486 homology matches, thus enabling use of the conserved synteny (correspondence between chromosome blocks) of the two genomes to accelerate construction of the mouse map. The map provides a framework for assembly of whole-genome shotgun sequence data, and a tile path of clones for generation of the reference sequence. Definition of the human-mouse alignment at this level of resolution enables identification of a mouse clone that corresponds to almost any position in the human genome. The human sequence may be used to facilitate construction of other mammalian genome maps using the same strategy.


Subjects
Genome , Mice/genetics , Physical Chromosome Mapping/methods , Animals , Chromosomes/genetics , Chromosomes, Human, Pair 6/genetics , Cloning, Molecular , Conserved Sequence/genetics , Contig Mapping/methods , Genome, Human , Humans , Molecular Sequence Data , Radiation Hybrid Mapping , Sequence Alignment , Sequence Homology, Nucleic Acid , Species Specificity , Synteny