Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros

Base de dados
Tipo de documento
País/Região como assunto
Intervalo de ano de publicação
1.
Annu Rev Genomics Hum Genet ; 16: 133-51, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25939052

RESUMO

High-throughput DNA sequencing has considerably changed the possibilities for conducting biomedical research by measuring billions of short DNA or RNA fragments. A central computational problem, and for many applications a first step, consists of determining where the fragments came from in the original genome. In this article, we review the main techniques for generating the fragments, the main applications, and the main algorithmic ideas for computing a solution to the read alignment problem. In addition, we describe pitfalls and difficulties connected to determining the correct positions of reads.


Assuntos
Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Genoma , Poliploidia , Sequências Repetitivas de Ácido Nucleico , Software
2.
Nature ; 456(7218): 53-9, 2008 Nov 06.
Artigo em Inglês | MEDLINE | ID: mdl-18987734

RESUMO

DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally used long (400-800 base pair) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re-sequencing, whereby shorter reads are compared to a reference to identify intraspecies genetic variation. Here we report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high-quality sequence. We demonstrate application of this approach to human genome sequencing on flow-sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from >30x average depth of paired 35-base reads. We characterize four million single-nucleotide polymorphisms and four hundred thousand structural variants, many of which were previously unknown. Our approach is effective for accurate, rapid and economical whole-genome re-sequencing and many other biomedical applications.


Assuntos
Genoma Humano/genética , Genômica/métodos , Análise de Sequência de DNA/métodos , Cromossomos Humanos X/genética , Sequência Consenso/genética , Genômica/economia , Genótipo , Humanos , Masculino , Nigéria , Polimorfismo de Nucleotídeo Único/genética , Sensibilidade e Especificidade , Análise de Sequência de DNA/economia
3.
Bioinformatics ; 26(24): 3051-8, 2010 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-20966003

RESUMO

MOTIVATION: Copy number abnormalities (CNAs) represent an important type of genetic mutation that can lead to abnormal cell growth and proliferation. New high-throughput sequencing technologies promise comprehensive characterization of CNAs. In contrast to microarrays, where probe design follows a carefully developed protocol, reads represent a random sample from a library and may be prone to representation biases due to GC content and other factors. The discrimination between true and false positive CNAs becomes an important issue. RESULTS: We present a novel approach, called CNAseg, to identify CNAs from second-generation sequencing data. It uses depth of coverage to estimate copy number states and flowcell-to-flowcell variability in cancer and normal samples to control the false positive rate. We tested the method using the COLO-829 melanoma cell line sequenced to 40-fold coverage. An extensive simulation scheme was developed to recreate different scenarios of copy number changes and depth of coverage by altering a real dataset with spiked-in CNAs. Comparison to alternative approaches using both real and simulated datasets showed that CNAseg achieves superior precision and improved sensitivity estimates. AVAILABILITY: The CNAseg package and test data are available at http://www.compbio.group.cam.ac.uk/software.html.


Assuntos
Algoritmos , Variações do Número de Cópias de DNA , Neoplasias/genética , Composição de Bases , Linhagem Celular Tumoral , Genoma Humano , Humanos , Mutação , Análise de Sequência de DNA
4.
Nucleic Acids Res ; 35(Web Server issue): W330-4, 2007 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-17567618

RESUMO

RNA Movies is a simple, yet powerful visualization tool in likeness to a media player application, which enables to browse sequential paths through RNA secondary structure landscapes. It can be used to visualize structural rearrangement processes of RNA, such as folding pathways and conformational switches, or to browse lists of alternative structure candidates. Besides extending the feature set, retaining and improving usability and availability in the web is the main aim of this new version. RNA Movies now supports the DCSE and RNAStructML input formats besides its own RNM format. Pseudoknots and 'entangled helices' can be superimposed on the RNA secondary structure layout. Publication quality output is provided through the Scalable Vector Graphics output format understood by most current drawing programs. The software has been completely re-implemented in Java to enable pure client-side operation as applet and web-start application available at the Bielefeld Bioinformatics Server http://bibiserv.techfak.uni-bielefeld.de/rnamovies.


Assuntos
Biologia Computacional/métodos , Apresentação de Dados , Natronobacterium/genética , Conformação de Ácido Nucleico , RNA/química , Algoritmos , Simulação por Computador , Internet , Multimídia , Linguagens de Programação , RNA Bacteriano/química , Interface Usuário-Computador
5.
Nat Biotechnol ; 29(10): 915-21, 2011 Sep 18.
Artigo em Inglês | MEDLINE | ID: mdl-21926975

RESUMO

Whole genome amplification by the multiple displacement amplification (MDA) method allows sequencing of DNA from single cells of bacteria that cannot be cultured. Assembling a genome is challenging, however, because MDA generates highly nonuniform coverage of the genome. Here we describe an algorithm tailored for short-read data from single cells that improves assembly through the use of a progressively increasing coverage cutoff. Assembly of reads from single Escherichia coli and Staphylococcus aureus cells captures >91% of genes within contigs, approaching the 95% captured from an assembly based on many E. coli cells. We apply this method to assemble a genome from a single cell of an uncultivated SAR324 clade of Deltaproteobacteria, a cosmopolitan bacterial lineage in the global ocean. Metabolic reconstruction suggests that SAR324 is aerobic, motile and chemotaxic. Our approach enables acquisition of genome assemblies for individual uncultivated bacteria using only short reads, providing cell-specific genetic information absent from metagenomic studies.


Assuntos
Bactérias/citologia , Bactérias/genética , Bases de Dados de Ácidos Nucleicos , Genoma Bacteriano/genética , Análise de Sequência de DNA/métodos , Análise de Célula Única/métodos , Algoritmos , Sequência de Bases , Mapeamento de Sequências Contíguas , Deltaproteobacteria/citologia , Deltaproteobacteria/genética , Escherichia coli/citologia , Escherichia coli/genética , Funções Verossimilhança , Staphylococcus aureus/citologia , Staphylococcus aureus/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA