Search | BVS Colombia Search Portal

Alignment of Next-Generation Sequencing Reads.

Reinert, Knut; Langmead, Ben; Weese, David; Evers, Dirk J.

Annu Rev Genomics Hum Genet ; 16: 133-51, 2015.

Article in English | MEDLINE | ID: mdl-25939052

ABSTRACT

High-throughput DNA sequencing has considerably changed the possibilities for conducting biomedical research by measuring billions of short DNA or RNA fragments. A central computational problem, and for many applications a first step, consists of determining where the fragments came from in the original genome. In this article, we review the main techniques for generating the fragments, the main applications, and the main algorithmic ideas for computing a solution to the read alignment problem. In addition, we describe pitfalls and difficulties connected to determining the correct positions of reads.

Subject(s)

Algorithms; High-Throughput Nucleotide Sequencing/methods; Sequence Alignment/methods; Sequence Analysis, DNA/methods; Genome; Polyploidy; Repetitive Sequences, Nucleic Acid; Software

Accurate whole human genome sequencing using reversible terminator chemistry.

Bentley, David R; Balasubramanian, Shankar; Swerdlow, Harold P; Smith, Geoffrey P; Milton, John; Brown, Clive G; Hall, Kevin P; Evers, Dirk J; Barnes, Colin L; Bignell, Helen R; Boutell, Jonathan M; Bryant, Jason; Carter, Richard J; Cheetham, R Keira; Cox, Anthony J; Ellis, Darren J; Flatbush, Michael R; Gormley, Niall A; Humphray, Sean J; Irving, Leslie J; Karbelashvili, Mirian S; Kirk, Scott M; Li, Heng; Liu, Xiaohai; Maisinger, Klaus S; Murray, Lisa J; Obradovic, Bojan; Ost, Tobias; Parkinson, Michael L; Pratt, Mark R; Rasolonjatovo, Isabelle M J; Reed, Mark T; Rigatti, Roberto; Rodighiero, Chiara; Ross, Mark T; Sabot, Andrea; Sankar, Subramanian V; Scally, Aylwyn; Schroth, Gary P; Smith, Mark E; Smith, Vincent P; Spiridou, Anastassia; Torrance, Peta E; Tzonev, Svilen S; Vermaas, Eric H; Walter, Klaudia; Wu, Xiaolin; Zhang, Lu; Alam, Mohammed D; Anastasi, Carole.

Nature ; 456(7218): 53-9, 2008 Nov 06.

Article in English | MEDLINE | ID: mdl-18987734

ABSTRACT

DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally used long (400-800 base pair) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re-sequencing, whereby shorter reads are compared to a reference to identify intraspecies genetic variation. Here we report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high-quality sequence. We demonstrate application of this approach to human genome sequencing on flow-sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from >30x average depth of paired 35-base reads. We characterize four million single-nucleotide polymorphisms and four hundred thousand structural variants, many of which were previously unknown. Our approach is effective for accurate, rapid and economical whole-genome re-sequencing and many other biomedical applications.

Subject(s)

Genome, Human/genetics; Genomics/methods; Sequence Analysis, DNA/methods; Chromosomes, Human, X/genetics; Consensus Sequence/genetics; Genomics/economics; Genotype; Humans; Male; Nigeria; Polymorphism, Single Nucleotide/genetics; Sensitivity and Specificity; Sequence Analysis, DNA/economics
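
The ">30x average depth" figure in the abstract above refers to how many reads cover each base on average. A minimal sketch of that arithmetic, assuming the usual definition (total sequenced bases divided by genome length); the function name and the input values are hypothetical and are not taken from the paper:

```python
def average_depth(num_reads: int, read_length: int, genome_length: int) -> float:
    """Average depth = total sequenced bases / genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return num_reads * read_length / genome_length


# Hypothetical toy values, not figures from the paper.
print(average_depth(num_reads=1_000, read_length=35, genome_length=7_000))
```

Because depth scales linearly with read count and read length, shorter reads need proportionally more reads to reach the same average depth.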

CNAseg--a novel framework for identification of copy number changes in cancer from second-generation sequencing data.

Ivakhno, Sergii; Royce, Tom; Cox, Anthony J; Evers, Dirk J; Cheetham, R Keira; Tavaré, Simon.

Bioinformatics ; 26(24): 3051-8, 2010 Dec 15.

Article in English | MEDLINE | ID: mdl-20966003

ABSTRACT

MOTIVATION: Copy number abnormalities (CNAs) represent an important type of genetic mutation that can lead to abnormal cell growth and proliferation. New high-throughput sequencing technologies promise comprehensive characterization of CNAs. In contrast to microarrays, where probe design follows a carefully developed protocol, reads represent a random sample from a library and may be prone to representation biases due to GC content and other factors. The discrimination between true and false positive CNAs becomes an important issue. RESULTS: We present a novel approach, called CNAseg, to identify CNAs from second-generation sequencing data. It uses depth of coverage to estimate copy number states and flowcell-to-flowcell variability in cancer and normal samples to control the false positive rate. We tested the method using the COLO-829 melanoma cell line sequenced to 40-fold coverage. An extensive simulation scheme was developed to recreate different scenarios of copy number changes and depth of coverage by altering a real dataset with spiked-in CNAs. Comparison to alternative approaches using both real and simulated datasets showed that CNAseg achieves superior precision and improved sensitivity estimates. AVAILABILITY: The CNAseg package and test data are available at http://www.compbio.group.cam.ac.uk/software.html.

Subject(s)

Algorithms; DNA Copy Number Variations; Neoplasms/genetics; Base Composition; Cell Line, Tumor; Genome, Human; Humans; Mutation; Sequence Analysis, DNA
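
The depth-of-coverage idea mentioned in the CNAseg abstract can be illustrated with a short sketch. This is not the CNAseg algorithm itself, whose details the abstract does not give. It only assumes lists of read start positions for a tumor sample and a matched normal sample, counts reads in fixed-size windows, and reports a library-size-normalized log2 ratio per window. All names, the window size and the pseudocount are hypothetical.

```python
import math
from collections import Counter


def window_counts(read_starts, window_size):
    """Count reads per fixed-size window, keyed by window index."""
    return Counter(pos // window_size for pos in read_starts)


def log2_ratios(tumor_starts, normal_starts, window_size=1000, pseudocount=0.5):
    """Per-window log2(tumor/normal) depth ratio, scaled by library size.

    Illustrative assumption: values near 0 suggest no copy number change,
    positive values a gain, negative values a loss.
    """
    tumor = window_counts(tumor_starts, window_size)
    normal = window_counts(normal_starts, window_size)
    # Scale tumor counts so samples with different total read counts compare.
    scale = max(len(normal_starts), 1) / max(len(tumor_starts), 1)
    ratios = {}
    for w in sorted(set(tumor) | set(normal)):
        t = tumor.get(w, 0) * scale + pseudocount  # pseudocount avoids log(0)
        n = normal.get(w, 0) + pseudocount
        ratios[w] = math.log2(t / n)
    return ratios


# Hypothetical toy positions: window 0 is enriched in the tumor sample.
print(log2_ratios([0, 1, 2, 3, 1000, 1001], [0, 1, 2, 1000, 1001, 1002]))
```

The sketch ignores the GC-content bias and the flowcell-to-flowcell variability that the abstract names as sources of false positives; a real caller would need to model both.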

RNA Movies 2: sequential animation of RNA secondary structures.

Kaiser, Alexander; Krüger, Jan; Evers, Dirk J.

Nucleic Acids Res ; 35(Web Server issue): W330-4, 2007 Jul.

Article in English | MEDLINE | ID: mdl-17567618

ABSTRACT

RNA Movies is a simple, yet powerful visualization tool in likeness to a media player application, which enables to browse sequential paths through RNA secondary structure landscapes. It can be used to visualize structural rearrangement processes of RNA, such as folding pathways and conformational switches, or to browse lists of alternative structure candidates. Besides extending the feature set, retaining and improving usability and availability in the web is the main aim of this new version. RNA Movies now supports the DCSE and RNAStructML input formats besides its own RNM format. Pseudoknots and 'entangled helices' can be superimposed on the RNA secondary structure layout. Publication quality output is provided through the Scalable Vector Graphics output format understood by most current drawing programs. The software has been completely re-implemented in Java to enable pure client-side operation as applet and web-start application available at the Bielefeld Bioinformatics Server http://bibiserv.techfak.uni-bielefeld.de/rnamovies.

Subject(s)

Computational Biology/methods; Data Display; Natronobacterium/genetics; Nucleic Acid Conformation; RNA/chemistry; Algorithms; Computer Simulation; Internet; Multimedia; Programming Languages; RNA, Bacterial/chemistry; User-Computer Interface

Efficient de novo assembly of single-cell bacterial genomes from short-read data sets.

Chitsaz, Hamidreza; Yee-Greenbaum, Joyclyn L; Tesler, Glenn; Lombardo, Mary-Jane; Dupont, Christopher L; Badger, Jonathan H; Novotny, Mark; Rusch, Douglas B; Fraser, Louise J; Gormley, Niall A; Schulz-Trieglaff, Ole; Smith, Geoffrey P; Evers, Dirk J; Pevzner, Pavel A; Lasken, Roger S.

Nat Biotechnol ; 29(10): 915-21, 2011 Sep 18.

Article in English | MEDLINE | ID: mdl-21926975

ABSTRACT

Whole genome amplification by the multiple displacement amplification (MDA) method allows sequencing of DNA from single cells of bacteria that cannot be cultured. Assembling a genome is challenging, however, because MDA generates highly nonuniform coverage of the genome. Here we describe an algorithm tailored for short-read data from single cells that improves assembly through the use of a progressively increasing coverage cutoff. Assembly of reads from single Escherichia coli and Staphylococcus aureus cells captures >91% of genes within contigs, approaching the 95% captured from an assembly based on many E. coli cells. We apply this method to assemble a genome from a single cell of an uncultivated SAR324 clade of Deltaproteobacteria, a cosmopolitan bacterial lineage in the global ocean. Metabolic reconstruction suggests that SAR324 is aerobic, motile and chemotaxic. Our approach enables acquisition of genome assemblies for individual uncultivated bacteria using only short reads, providing cell-specific genetic information absent from metagenomic studies.

Subject(s)

Bacteria/cytology; Bacteria/genetics; Databases, Nucleic Acid; Genome, Bacterial/genetics; Sequence Analysis, DNA/methods; Single-Cell Analysis/methods; Algorithms; Base Sequence; Contig Mapping; Deltaproteobacteria/cytology; Deltaproteobacteria/genetics; Escherichia coli/cytology; Escherichia coli/genetics; Likelihood Functions; Staphylococcus aureus/cytology; Staphylococcus aureus/genetics
