Búsqueda | Portal de Búsqueda de la BVS España

1.

Fast, accurate, and lightweight analysis of BS-treated reads with ERNE 2.

Prezza, Nicola; Vezzi, Francesco; Käller, Max; Policriti, Alberto.

BMC Bioinformatics ; 17 Suppl 4: 69, 2016 Mar 02.

Artículo en Inglés | MEDLINE | ID: mdl-26961371

RESUMEN

BACKGROUND: Bisulfite treatment of DNA followed by sequencing (BS-seq) has become a standard technique in epigenetic studies, providing researchers with tools for generating single-base resolution maps of whole methylomes. Aligning bisulfite-treated reads, however, is a computationally difficult task: bisulfite treatment decreases the (lexical) complexity of low-methylated genomic regions, and C-to-T mismatches may reflect cytosine unmethylation rather than SNPs or sequencing errors. Further challenges arise both during and after the alignment phase: data structures used by the aligner should be fast and should fit into main memory, and the methylation-caller output should be somehow compressed, due to its significant size. METHODS: As far as data structures employed to align bisulfite-treated reads are concerned, solutions proposed in the literature can be roughly grouped into two main categories: those storing pointers at each text position (e.g. hash tables, suffix trees/arrays), and those using the information-theoretic minimum number of bits (e.g. FM indexes and compressed suffix arrays). The former are fast and memory consuming. The latter are much slower and light. In this paper, we try to close this gap proposing a data structure for aligning bisulfite-treated reads which is at the same time fast, light, and very accurate. We reach this objective by combining a recent theoretical result on succinct hashing with a bisulfite-aware hash function. Furthermore, the new versions of the tools implementing our ideas|the aligner ERNE-BS5 2 and the caller ERNE-METH 2|have been extended with increased downstream compatibility (EPP/Bismark cov output formats), output compression, and support for target enrichment protocols. RESULTS: Experimental results on public and simulated WGBS libraries show that our algorithmic solution is a competitive tradeoff between hash-based and BWT-based indexes, being as fast and accurate as the former, and as memory-efficient as the latter. CONCLUSIONS: The new functionalities of our bisulfite aligner and caller make it a fast and memory efficient tool, useful to analyze big datasets with little computational resources, to easily process target enrichment data, and produce statistics such as protocol efficiency and coverage as a function of the distance from target regions.

Asunto(s)

Metilación de ADN , ADN/química , Epigenómica , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Sulfitos/química , Islas de CpG , Compresión de Datos , Genoma Humano , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos

2.

Fast randomized approximate string matching with succinct hash data structures.

Policriti, Alberto; Prezza, Nicola.

BMC Bioinformatics ; 16 Suppl 9: S4, 2015.

Artículo en Inglés | MEDLINE | ID: mdl-26051265

RESUMEN

BACKGROUND: The high throughput of modern NGS sequencers coupled with the huge sizes of genomes currently analysed, poses always higher algorithmic challenges to align short reads quickly and accurately against a reference sequence. A crucial, additional, requirement is that the data structures used should be light. The available modern solutions usually are a compromise between the mentioned constraints: in particular, indexes based on the Burrows-Wheeler transform offer reduced memory requirements at the price of lower sensitivity, while hash-based text indexes guarantee high sensitivity at the price of significant memory consumption. METHODS: In this work we describe a technique that permits to attain the advantages granted by both classes of indexes. This is achieved using Hamming-aware hash functions--hash functions designed to search the entire Hamming sphere in reduced time--which are also homomorphisms on de Bruijn graphs. We show that, using this particular class of hash functions, the corresponding hash index can be represented in linear space introducing only a logarithmic slowdown (in the query length) for the lookup operation. We point out that our data structure reaches its goals without compressing its input: another positive feature, as in biological applications data is often very close to be un-compressible. RESULTS: The new data structure introduced in this work is called dB-hash and we show how its implementation--BW-ERNE--maintains the high sensitivity and speed of its (hash-based) predecessor ERNE, while drastically reducing space consumption. Extensive comparison experiments conducted with several popular alignment tools on both simulated and real NGS data, show, finally, that BW-ERNE is able to attain both the positive features of succinct data structures (that is, small space) and hash indexes (that is, sensitivity). CONCLUSIONS: In applications where space and speed are both a concern, standard methods often sacrifice accuracy to obtain competitive throughputs and memory footprints. In this work we show that, combining hashing and succinct indexing techniques, we can attain good performances and accuracy with a memory footprint comparable to that of the most popular compressed indexes.

Asunto(s)

Algoritmos , Genoma Humano , Genoma de Planta , Análisis de Secuencia de ADN/métodos , Vitis/genética , Simulación por Computador , Humanos

3.

Chimera: a Bioconductor package for secondary analysis of fusion products.

Beccuti, Marco; Carrara, Matteo; Cordero, Francesca; Lazzarato, Fulvio; Donatelli, Susanna; Nadalin, Francesca; Policriti, Alberto; Calogero, Raffaele A.

Bioinformatics ; 30(24): 3556-7, 2014 Dec 15.

Artículo en Inglés | MEDLINE | ID: mdl-25286921

RESUMEN

SUMMARY: Chimera is a Bioconductor package that organizes, annotates, analyses and validates fusions reported by different fusion detection tools; current implementation can deal with output from bellerophontes, chimeraScan, deFuse, fusionCatcher, FusionFinder, FusionHunter, FusionMap, mapSplice, Rsubread, tophat-fusion and STAR. The core of Chimera is a fusion data structure that can store fusion events detected with any of the aforementioned tools. Fusions are then easily manipulated with standard R functions or through the set of functionalities specifically developed in Chimera with the aim of supporting the user in managing fusions and discriminating false-positive results.

Asunto(s)

Fusión Génica , Programas Informáticos , Animales , Anotación de Secuencia Molecular

4.

GAM-NGS: genomic assemblies merger for next generation sequencing.

Vicedomini, Riccardo; Vezzi, Francesco; Scalabrin, Simone; Arvestad, Lars; Policriti, Alberto.

BMC Bioinformatics ; 14 Suppl 7: S6, 2013.

Artículo en Inglés | MEDLINE | ID: mdl-23815503

RESUMEN

BACKGROUND: In recent years more than 20 assemblers have been proposed to tackle the hard task of assembling NGS data. A common heuristic when assembling a genome is to use several assemblers and then select the best assembly according to some criteria. However, recent results clearly show that some assemblers lead to better statistics than others on specific regions but are outperformed on other regions or on different evaluation measures. To limit these problems we developed GAM-NGS (Genomic Assemblies Merger for Next Generation Sequencing), whose primary goal is to merge two or more assemblies in order to enhance contiguity and correctness of both. GAM-NGS does not rely on global alignment: regions of the two assemblies representing the same genomic locus (called blocks) are identified through reads' alignments and stored in a weighted graph. The merging phase is carried out with the help of this weighted graph that allows an optimal resolution of local problematic regions. RESULTS: GAM-NGS has been tested on six different datasets and compared to other assembly reconciliation tools. The availability of a reference sequence for three of them allowed us to show how GAM-NGS is a tool able to output an improved reliable set of sequences. GAM-NGS is also a very efficient tool able to merge assemblies using substantially less computational resources than comparable tools. In order to achieve such goals, GAM-NGS avoids global alignment between contigs, making its strategy unique among other assembly reconciliation tools. CONCLUSIONS: The difficulty to obtain correct and reliable assemblies using a single assembler is forcing the introduction of new algorithms able to enhance de novo assemblies. GAM-NGS is a tool able to merge two or more assemblies in order to improve contiguity and correctness. It can be used on all NGS-based assembly projects and it shows its full potential with multi-library Illumina-based projects. With more than 20 available assemblers it is hard to select the best tool. In this context we propose a tool that improves assemblies (and, as a by-product, perhaps even assemblers) by merging them and selecting the generating that is most likely to be correct.

Asunto(s)

Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN/métodos , Algoritmos , Cromosomas/genética , Genoma Bacteriano , Genoma Humano , Humanos , Rhodobacter sphaeroides/genética , Programas Informáticos , Staphylococcus aureus/genética

5.

rNA: a fast and accurate short reads numerical aligner.

Vezzi, Francesco; Del Fabbro, Cristian; Tomescu, Alexandru I; Policriti, Alberto.

Bioinformatics ; 28(1): 123-4, 2012 Jan 01.

Artículo en Inglés | MEDLINE | ID: mdl-22084252

RESUMEN

SUMMARY: The advent of high-throughput sequencers (HTS) introduced the need of new tools in order to analyse the large amount of data that those machines are able to produce. The mandatory first step for a wide range of analyses is the alignment of the sequences against a reference genome. We present a major update to our rNA (randomized Numerical Aligner) tool. The main feature of rNA is the fact that it achieves an accuracy greater than the majority of other tools in a feasible amount of time. rNA executables and source codes are freely downloadable at http://iga-rna.sourceforge.net/. CONTACT: vezzi@appliedgenomics.org; delfabbro@appliedgenomics.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Alineación de Secuencia/métodos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Humanos

6.

The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla.

Jaillon, Olivier; Aury, Jean-Marc; Noel, Benjamin; Policriti, Alberto; Clepet, Christian; Casagrande, Alberto; Choisne, Nathalie; Aubourg, Sébastien; Vitulo, Nicola; Jubin, Claire; Vezzi, Alessandro; Legeai, Fabrice; Hugueney, Philippe; Dasilva, Corinne; Horner, David; Mica, Erica; Jublot, Delphine; Poulain, Julie; Bruyère, Clémence; Billault, Alain; Segurens, Béatrice; Gouyvenoux, Michel; Ugarte, Edgardo; Cattonaro, Federica; Anthouard, Véronique; Vico, Virginie; Del Fabbro, Cristian; Alaux, Michaël; Di Gaspero, Gabriele; Dumas, Vincent; Felice, Nicoletta; Paillard, Sophie; Juman, Irena; Moroldo, Marco; Scalabrin, Simone; Canaguier, Aurélie; Le Clainche, Isabelle; Malacrida, Giorgio; Durand, Eléonore; Pesole, Graziano; Laucou, Valérie; Chatelet, Philippe; Merdinoglu, Didier; Delledonne, Massimo; Pezzotti, Mario; Lecharny, Alain; Scarpelli, Claude; Artiguenave, François; Pè, M Enrico; Valle, Giorgio.

Nature ; 449(7161): 463-7, 2007 Sep 27.

Artículo en Inglés | MEDLINE | ID: mdl-17721507

RESUMEN

The analysis of the first plant genomes provided unexpected evidence for genome duplication events in species that had previously been considered as true diploids on the basis of their genetics. These polyploidization events may have had important consequences in plant evolution, in particular for species radiation and adaptation and for the modulation of functional capacities. Here we report a high-quality draft of the genome sequence of grapevine (Vitis vinifera) obtained from a highly homozygous genotype. The draft sequence of the grapevine genome is the fourth one produced so far for flowering plants, the second for a woody species and the first for a fruit crop (cultivated for both fruit and beverage). Grapevine was selected because of its important place in the cultural heritage of humanity beginning during the Neolithic period. Several large expansions of gene families with roles in aromatic features are observed. The grapevine genome has not undergone recent genome duplication, thus enabling the discovery of ancestral traits and features of the genetic organization of flowering plants. This analysis reveals the contribution of three ancestral genomes to the grapevine haploid content. This ancestral arrangement is common to many dicotyledonous plants but is absent from the genome of rice, which is a monocotyledon. Furthermore, we explain the chronology of previously described whole-genome duplication events in the evolution of flowering plants.

Asunto(s)

Evolución Molecular , Genoma de Planta/genética , Poliploidía , Vitis/clasificación , Vitis/genética , Arabidopsis/genética , ADN Intergénico/genética , Exones/genética , Genes de Plantas/genética , Intrones/genética , Cariotipificación , MicroARNs/genética , Datos de Secuencia Molecular , Oryza/genética , Populus/genética , ARN de Planta/genética , ARN de Transferencia/genética , Análisis de Secuencia de ADN

7.

GapFiller: a de novo assembly approach to fill the gap within paired reads.

Nadalin, Francesca; Vezzi, Francesco; Policriti, Alberto.

BMC Bioinformatics ; 13 Suppl 14: S8, 2012.

Artículo en Inglés | MEDLINE | ID: mdl-23095524

RESUMEN

BACKGROUND: Next Generation Sequencing technologies are able to provide high genome coverages at a relatively low cost. However, due to limited reads' length (from 30 bp up to 200 bp), specific bioinformatics problems have become even more difficult to solve. De novo assembly with short reads, for example, is more complicated at least for two reasons: first, the overall amount of "noisy" data to cope with increased and, second, as the reads' length decreases the number of unsolvable repeats grows. Our work's aim is to go at the root of the problem by providing a pre-processing tool capable to produce (in-silico) longer and highly accurate sequences from a collection of Next Generation Sequencing reads. RESULTS: In this paper a seed-and-extend local assembler is presented. The kernel algorithm is a loop that, starting from a read used as seed, keeps extending it using heuristics whose main goal is to produce a collection of error-free and longer sequences. In particular, GapFiller carefully detects reliable overlaps and operates clustering similar reads in order to reconstruct the missing part between the two ends of the same insert. Our tool's output has been validated on 24 experiments using both simulated and real paired reads datasets. The output sequences are declared correct when the seed-mate is found. In the experiments performed, GapFiller was able to extend high percentages of the processed seeds and find their mates, with a false positives rate that turned out to be nearly negligible. CONCLUSIONS: GapFiller, starting from a sufficiently high short reads coverage, is able to produce high coverages of accurate longer sequences (from 300 bp up to 3500 bp). The procedure to perform safe extensions, together with the mate-found check, turned out to be a powerful criterion to guarantee contigs' correctness. GapFiller has further potential, as it could be applied in a number of different scenarios, including the post-processing validation of insertions/deletions detection pipelines, pre-processing routines on datasets for de novo assembly pipelines, or in any hierarchical approach designed to assemble, analyse or validate pools of sequences.

Asunto(s)

Algoritmos , Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN/métodos , Mapeo Contig , Genoma , Genoma Humano , Humanos , Rhodobacter sphaeroides/genética , Programas Informáticos , Staphylococcus aureus/genética

8.

Reduction of heterozygosity (ROH) as a method to detect mosaic structural variation.

Marroni, Fabio; Scaglione, Davide; Pinosio, Sara; Policriti, Alberto; Miculan, Mara; Di Gaspero, Gabriele; Morgante, Michele.

Plant Biotechnol J ; 15(7): 791-793, 2017 07.

Artículo en Inglés | MEDLINE | ID: mdl-28055146

Asunto(s)

Tamización de Portadores Genéticos/métodos , Pérdida de Heterocigocidad , Mosaicismo , Vitis/genética , Frecuencia de los Genes , Haplotipos , Fenotipo , Pigmentación , Polimorfismo de Nucleótido Simple

9.

Correction: High throughput approaches reveal splicing of primary microRNA transcripts and tissue specific expression of mature microRNAs in Vitis vinifera.

Mica, Erica; Piccolo, Viviana; Delledonne, Massimo; Ferrarini, Alberto; Pezzotti, Mario; Casati, Cesare; Del Fabbro, Cristian; Valle, Giorgio; Policriti, Alberto; Morgante, Michele; Pesole, Graziano; Pè, M Enrico; Horner, David S.

BMC Genomics ; 11: 109, 2010 Feb 12.

Artículo en Inglés | MEDLINE | ID: mdl-20152027

RESUMEN

UNLABELLED: The version of this article published in BMC Genomics 2009, 10:558, contains data in Table 1 which are now known to be unreliable, and an illustration, in Figure 1, of unusual miRNA processing events predicted by these unreliable data. In this full-length correction, new data replace those found to be unreliable, leading to a more straightforward interpretation without altering the principle conclusions of the study. Table 1 and associated methods have been corrected, Figure 1 deleted, supplementary file 1 added, and modifications made to the sections "Deep sequencing of small RNAs from grapevine leaf tissue" and "Microarray analysis of miRNA expression". The editors and authors regret the inconvenience caused to readers by premature publication of the original paper. BACKGROUND: MicroRNAs are short (~21 base) single stranded RNAs that, in plants, are generally coded by specific genes and cleaved specifically from hairpin precursors. MicroRNAs are critical for the regulation of multiple developmental, stress related and other physiological processes in plants. The recent annotation of the genome of the grapevine (Vitis vinifera L.) allowed the identification of many putative conserved microRNA precursors, grouped into multiple gene families. RESULTS: Here we use oligonucleotide arrays to provide the first indication that many of these microRNAs show differential expression patterns between tissues and during the maturation of fruit in the grapevine. Furthermore we demonstrate that whole transcriptome sequencing and deep-sequencing of small RNA fractions can be used both to identify which microRNA precursors are expressed in different tissues and to estimate genomic coordinates and patterns of splicing and alternative splicing for many primary miRNA transcripts. CONCLUSIONS: Our results show that many microRNAs are differentially expressed in different tissues and during fruit maturation in the grapevine. Furthermore, the demonstration that whole transcriptome sequencing can be used to identify candidate splicing events and approximate primary microRNA transcript coordinates represents a significant step towards the large-scale elucidation of mechanisms regulating the expression of microRNAs at the transcriptional and post-transcriptional levels.

Asunto(s)

MicroARNs/genética , Empalme del ARN , Vitis/genética , ARN de Planta/genética , Análisis de Secuencia de ARN

10.

Physical mapping in highly heterozygous genomes: a physical contig map of the Pinot Noir grapevine cultivar.

Scalabrin, Simone; Troggio, Michela; Moroldo, Marco; Pindo, Massimo; Felice, Nicoletta; Coppola, Giuseppina; Prete, Giacomo; Malacarne, Giulia; Marconi, Raffaella; Faes, Giorgia; Jurman, Irena; Grando, Stella; Jesse, Taco; Segala, Cinzia; Valle, Giorgio; Policriti, Alberto; Fontana, Paolo; Morgante, Michele; Velasco, Riccardo.

BMC Genomics ; 11: 204, 2010 Mar 26.

Artículo en Inglés | MEDLINE | ID: mdl-20346114

RESUMEN

BACKGROUND: Most of the grapevine (Vitis vinifera L.) cultivars grown today are those selected centuries ago, even though grapevine is one of the most important fruit crops in the world. Grapevine has therefore not benefited from the advances in modern plant breeding nor more recently from those in molecular genetics and genomics: genes controlling important agronomic traits are practically unknown. A physical map is essential to positionally clone such genes and instrumental in a genome sequencing project. RESULTS: We report on the first whole genome physical map of grapevine built using high information content fingerprinting of 49,104 BAC clones from the cultivar Pinot Noir. Pinot Noir, as most grape varieties, is highly heterozygous at the sequence level. This resulted in the two allelic haplotypes sometimes assembling into separate contigs that had to be accommodated in the map framework or in local expansions of contig maps. We performed computer simulations to assess the effects of increasing levels of sequence heterozygosity on BAC fingerprint assembly and showed that the experimental assembly results are in full agreement with the theoretical expectations, given the heterozygosity levels reported for grape. The map is anchored to a dense linkage map consisting of 994 markers. 436 contigs are anchored to the genetic map, covering 342 of the 475 Mb that make up the grape haploid genome. CONCLUSIONS: We have developed a resource that makes it possible to access the grapevine genome, opening the way to a new era both in grape genetics and breeding and in wine making. The effects of heterozygosity on the assembly have been analyzed and characterized by using several complementary approaches which could be easily transferred to the study of other genomes which present the same features.

Asunto(s)

Vitis/clasificación , Vitis/genética , Mapeo Cromosómico , Genoma de Planta , Vino

11.

The SSR-based molecular profile of 1005 grapevine (Vitis vinifera L.) accessions uncovers new synonymy and parentages, and reveals a large admixture amongst varieties of different geographic origin.

Cipriani, Guido; Spadotto, Alessandro; Jurman, Irena; Di Gaspero, Gabriele; Crespan, Manna; Meneghetti, Stefano; Frare, Enrica; Vignani, Rita; Cresti, Mauro; Morgante, Michele; Pezzotti, Mario; Pe, Enrico; Policriti, Alberto; Testolin, Raffaele.

Theor Appl Genet ; 121(8): 1569-85, 2010 Nov.

Artículo en Inglés | MEDLINE | ID: mdl-20689905

RESUMEN

A collection of 1005 grapevine accessions was genotyped at 34 microsatellite loci (SSR) with the aim of analysing genetic diversity and exploring parentages. The comparison of molecular profiles revealed 200 groups of synonymy. The removal of perfect synonyms reduced the database to 745 unique genotypes, on which population genetic parameters were calculated. The analysis of kinship uncovered 74 complete pedigrees, with both parents identified. Many of these parentages were not previously known and are of considerable historical interest, e.g. Chenin blanc (Sauvignon × Traminer rot), Covè (Harslevelu selfed), Incrocio Manzoni 2-14 and 2-15 (Cabernet franc × Prosecco), Lagrein (Schiava gentile × Teroldego), Malvasia nera of Bolzano (Perera × Schiava gentile), Manzoni moscato (Raboso veronese × Moscato d'Amburgo), Moscato violetto (Moscato bianco × Duraguzza), Muscat of Alexandria (Muscat blanc à petit grain × Axina de tres bias) and others. Statistical robustness of unexpected pedigrees was reinforced with the analysis of an additional 7-30 SSRs. Grouping the accessions by profile resulted in a weak correlation with their geographical origin and/or current area of cultivation, revealing a large admixture of local varieties with those most widely cultivated, as a result of ancient commerce and population flow. The SSRs with tri- to penta-nucleotide repeats adopted for the present study showed a great capacity for discriminating amongst accessions, with probabilities of identity by chance as low as 1.45 × 10(-27) and 9.35 × 10(-12) for unrelated and full sib individuals, respectively. A database of allele frequencies and SSR profiles of 32 reference cultivars are provided.

Asunto(s)

Pool de Genes , Geografía , Repeticiones de Microsatélite/genética , Filogenia , Vitis/clasificación , Vitis/genética , Alelos , Cruzamientos Genéticos , Frecuencia de los Genes , Sitios Genéticos , Variación Genética , Genética de Población , Genotipo , Linaje , Filogeografía , Dinámica Poblacional , Análisis de Componente Principal

12.

Automated FingerPrint Background removal: FPB.

Scalabrin, Simone; Morgante, Michele; Policriti, Alberto.

BMC Bioinformatics ; 10: 127, 2009 Apr 30.

Artículo en Inglés | MEDLINE | ID: mdl-19405935

RESUMEN

BACKGROUND: The construction of a whole-genome physical map has been an essential component of numerous genome projects initiated since the inception of the Human Genome Project. Its usefulness has been proved for whole-genome shotgun projects as a post-assembly validation and recently it has also been used in the assembly step to constrain on BACs positions. Fingerprinting is usually the method of choice for construction of physical maps. A clone fingerprint is composed of true peaks representing real fragments and background peaks, mainly composed of E. coli genomic DNA, partial digestions, star activity by-products, and machine background. High-throughput fingerprinting leads to the production of thousands of BAC clone fingerprints per day. That is why background peaks removal has become an important issue and needs to be automatized, especially in capillary electrophoresis based fingerprints. RESULTS: At the moment, the only tools available for such a task are GenoProfiler and its descendant FPMiner. The large variation in the quality of fingerprints that is usually present in large fingerprinting projects represents a major difficulty in the correct removal of background peaks that has only been partially addressed by the methods so far adopted that all require a long manual optimization of parameters. Thus, we implemented a new data-independent tool, FPB (FingerPrint Background removal), suitable for large scale projects as well as mapping of few clones. CONCLUSION: FPB is freely available at http://www.appliedgenomics.org/tools.php. FPB was used to remove the background from all fingerprints of three grapevine physical map projects. The first project consists of about 50,000 fingerprints, the second one consists of about 70,000 fingerprints, and the third one consists of about 45,000 fingerprints. In all cases a successful assembly was built.

Asunto(s)

Biología Computacional/métodos , Mapeo Físico de Cromosoma/métodos , Mapeo Cromosómico , Cromosomas Artificiales Bacterianos , Bases de Datos Genéticas , Genoma Humano , Genómica/métodos , Proyecto Genoma Humano , Humanos

13.

High throughput approaches reveal splicing of primary microRNA transcripts and tissue specific expression of mature microRNAs in Vitis vinifera.

Mica, Erica; Piccolo, Viviana; Delledonne, Massimo; Ferrarini, Alberto; Pezzotti, Mario; Casati, Cesare; Del Fabbro, Cristian; Valle, Giorgio; Policriti, Alberto; Morgante, Michele; Pesole, Graziano; Pè, M Enrico; Horner, David S.

BMC Genomics ; 10: 558, 2009 Nov 25.

Artículo en Inglés | MEDLINE | ID: mdl-19939267

RESUMEN

BACKGROUND: MicroRNAs are short (approximately 21 base) single stranded RNAs that, in plants, are generally coded by specific genes and cleaved specifically from hairpin precursors. MicroRNAs are critical for the regulation of multiple developmental, stress related and other physiological processes in plants. The recent annotation of the genome of the grapevine (Vitis vinifera L.) allowed the identification of many putative conserved microRNA precursors, grouped into multiple gene families. RESULTS: Here we use oligonucleotide arrays to provide the first indication that many of these microRNAs show differential expression patterns between tissues and during the maturation of fruit in the grapevine. Furthermore we demonstrate that whole transcriptome sequencing and deep-sequencing of small RNA fractions can be used both to identify which microRNA precursors are expressed in different tissues and to estimate genomic coordinates and patterns of splicing and alternative splicing for many primary miRNA transcripts. CONCLUSION: Our results show that many microRNAs are differentially expressed in different tissues and during fruit maturation in the grapevine. Furthermore, the demonstration that whole transcriptome sequencing can be used to identify candidate splicing events and approximate primary microRNA transcript coordinates represents a significant step towards the large-scale elucidation of mechanisms regulating the expression of microRNAs at the transcriptional and post-transcriptional levels.

Asunto(s)

MicroARNs/genética , Análisis de Secuencia de ARN , Vitis/genética , Empalme Alternativo , Secuencia de Bases , Biología Computacional , Frutas/genética , Perfilación de la Expresión Génica , Datos de Secuencia Molecular , Conformación de Ácido Nucleico , Análisis de Secuencia por Matrices de Oligonucleótidos , Empalme del ARN , ARN de Planta/genética

14.

Structured motifs search.

Morgante, Michele; Policriti, Alberto; Vitacolonna, Nicola; Zuccolo, Andrea.

J Comput Biol ; 12(8): 1065-82, 2005 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-16241898

RESUMEN

In this paper, we describe an algorithm for the localization of structured models, i.e. sequences of (simple) motifs and distance constraints. It basically combines standard pattern matching procedures with a constraint satisfaction solver, and it has the ability, not present in similar tools, to search for partial matches. A significant feature of our approach, especially in terms of efficiency for the application context, is that the (potentially) exponentially many solutions to the considered problem are represented in compact form as a graph. Moreover, the time and space necessary to build the graph are linear in the number of occurrences of the component patterns.

Asunto(s)

Algoritmos , Secuencia de Bases , Análisis por Conglomerados , Reconocimiento de Normas Patrones Automatizadas , Análisis de Secuencia de ADN , Biología Computacional , Elementos Transponibles de ADN/genética , Genoma , Almacenamiento y Recuperación de la Información , Modelos Genéticos , Análisis Numérico Asistido por Computador , Alineación de Secuencia , Programas Informáticos

15.

On the impact of discreteness and abstractions on modelling noise in gene regulatory networks.

Bodei, Chiara; Bortolussi, Luca; Chiarugi, Davide; Guerriero, Maria Luisa; Policriti, Alberto; Romanel, Alessandro.

Comput Biol Chem ; 56: 98-108, 2015 Jun.

Artículo en Inglés | MEDLINE | ID: mdl-25909953

RESUMEN

In this paper, we explore the impact of different forms of model abstraction and the role of discreteness on the dynamical behaviour of a simple model of gene regulation where a transcriptional repressor negatively regulates its own expression. We first investigate the relation between a minimal set of parameters and the system dynamics in a purely discrete stochastic framework, with the twofold purpose of providing an intuitive explanation of the different behavioural patterns exhibited and of identifying the main sources of noise. Then, we explore the effect of combining hybrid approaches and quasi-steady state approximations on model behaviour (and simulation time), to understand to what extent dynamics and quantitative features such as noise intensity can be preserved.

Asunto(s)

Redes Reguladoras de Genes , Modelos Genéticos , Animales , Simulación por Computador , Regulación de la Expresión Génica , Humanos , Proteínas Represoras/genética , Procesos Estocásticos , Activación Transcripcional

16.

A sense of life: computational and experimental investigations with models of biochemical and evolutionary processes.

Mishra, Bud; Daruwala, Raoul-Sam; Zhou, Yi; Ugel, Nadia; Policriti, Alberto; Antoniotti, Marco; Paxia, Salvatore; Rejali, Marc; Rudra, Archisman; Cherepinsky, Vera; Silver, Naomi; Casey, William; Piazza, Carla; Simeoni, Marta; Barbano, Paolo; Spivak, Marina; Feng, Jiawu; Gill, Ofer; Venkatesh, Mysore; Cheng, Fang; Sun, Bing; Ioniata, Iuliana; Anantharaman, Thomas; Hubbard, E Jane Albert; Pnueli, Amir; Harel, David; Chandru, Vijay; Hariharan, Ramesh; Wigler, Michael; Park, Frank; Lin, Shih-Chieh; Lazebnik, Yuri; Winkler, Franz; Cantor, Charles R; Carbone, Alessandra; Gromov, Mikhael.

OMICS ; 7(3): 253-68, 2003.

Artículo en Inglés | MEDLINE | ID: mdl-14583115

RESUMEN

We collaborate in a research program aimed at creating a rigorous framework, experimental infrastructure, and computational environment for understanding, experimenting with, manipulating, and modifying a diverse set of fundamental biological processes at multiple scales and spatio-temporal modes. The novelty of our research is based on an approach that (i) requires coevolution of experimental science and theoretical techniques and (ii) exploits a certain universality in biology guided by a parsimonious model of evolutionary mechanisms operating at the genomic level and manifesting at the proteomic, transcriptomic, phylogenic, and other higher levels. Our current program in "systems biology" endeavors to marry large-scale biological experiments with the tools to ponder and reason about large, complex, and subtle natural systems. To achieve this ambitious goal, ideas and concepts are combined from many different fields: biological experimentation, applied mathematical modeling, computational reasoning schemes, and large-scale numerical and symbolic simulations. From a biological viewpoint, the basic issues are many: (i) understanding common and shared structural motifs among biological processes; (ii) modeling biological noise due to interactions among a small number of key molecules or loss of synchrony; (iii) explaining the robustness of these systems in spite of such noise; and (iv) cataloging multistatic behavior and adaptation exhibited by many biological processes.

Asunto(s)

Biología Computacional/métodos , Evolución Molecular , Modelos Biológicos , Animales , Bioquímica/métodos , Células/citología , Células/metabolismo , Humanos , Modelos Genéticos , Purinas/metabolismo , Programas Informáticos , Análisis de Sistemas

17.

Model building and model checking for biochemical processes.

Antoniotti, Marco; Policriti, Alberto; Ugel, Nadia; Mishra, Bud.

Cell Biochem Biophys ; 38(3): 271-86, 2003.

Artículo en Inglés | MEDLINE | ID: mdl-12794268

RESUMEN

A central claim of computational systems biology is that, by drawing on mathematical approaches developed in the context of dynamic systems, kinetic analysis, computational theory and logic, it is possible to create powerful simulation, analysis, and reasoning tools for working biologists to decipher existing data, devise new experiments, and ultimately to understand functional properties of genomes, proteomes, cells, organs, and organisms. In this article, a novel computational tool is described that achieves many of the goals of this new discipline. The novelty of this system involves an automaton-based semantics of the temporal evolution of complex biochemical reactions starting from the representation given as a set of differential equations. The related tools also provide ability to qualitatively reason about the systems using a propositional temporal logic that can express an ordered sequence of events succinctly and unambiguously. The implementation of mathematical and computational models in the Simpathica and XSSYS systems is described briefly. Several example applications of these systems to cellular and biochemical processes are presented: the two most prominent are Leibler et al.'s repressilator (an artificial synthesized oscillatory network), and Curto- Voit-Sorribas-Cascante's purine metabolism reaction model.

Asunto(s)

Biología Computacional/métodos , Modelos Biológicos , Fenómenos Bioquímicos , Bioquímica , Simulación por Computador , Humanos , Cinética , Cómputos Matemáticos , Modelos Teóricos , Purinas/metabolismo , Programas Informáticos

18.

The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution.

Verde, Ignazio; Abbott, Albert G; Scalabrin, Simone; Jung, Sook; Shu, Shengqiang; Marroni, Fabio; Zhebentyayeva, Tatyana; Dettori, Maria Teresa; Grimwood, Jane; Cattonaro, Federica; Zuccolo, Andrea; Rossini, Laura; Jenkins, Jerry; Vendramin, Elisa; Meisel, Lee A; Decroocq, Veronique; Sosinski, Bryon; Prochnik, Simon; Mitros, Therese; Policriti, Alberto; Cipriani, Guido; Dondini, Luca; Ficklin, Stephen; Goodstein, David M; Xuan, Pengfei; Del Fabbro, Cristian; Aramini, Valeria; Copetti, Dario; Gonzalez, Susana; Horner, David S; Falchi, Rachele; Lucas, Susan; Mica, Erica; Maldonado, Jonathan; Lazzari, Barbara; Bielenberg, Douglas; Pirona, Raul; Miculan, Mara; Barakat, Abdelali; Testolin, Raffaele; Stella, Alessandra; Tartarini, Stefano; Tonutti, Pietro; Arús, Pere; Orellana, Ariel; Wells, Christina; Main, Dorrie; Vizzotto, Giannina; Silva, Herman; Salamini, Francesco.

Nat Genet ; 45(5): 487-94, 2013 May.

Artículo en Inglés | MEDLINE | ID: mdl-23525075

RESUMEN

Rosaceae is the most important fruit-producing clade, and its key commercially relevant genera (Fragaria, Rosa, Rubus and Prunus) show broadly diverse growth habits, fruit types and compact diploid genomes. Peach, a diploid Prunus species, is one of the best genetically characterized deciduous trees. Here we describe the high-quality genome sequence of peach obtained from a completely homozygous genotype. We obtained a complete chromosome-scale assembly using Sanger whole-genome shotgun methods. We predicted 27,852 protein-coding genes, as well as noncoding RNAs. We investigated the path of peach domestication through whole-genome resequencing of 14 Prunus accessions. The analyses suggest major genetic bottlenecks that have substantially shaped peach genome diversity. Furthermore, comparative analyses showed that peach has not undergone recent whole-genome duplication, and even though the ancestral triplicated blocks in peach are fragmentary compared to those in grape, all seven paleosets of paralogs from the putative paleoancestor are detectable.

Asunto(s)

Agricultura , Evolución Biológica , Variación Genética , Genoma de Planta/genética , Prunus/genética , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Datos de Secuencia Molecular , Polímeros/metabolismo , Propanoles/metabolismo , Prunus/clasificación

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA