Pesquisa | Portal de Pesquisa da BVS

Scalable high-throughput identification of genetic targets by network filtering.

Bevilacqua, Vitoantonio; Pannarale, Paolo.

BMC Bioinformatics ; 14 Suppl 8: S5, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-23815162

RESUMO

BACKGROUND: Discovering the molecular targets of compounds or the cause of physiological conditions, among the multitude of known genes, is one of the major challenges of bioinformatics. One of the most common approaches to this problem is finding sets of differentially expressed, and more recently differentially co-expressed, genes. Other approaches require libraries of genetic mutants or require to perform a large number of assays. Another elegant approach is the filtering of mRNA expression profiles using reverse-engineered gene network models of the target cell. This approach has the advantage of not needing control samples, libraries or numerous assays. Nevertheless, the impementations of this strategy proposed so far are computationally demanding. Moreover the user has to arbitrarily choose a threshold on the number of potentially relevant genes from the algorithm output. RESULTS: Our solution, while performing comparably to state of the art algorithms in terms of discovered targets, is more efficient in terms of memory and time consumption. The proposed algorithm computes the likelihood associated to each gene and outputs to the user only the list of likely perturbed genes. CONCLUSIONS: The proposed algorithm is a valid alternative to existing algorithms and is particularly suited to contemporary gene expression microarrays, given the number of probe sets in each chip, also when executed on common desktop computers.

Assuntos

Algoritmos , Perfilação da Expressão Gênica/métodos , Software , Expressão Gênica/efeitos dos fármacos , Terapia de Alvo Molecular , Análise de Sequência com Séries de Oligonucleotídeos , Saccharomyces cerevisiae/genética

Comparison of data-merging methods with SVM attribute selection and classification in breast cancer gene expression.

Bevilacqua, Vitoantonio; Pannarale, Paolo; Abbrescia, Mirko; Cava, Claudia; Paradiso, Angelo; Tommasi, Stefania.

BMC Bioinformatics ; 13 Suppl 7: S9, 2012 May 08.

Artigo em Inglês | MEDLINE | ID: mdl-22595006

RESUMO

BACKGROUND: DNA microarray data are used to identify genes which could be considered prognostic markers. However, due to the limited sample size of each study, the signatures are unstable in terms of the composing genes and may be limited in terms of performances. It is therefore of great interest to integrate different studies, thus increasing sample size. RESULTS: In the past, several studies explored the issue of microarray data merging, but the arrival of new techniques and a focus on SVM based classification needed further investigation. We used distant metastasis prediction based on SVM attribute selection and classification to three breast cancer data sets. CONCLUSIONS: The results showed that breast cancer classification does not benefit from data merging, confirming the results found by other studies with different techniques.

Assuntos

Neoplasias da Mama/genética , Perfilação da Expressão Gênica/métodos , Máquina de Vetores de Suporte , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Metástase Neoplásica/genética , Análise de Sequência com Séries de Oligonucleotídeos

GIDL: a rule based expert system for GenBank Intelligent Data Loading into the Molecular Biodiversity Database.

Pannarale, Paolo; Catalano, Domenico; De Caro, Giorgio; Grillo, Giorgio; Leo, Pietro; Pappadà, Graziano; Rubino, Francesco; Scioscia, Gaetano; Licciulli, Flavio.

BMC Bioinformatics ; 13 Suppl 4: S4, 2012 Mar 28.

Artigo em Inglês | MEDLINE | ID: mdl-22536971

RESUMO

BACKGROUND: In the scientific biodiversity community, it is increasingly perceived the need to build a bridge between molecular and traditional biodiversity studies. We believe that the information technology could have a preeminent role in integrating the information generated by these studies with the large amount of molecular data we can find in bioinformatics public databases. This work is primarily aimed at building a bioinformatic infrastructure for the integration of public and private biodiversity data through the development of GIDL, an Intelligent Data Loader coupled with the Molecular Biodiversity Database. The system presented here organizes in an ontological way and locally stores the sequence and annotation data contained in the GenBank primary database. METHODS: The GIDL architecture consists of a relational database and of an intelligent data loader software. The relational database schema is designed to manage biodiversity information (Molecular Biodiversity Database) and it is organized in four areas: MolecularData, Experiment, Collection and Taxonomy. The MolecularData area is inspired to an established standard in Generic Model Organism Databases, the Chado relational schema. The peculiarity of Chado, and also its strength, is the adoption of an ontological schema which makes use of the Sequence Ontology. The Intelligent Data Loader (IDL) component of GIDL is an Extract, Transform and Load software able to parse data, to discover hidden information in the GenBank entries and to populate the Molecular Biodiversity Database. The IDL is composed by three main modules: the Parser, able to parse GenBank flat files; the Reasoner, which automatically builds CLIPS facts mapping the biological knowledge expressed by the Sequence Ontology; the DBFiller, which translates the CLIPS facts into ordered SQL statements used to populate the database. In GIDL Semantic Web technologies have been adopted due to their advantages in data representation, integration and processing. RESULTS AND CONCLUSIONS: Entries coming from Virus (814,122), Plant (1,365,360) and Invertebrate (959,065) divisions of GenBank rel.180 have been loaded in the Molecular Biodiversity Database by GIDL. Our system, combining the Sequence Ontology and the Chado schema, allows a more powerful query expressiveness compared with the most commonly used sequence retrieval systems like Entrez or SRS.

Assuntos

Biodiversidade , Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos , Sistemas Inteligentes , Animais , Internet , Software

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA