Search | Nursing VHL Search Portal

Selectome update: quality control and computational improvements to a database of positive selection.

Moretti, Sébastien; Laurenczy, Balazs; Gharib, Walid H; Castella, Briséïs; Kuzniar, Arnold; Schabauer, Hannes; Studer, Romain A; Valle, Mario; Salamin, Nicolas; Stockinger, Heinz; Robinson-Rechavi, Marc.

Nucleic Acids Res ; 42(Database issue): D917-21, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24225318

ABSTRACT

Selectome (http://selectome.unil.ch/) is a database of positive selection, based on a branch-site likelihood test. This model estimates the number of nonsynonymous substitutions (dN) and synonymous substitutions (dS) to evaluate the variation in selective pressure (dN/dS ratio) over branches and over sites. Since the original release of Selectome, we have benchmarked and implemented a thorough quality control procedure on multiple sequence alignments, aiming to provide minimum false-positive results. We have also improved the computational efficiency of the branch-site test implementation, allowing larger data sets and more frequent updates. Release 6 of Selectome includes all gene trees from Ensembl for Primates and Glires, as well as a large set of vertebrate gene trees. A total of 6810 gene trees have some evidence of positive selection. Finally, the web interface has been improved to be more responsive and to facilitate searches and browsing.

Subject(s)

Databases, Nucleic Acid , Selection, Genetic , Genetic Variation , Genomics/standards , Humans , Internet , Quality Control , Sequence Alignment

gcodeml: a Grid-enabled tool for detecting positive selection in biological evolution.

Moretti, Sébastien; Murri, Riccardo; Maffioletti, Sergio; Kuzniar, Arnold; Castella, Briséïs; Salamin, Nicolas; Robinson-Rechavi, Marc; Stockinger, Heinz.

Stud Health Technol Inform ; 175: 59-68, 2012.

Article in English | MEDLINE | ID: mdl-22941988

ABSTRACT

One of the important questions in biological evolution is to know if certain changes along protein coding genes have contributed to the adaptation of species. This problem is known to be biologically complex and computationally very expensive. It, therefore, requires efficient Grid or cluster solutions to overcome the computational challenge. We have developed a Grid-enabled tool (gcodeml) that relies on the PAML (codeml) package to help analyse large phylogenetic datasets on both Grids and computational clusters. Although we report on results for gcodeml, our approach is applicable and customisable to related problems in biology or other scientific domains.

Subject(s)

Algorithms , DNA/genetics , Data Mining/methods , Databases, Genetic , Evolution, Molecular , Proteins/genetics , Sequence Analysis/methods , Software , User-Computer Interface

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL