Your browser doesn't support javascript.
loading
An expanded evaluation of protein function prediction methods shows an improvement in accuracy.
Jiang, Yuxiang; Oron, Tal Ronnen; Clark, Wyatt T; Bankapur, Asma R; D'Andrea, Daniel; Lepore, Rosalba; Funk, Christopher S; Kahanda, Indika; Verspoor, Karin M; Ben-Hur, Asa; Koo, Da Chen Emily; Penfold-Brown, Duncan; Shasha, Dennis; Youngs, Noah; Bonneau, Richard; Lin, Alexandra; Sahraeian, Sayed M E; Martelli, Pier Luigi; Profiti, Giuseppe; Casadio, Rita; Cao, Renzhi; Zhong, Zhaolong; Cheng, Jianlin; Altenhoff, Adrian; Skunca, Nives; Dessimoz, Christophe; Dogan, Tunca; Hakala, Kai; Kaewphan, Suwisa; Mehryary, Farrokh; Salakoski, Tapio; Ginter, Filip; Fang, Hai; Smithers, Ben; Oates, Matt; Gough, Julian; Törönen, Petri; Koskinen, Patrik; Holm, Liisa; Chen, Ching-Tai; Hsu, Wen-Lian; Bryson, Kevin; Cozzetto, Domenico; Minneci, Federico; Jones, David T; Chapman, Samuel; Bkc, Dukka; Khan, Ishita K; Kihara, Daisuke; Ofer, Dan.
Affiliation
  • Jiang Y; Department of Computer Science and Informatics, Indiana University, Bloomington, IN, USA.
  • Oron TR; Buck Institute for Research on Aging, Novato, CA, USA.
  • Clark WT; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA.
  • Bankapur AR; Department of Microbiology, Miami University, Oxford, OH, USA.
  • D'Andrea D; University of Rome, La Sapienza, Rome, Italy.
  • Lepore R; University of Rome, La Sapienza, Rome, Italy.
  • Funk CS; Computational Bioscience Program, University of Colorado School of Medicine, Aurora, CO, USA.
  • Kahanda I; Department of Computer Science, Colorado State University, Fort Collins, CO, USA.
  • Verspoor KM; Department of Computing and Information Systems, University of Melbourne, Parkville, Victoria, Australia.
  • Ben-Hur A; Health and Biomedical Informatics Centre, University of Melbourne, Parkville, Victoria, Australia.
  • Koo da CE; Department of Computer Science, Colorado State University, Fort Collins, CO, USA.
  • Penfold-Brown D; Department of Biology, New York University, New York, NY, USA.
  • Shasha D; Social Media and Political Participation Lab, New York University, New York, NY, USA.
  • Youngs N; CY Data Science, New York, NY, USA.
  • Bonneau R; Department of Computer Science, New York University, New York, NY, USA.
  • Lin A; CY Data Science, New York, NY, USA.
  • Sahraeian SM; Department of Computer Science, New York University, New York, NY, USA.
  • Martelli PL; Simons Center for Data Analysis, New York, NY, USA.
  • Profiti G; Department of Computer Science, New York University, New York, NY, USA.
  • Casadio R; Simons Center for Data Analysis, New York, NY, USA.
  • Cao R; Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA.
  • Zhong Z; Department of Electrical Engineering and Computer Sciences, University of California Berkeley, Berkeley, CA, USA.
  • Cheng J; Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA.
  • Altenhoff A; Biocomputing Group, BiGeA, University of Bologna, Bologna, Italy.
  • Skunca N; Biocomputing Group, BiGeA, University of Bologna, Bologna, Italy.
  • Dessimoz C; Biocomputing Group, BiGeA, University of Bologna, Bologna, Italy.
  • Dogan T; Computer Science Department, University of Missouri, Columbia, MO, USA.
  • Hakala K; Computer Science Department, University of Missouri, Columbia, MO, USA.
  • Kaewphan S; Computer Science Department, University of Missouri, Columbia, MO, USA.
  • Mehryary F; ETH Zurich, Zurich, Switzerland.
  • Salakoski T; Swiss Institute of Bioinformatics, Zurich, Switzerland.
  • Ginter F; ETH Zurich, Zurich, Switzerland.
  • Fang H; Swiss Institute of Bioinformatics, Zurich, Switzerland.
  • Smithers B; Bioinformatics Group, Department of Computer Science, University College London, London, UK.
  • Oates M; University of Lausanne, Lausanne, Switzerland.
  • Gough J; Swiss Institute of Bioinformatics, Lausanne, Switzerland.
  • Törönen P; European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK.
  • Koskinen P; Department of Information Technology, University of Turku, Turku, Finland.
  • Holm L; University of Turku Graduate School, University of Turku, Turku, Finland.
  • Chen CT; Department of Information Technology, University of Turku, Turku, Finland.
  • Hsu WL; University of Turku Graduate School, University of Turku, Turku, Finland.
  • Bryson K; Turku Centre for Computer Science, Turku, Finland.
  • Cozzetto D; Department of Information Technology, University of Turku, Turku, Finland.
  • Minneci F; University of Turku Graduate School, University of Turku, Turku, Finland.
  • Jones DT; Department of Information Technology, University of Turku, Turku, Finland.
  • Chapman S; Turku Centre for Computer Science, Turku, Finland.
  • Bkc D; Department of Information Technology, University of Turku, Turku, Finland.
  • Khan IK; University of Bristol, Bristol, UK.
  • Kihara D; University of Bristol, Bristol, UK.
  • Ofer D; University of Bristol, Bristol, UK.
Genome Biol ; 17(1): 184, 2016 09 07.
Article de En | MEDLINE | ID: mdl-27604469
BACKGROUND: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. RESULTS: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. CONCLUSIONS: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent.
Sujet(s)
Mots clés

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Sujet principal: Relation structure-activité / Logiciel / Protéines / Biologie informatique Type d'étude: Prognostic_studies / Risk_factors_studies Limites: Humans Langue: En Journal: Genome Biol Sujet du journal: BIOLOGIA MOLECULAR / GENETICA Année: 2016 Type de document: Article Pays d'affiliation: États-Unis d'Amérique Pays de publication: Royaume-Uni

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Sujet principal: Relation structure-activité / Logiciel / Protéines / Biologie informatique Type d'étude: Prognostic_studies / Risk_factors_studies Limites: Humans Langue: En Journal: Genome Biol Sujet du journal: BIOLOGIA MOLECULAR / GENETICA Année: 2016 Type de document: Article Pays d'affiliation: États-Unis d'Amérique Pays de publication: Royaume-Uni