Búsqueda | Portal de Búsqueda de la BVS Colombia

Improving deconvolution methods in biology through open innovation competitions: an application to the connectivity map.

Blasco, Andrea; Natoli, Ted; Endres, Michael G; Sergeev, Rinat A; Randazzo, Steven; Paik, Jin H; Macaluso, N J Maximilian; Narayan, Rajiv; Lu, Xiaodong; Peck, David; Lakhani, Karim R; Subramanian, Aravind.

Bioinformatics ; 37(18): 2889-2895, 2021 09 29.

Artículo en Inglés | MEDLINE | ID: mdl-33824954

RESUMEN

MOTIVATION: Do machine learning methods improve standard deconvolution techniques for gene expression data? This article uses a unique new dataset combined with an open innovation competition to evaluate a wide range of approaches developed by 294 competitors from 20 countries. The competition's objective was to address a deconvolution problem critical to analyzing genetic perturbations from the Connectivity Map. The issue consists of separating gene expression of individual genes from raw measurements obtained from gene pairs. We evaluated the outcomes using ground-truth data (direct measurements for single genes) obtained from the same samples. RESULTS: We find that the top-ranked algorithm, based on random forest regression, beat the other methods in accuracy and reproducibility; more traditional gaussian-mixture methods performed well and tended to be faster, and the best deep learning approach yielded outcomes slightly inferior to the above methods. We anticipate researchers in the field will find the dataset and algorithms developed in this study to be a powerful research tool for benchmarking their deconvolution methods and a resource useful for multiple applications. AVAILABILITY AND IMPLEMENTATION: The data is freely available at clue.io/data (section Contests) and the software is on GitHub at https://github.com/cmap/gene_deconvolution_challenge. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Algoritmos , Programas Informáticos , Reproducibilidad de los Resultados , Bosques Aleatorios , Biología

Use of Crowd Innovation to Develop an Artificial Intelligence-Based Solution for Radiation Therapy Targeting.

Mak, Raymond H; Endres, Michael G; Paik, Jin H; Sergeev, Rinat A; Aerts, Hugo; Williams, Christopher L; Lakhani, Karim R; Guinan, Eva C.

JAMA Oncol ; 5(5): 654-661, 2019 May 01.

Artículo en Inglés | MEDLINE | ID: mdl-30998808

RESUMEN

IMPORTANCE: Radiation therapy (RT) is a critical cancer treatment, but the existing radiation oncologist work force does not meet growing global demand. One key physician task in RT planning involves tumor segmentation for targeting, which requires substantial training and is subject to significant interobserver variation. OBJECTIVE: To determine whether crowd innovation could be used to rapidly produce artificial intelligence (AI) solutions that replicate the accuracy of an expert radiation oncologist in segmenting lung tumors for RT targeting. DESIGN, SETTING, AND PARTICIPANTS: We conducted a 10-week, prize-based, online, 3-phase challenge (prizes totaled $55â¯000). A well-curated data set, including computed tomographic (CT) scans and lung tumor segmentations generated by an expert for clinical care, was used for the contest (CT scans from 461 patients; median 157 images per scan; 77â¯942 images in total; 8144 images with tumor present). Contestants were provided a training set of 229 CT scans with accompanying expert contours to develop their algorithms and given feedback on their performance throughout the contest, including from the expert clinician. MAIN OUTCOMES AND MEASURES: The AI algorithms generated by contestants were automatically scored on an independent data set that was withheld from contestants, and performance ranked using quantitative metrics that evaluated overlap of each algorithm's automated segmentations with the expert's segmentations. Performance was further benchmarked against human expert interobserver and intraobserver variation. RESULTS: A total of 564 contestants from 62 countries registered for this challenge, and 34 (6%) submitted algorithms. The automated segmentations produced by the top 5 AI algorithms, when combined using an ensemble model, had an accuracy (Dice coefficient = 0.79) that was within the benchmark of mean interobserver variation measured between 6 human experts. For phase 1, the top 7 algorithms had average custom segmentation scores (S scores) on the holdout data set ranging from 0.15 to 0.38, and suboptimal performance using relative measures of error. The average S scores for phase 2 increased to 0.53 to 0.57, with a similar improvement in other performance metrics. In phase 3, performance of the top algorithm increased by an additional 9%. Combining the top 5 algorithms from phase 2 and phase 3 using an ensemble model, yielded an additional 9% to 12% improvement in performance with a final S score reaching 0.68. CONCLUSIONS AND RELEVANCE: A combined crowd innovation and AI approach rapidly produced automated algorithms that replicated the skills of a highly trained physician for a critical task in radiation therapy. These AI algorithms could improve cancer care globally by transferring the skills of expert clinicians to under-resourced health care settings.

Asunto(s)

Inteligencia Artificial , Colaboración de las Masas , Invenciones , Neoplasias Pulmonares/diagnóstico por imagen , Neoplasias Pulmonares/radioterapia , Tomografía Computarizada por Rayos X , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Humanos , Neoplasias Pulmonares/patología , Masculino , Persona de Mediana Edad , Carga Tumoral

Advancing computational biology and bioinformatics research through open innovation competitions.

Blasco, Andrea; Endres, Michael G; Sergeev, Rinat A; Jonchhe, Anup; Macaluso, N J Maximilian; Narayan, Rajiv; Natoli, Ted; Paik, Jin H; Briney, Bryan; Wu, Chunlei; Su, Andrew I; Subramanian, Aravind; Lakhani, Karim R.

PLoS One ; 14(9): e0222165, 2019.

Artículo en Inglés | MEDLINE | ID: mdl-31560691

RESUMEN

Open data science and algorithm development competitions offer a unique avenue for rapid discovery of better computational strategies. We highlight three examples in computational biology and bioinformatics research in which the use of competitions has yielded significant performance gains over established algorithms. These include algorithms for antibody clustering, imputing gene expression data, and querying the Connectivity Map (CMap). Performance gains are evaluated quantitatively using realistic, albeit sanitized, data sets. The solutions produced through these competitions are then examined with respect to their utility and the prospects for implementation in the field. We present the decision process and competition design considerations that lead to these successful outcomes as a model for researchers who want to use competitions and non-domain crowds as collaborators to further their research.

Asunto(s)

Biología Computacional/tendencias , Algoritmos , Anticuerpos/clasificación , Anticuerpos/genética , Análisis por Conglomerados , Colaboración de las Masas/tendencias , Perfilación de la Expresión Génica/estadística & datos numéricos , Humanos , Invenciones/tendencias

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA