Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
1.
Cancer Discov ; 11(5): 1082-1099, 2021 05.
Article in English | MEDLINE | ID: mdl-33408242

ABSTRACT

Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude's prospective clinical genomics programs. Three interconnected apps within the ecosystem-Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community-enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. SIGNIFICANCE: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community.This article is highlighted in the In This Issue feature, p. 995.


Subject(s)
Anemia, Sickle Cell/genetics , Cloud Computing , Genomics , Information Dissemination , Neoplasms/genetics , Child , Ecosystem , Hospitals, Pediatric , Humans
2.
Bioinformatics ; 22(7): 837-42, 2006 Apr 01.
Article in English | MEDLINE | ID: mdl-16428263

ABSTRACT

MOTIVATION: Given a large set of potential features, such as the set of all gene-expression values from a microarray, it is necessary to find a small subset with which to classify. The task of finding an optimal feature set of a given size is inherently combinatoric because to assure optimality all feature sets of a given size must be checked. Thus, numerous suboptimal feature-selection algorithms have been proposed. There are strong impediments to evaluate feature-selection algorithms using real data when data are limited, a common situation in genetic classification. The difficulty is compound. First, there are no class-conditional distributions from which to draw data points, only a single small labeled sample. Second, there are no test data with which to estimate the feature-set errors, and one must depend on a training-data-based error estimator. Finally, there is no optimal feature set with which to compare the feature sets found by the algorithms. RESULTS: This paper describes a genetic test bed for the evaluation of feature-selection algorithms. It begins with a large biological feature-label dataset that is used as an empirical distribution and, using massively parallel computation, finds the top feature sets of various sizes based on a given sample size and classification rule. The user can draw random samples from the data, apply a proposed algorithm, and evaluate the proficiency of the proposed algorithm via three different measures (code provided). A key feature of the test bed is that, once a dataset is input, a single command creates the entire test bed relative to the dataset. The particular dataset used for the first version of the test bed comes from a microarray-based classification study that analyzes a large number of microarrays, prepared with RNA from breast tumor samples from each of 295 patients. AVAILABILITY: The software and supplementary material are available at http://public.tgen.org/tgen-cb/support/testbed/ CONTACT: edward@ece.tamu.edu.


Subject(s)
Algorithms , Computer Simulation , Gene Expression Profiling/methods , Breast Neoplasms , Data Collection , Databases, Genetic , Female , Humans , Models, Statistical , Oligonucleotide Array Sequence Analysis/methods
3.
Genome Res ; 13(10): 2341-7, 2003 Oct.
Article in English | MEDLINE | ID: mdl-14525932

ABSTRACT

RNA interference (RNAi) mediated by small interfering RNAs (siRNAs) is a powerful new tool for analyzing gene knockdown phenotypes in living mammalian cells. To facilitate large-scale, high-throughput functional genomics studies using RNAi, we have developed a microarray-based technology for highly parallel analysis. Specifically, siRNAs in a transfection matrix were first arrayed on glass slides, overlaid with a monolayer of adherent cells, incubated to allow reverse transfection, and assessed for the effects of gene silencing by digital image analysis at a single cell level. Validation experiments with HeLa cells stably expressing GFP showed spatially confined, sequence-specific, time- and dose-dependent inhibition of green fluorescence for those cells growing directly on microspots containing siRNA targeting the GFP sequence. Microarray-based siRNA transfections analyzed with a custom-made quantitative image analysis system produced results that were identical to those from traditional well-based transfection, quantified by flow cytometry. Finally, to integrate experimental details, image analysis, data display, and data archiving, we developed a prototype information management system for high-throughput cell-based analyses. In summary, this RNAi microarray platform, together with ongoing efforts to develop large-scale human siRNA libraries, should facilitate genomic-scale cell-based analyses of gene function.


Subject(s)
Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/methods , RNA, Small Interfering/genetics , Cell Line, Tumor , Gene Expression Regulation, Neoplastic/genetics , Green Fluorescent Proteins , HeLa Cells , Humans , Luminescent Proteins/biosynthesis , Luminescent Proteins/genetics , RNA Interference , RNA Probes/genetics , Transfection/methods
SELECTION OF CITATIONS
SEARCH DETAIL