RESUMO
Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude's prospective clinical genomics programs. Three interconnected apps within the ecosystem-Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community-enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. SIGNIFICANCE: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community.This article is highlighted in the In This Issue feature, p. 995.
Assuntos
Anemia Falciforme/genética , Computação em Nuvem , Genômica , Disseminação de Informação , Neoplasias/genética , Criança , Ecossistema , Hospitais Pediátricos , HumanosRESUMO
To evaluate the potential of an integrated clinical test to detect diverse classes of somatic and germline mutations relevant to pediatric oncology, we performed three-platform whole-genome (WGS), whole exome (WES) and transcriptome (RNA-Seq) sequencing of tumors and normal tissue from 78 pediatric cancer patients in a CLIA-certified, CAP-accredited laboratory. Our analysis pipeline achieves high accuracy by cross-validating variants between sequencing types, thereby removing the need for confirmatory testing, and facilitates comprehensive reporting in a clinically-relevant timeframe. Three-platform sequencing has a positive predictive value of 97-99, 99, and 91% for somatic SNVs, indels and structural variations, respectively, based on independent experimental verification of 15,225 variants. We report 240 pathogenic variants across all cases, including 84 of 86 known from previous diagnostic testing (98% sensitivity). Combined WES and RNA-Seq, the current standard for precision oncology, achieved only 78% sensitivity. These results emphasize the critical need for incorporating WGS in pediatric oncology testing.