Your browser doesn't support javascript.
loading
Open-access synthetic spike-in mRNA-seq data for cancer gene fusions.
Tembe, Waibhav D; Pond, Stephanie J K; Legendre, Christophe; Chuang, Han-Yu; Liang, Winnie S; Kim, Nancy E; Montel, Valerie; Wong, Shukmei; McDaniel, Timothy K; Craig, David W; Carpten, John D.
Afiliación
  • Tembe WD; Translational Genomics Research Institute (TGen), 445 N 5th Street, SUITE 600, Phoenix, AZ 85004, USA. wtembe@tgen.org.
BMC Genomics ; 15: 824, 2014 Sep 30.
Article en En | MEDLINE | ID: mdl-25266161
BACKGROUND: Oncogenic fusion genes underlie the mechanism of several common cancers. Next-generation sequencing based RNA-seq analyses have revealed an increasing number of recurrent fusions in a variety of cancers. However, absence of a publicly available gene-fusion focused RNA-seq data impedes comparative assessment and collaborative development of novel gene fusions detection algorithms. We have generated nine synthetic poly-adenylated RNA transcripts that correspond to previously reported oncogenic gene fusions. These synthetic RNAs were spiked at known molarity over a wide range into total RNA prior to construction of next-generation sequencing mRNA libraries to generate RNA-seq data. RESULTS: Leveraging a priori knowledge about replicates and molarity of each synthetic fusion transcript, we demonstrate utility of this dataset to compare multiple gene fusion algorithms' detection ability. In general, more fusions are detected at higher molarity, indicating that our constructs performed as expected. However, systematic detection differences are observed based on molarity or algorithm-specific characteristics. Fusion-sequence specific detection differences indicate that for applications where specific sequences are being investigated, additional constructs may be added to provide quantitative data that is specific for the sequence of interest. CONCLUSIONS: To our knowledge, this is the first publicly available synthetic RNA-seq data that specifically leverages known cancer gene-fusions. The proposed method of designing multiple gene-fusion constructs over a wide range of molarity allows granular performance analyses of multiple fusion-detection algorithms. The community can leverage and augment this publicly available data to further collaborative development of analytical tools and performance assessment frameworks for gene fusions from next-generation sequencing data.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Análisis de Secuencia de ARN / Fusión Génica / Genes Relacionados con las Neoplasias / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Humans Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2014 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Análisis de Secuencia de ARN / Fusión Génica / Genes Relacionados con las Neoplasias / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Humans Idioma: En Revista: BMC Genomics Asunto de la revista: GENETICA Año: 2014 Tipo del documento: Article País de afiliación: Estados Unidos