Búsqueda | Portal Regional de la BVS

Whole genome and exome sequencing reference datasets from a multi-center and cross-platform benchmark study.

Zhao, Yongmei; Fang, Li Tai; Shen, Tsai-Wei; Choudhari, Sulbha; Talsania, Keyur; Chen, Xiongfong; Shetty, Jyoti; Kriga, Yuliya; Tran, Bao; Zhu, Bin; Chen, Zhong; Chen, Wanqiu; Wang, Charles; Jaeger, Erich; Meerzaman, Daoud; Lu, Charles; Idler, Kenneth; Ren, Luyao; Zheng, Yuanting; Shi, Leming; Petitjean, Virginie; Sultan, Marc; Hung, Tiffany; Peters, Eric; Drabek, Jiri; Vojta, Petr; Maestro, Roberta; Gasparotto, Daniela; Kõks, Sulev; Reimann, Ene; Scherer, Andreas; Nordlund, Jessica; Liljedahl, Ulrika; Foox, Jonathan; Mason, Christopher E; Xiao, Chunlin; Hong, Huixiao; Xiao, Wenming.

Sci Data ; 8(1): 296, 2021 11 09.

Artículo en Inglés | MEDLINE | ID: mdl-34753956

RESUMEN

With the rapid advancement of sequencing technologies, next generation sequencing (NGS) analysis has been widely applied in cancer genomics research. More recently, NGS has been adopted in clinical oncology to advance personalized medicine. Clinical applications of precision oncology require accurate tests that can distinguish tumor-specific mutations from artifacts introduced during NGS processes or data analysis. Therefore, there is an urgent need to develop best practices in cancer mutation detection using NGS and the need for standard reference data sets for systematically measuring accuracy and reproducibility across platforms and methods. Within the SEQC2 consortium context, we established paired tumor-normal reference samples and generated whole-genome (WGS) and whole-exome sequencing (WES) data using sixteen library protocols, seven sequencing platforms at six different centers. We systematically interrogated somatic mutations in the reference samples to identify factors affecting detection reproducibility and accuracy in cancer genomes. These large cross-platform/site WGS and WES datasets using well-characterized reference samples will represent a powerful resource for benchmarking NGS technologies, bioinformatics pipelines, and for the cancer genomics studies.

Asunto(s)

Secuenciación del Exoma , Genoma Humano , Neoplasias/genética , Secuenciación Completa del Genoma , Benchmarking , Línea Celular Tumoral , Biología Computacional , Genómica , Humanos , Medicina de Precisión

Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing.

Xiao, Wenming; Ren, Luyao; Chen, Zhong; Fang, Li Tai; Zhao, Yongmei; Lack, Justin; Guan, Meijian; Zhu, Bin; Jaeger, Erich; Kerrigan, Liz; Blomquist, Thomas M; Hung, Tiffany; Sultan, Marc; Idler, Kenneth; Lu, Charles; Scherer, Andreas; Kusko, Rebecca; Moos, Malcolm; Xiao, Chunlin; Sherry, Stephen T; Abaan, Ogan D; Chen, Wanqiu; Chen, Xin; Nordlund, Jessica; Liljedahl, Ulrika; Maestro, Roberta; Polano, Maurizio; Drabek, Jiri; Vojta, Petr; Kõks, Sulev; Reimann, Ene; Madala, Bindu Swapna; Mercer, Timothy; Miller, Chris; Jacob, Howard; Truong, Tiffany; Moshrefi, Ali; Natarajan, Aparna; Granat, Ana; Schroth, Gary P; Kalamegham, Rasika; Peters, Eric; Petitjean, Virginie; Walton, Ashley; Shen, Tsai-Wei; Talsania, Keyur; Vera, Cristobal Juan; Langenbach, Kurt; de Mars, Maryellen; Hipp, Jennifer A.

Nat Biotechnol ; 39(9): 1141-1150, 2021 09.

Artículo en Inglés | MEDLINE | ID: mdl-34504346

RESUMEN

Clinical applications of precision oncology require accurate tests that can distinguish true cancer-specific mutations from errors introduced at each step of next-generation sequencing (NGS). To date, no bulk sequencing study has addressed the effects of cross-site reproducibility, nor the biological, technical and computational factors that influence variant identification. Here we report a systematic interrogation of somatic mutations in paired tumor-normal cell lines to identify factors affecting detection reproducibility and accuracy at six different centers. Using whole-genome sequencing (WGS) and whole-exome sequencing (WES), we evaluated the reproducibility of different sample types with varying input amount and tumor purity, and multiple library construction protocols, followed by processing with nine bioinformatics pipelines. We found that read coverage and callers affected both WGS and WES reproducibility, but WES performance was influenced by insert fragment size, genomic copy content and the global imbalance score (GIV; G > T/C > A). Finally, taking into account library preparation protocol, tumor content, read coverage and bioinformatics processes concomitantly, we recommend actionable practices to improve the reproducibility and accuracy of NGS experiments for cancer mutation detection.

Asunto(s)

Benchmarking , Secuenciación del Exoma/normas , Neoplasias/genética , Análisis de Secuencia de ADN/normas , Secuenciación Completa del Genoma/normas , Línea Celular , Línea Celular Tumoral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Mutación , Neoplasias/patología , Reproducibilidad de los Resultados

Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing.

Fang, Li Tai; Zhu, Bin; Zhao, Yongmei; Chen, Wanqiu; Yang, Zhaowei; Kerrigan, Liz; Langenbach, Kurt; de Mars, Maryellen; Lu, Charles; Idler, Kenneth; Jacob, Howard; Zheng, Yuanting; Ren, Luyao; Yu, Ying; Jaeger, Erich; Schroth, Gary P; Abaan, Ogan D; Talsania, Keyur; Lack, Justin; Shen, Tsai-Wei; Chen, Zhong; Stanbouly, Seta; Tran, Bao; Shetty, Jyoti; Kriga, Yuliya; Meerzaman, Daoud; Nguyen, Cu; Petitjean, Virginie; Sultan, Marc; Cam, Margaret; Mehta, Monika; Hung, Tiffany; Peters, Eric; Kalamegham, Rasika; Sahraeian, Sayed Mohammad Ebrahim; Mohiyuddin, Marghoob; Guo, Yunfei; Yao, Lijing; Song, Lei; Lam, Hugo Y K; Drabek, Jiri; Vojta, Petr; Maestro, Roberta; Gasparotto, Daniela; Kõks, Sulev; Reimann, Ene; Scherer, Andreas; Nordlund, Jessica; Liljedahl, Ulrika; Jensen, Roderick V.

Nat Biotechnol ; 39(9): 1151-1160, 2021 09.

Artículo en Inglés | MEDLINE | ID: mdl-34504347

RESUMEN

The lack of samples for generating standardized DNA datasets for setting up a sequencing pipeline or benchmarking the performance of different algorithms limits the implementation and uptake of cancer genomics. Here, we describe reference call sets obtained from paired tumor-normal genomic DNA (gDNA) samples derived from a breast cancer cell line-which is highly heterogeneous, with an aneuploid genome, and enriched in somatic alterations-and a matched lymphoblastoid cell line. We partially validated both somatic mutations and germline variants in these call sets via whole-exome sequencing (WES) with different sequencing platforms and targeted sequencing with >2,000-fold coverage, spanning 82% of genomic regions with high confidence. Although the gDNA reference samples are not representative of primary cancer cells from a clinical sample, when setting up a sequencing pipeline, they not only minimize potential biases from technologies, assays and informatics but also provide a unique resource for benchmarking 'tumor-only' or 'matched tumor-normal' analyses.

Asunto(s)

Benchmarking , Neoplasias de la Mama/genética , Análisis Mutacional de ADN/normas , Secuenciación de Nucleótidos de Alto Rendimiento/normas , Secuenciación Completa del Genoma/normas , Línea Celular Tumoral , Conjuntos de Datos como Asunto , Células Germinativas , Humanos , Mutación , Estándares de Referencia , Reproducibilidad de los Resultados

A comprehensive assessment of RNA-seq protocols for degraded and low-quantity samples.

Schuierer, Sven; Carbone, Walter; Knehr, Judith; Petitjean, Virginie; Fernandez, Anita; Sultan, Marc; Roma, Guglielmo.

BMC Genomics ; 18(1): 442, 2017 06 05.

Artículo en Inglés | MEDLINE | ID: mdl-28583074

RESUMEN

BACKGROUND: RNA-sequencing (RNA-seq) has emerged as one of the most sensitive tool for gene expression analysis. Among the library preparation methods available, the standard poly(A) + enrichment provides a comprehensive, detailed, and accurate view of polyadenylated RNAs. However, on samples of suboptimal quality ribosomal RNA depletion and exon capture methods have recently been reported as better alternatives. METHODS: We compared for the first time three commercial Illumina library preparation kits (TruSeq Stranded mRNA, TruSeq Ribo-Zero rRNA Removal, and TruSeq RNA Access) as representatives of these three different approaches using well-established human reference RNA samples from the MAQC/SEQC consortium on a wide range of input amounts (from 100 ng down to 1 ng) and degradation levels (intact, degraded, and highly degraded). RESULTS: We assessed the accuracy of the generated expression values by comparison to gold standard TaqMan qPCR measurements and gained unprecedented insight into the limits of applicability in terms of input quantity and sample quality of each protocol. We found that each protocol generates highly reproducible results (R 2 > 0.92) on intact RNA samples down to input amounts of 10 ng. For degraded RNA samples, Ribo-Zero showed clear performance advantages over the other two protocols as it generated more accurate and better reproducible gene expression results even at very low input amounts such as 1 ng and 2 ng. For highly degraded RNA samples, RNA Access performed best generating reliable data down to 5 ng input. CONCLUSIONS: We found that the ribosomal RNA depletion protocol from Illumina works very well at amounts far below recommendation and over a good range of intact and degraded material. We also infer that the exome-capture protocol (RNA Access, Illumina) performs better than other methods on highly degraded and low amount samples.

Asunto(s)

Análisis de Secuencia de ARN/métodos , Humanos , Control de Calidad , Estabilidad del ARN , ARN Mensajero/química , ARN Mensajero/genética , ARN Mensajero/metabolismo , Alineación de Secuencia , Polimerasa Taq/metabolismo

High-resolution chemical dissection of a model eukaryote reveals targets, pathways and gene functions.

Hoepfner, Dominic; Helliwell, Stephen B; Sadlish, Heather; Schuierer, Sven; Filipuzzi, Ireos; Brachat, Sophie; Bhullar, Bhupinder; Plikat, Uwe; Abraham, Yann; Altorfer, Marc; Aust, Thomas; Baeriswyl, Lukas; Cerino, Raffaele; Chang, Lena; Estoppey, David; Eichenberger, Juerg; Frederiksen, Mathias; Hartmann, Nicole; Hohendahl, Annika; Knapp, Britta; Krastel, Philipp; Melin, Nicolas; Nigsch, Florian; Oakeley, Edward J; Petitjean, Virginie; Petersen, Frank; Riedl, Ralph; Schmitt, Esther K; Staedtler, Frank; Studer, Christian; Tallarico, John A; Wetzel, Stefan; Fishman, Mark C; Porter, Jeffrey A; Movva, N Rao.

Microbiol Res ; 169(2-3): 107-20, 2014.

Artículo en Inglés | MEDLINE | ID: mdl-24360837

RESUMEN

Due to evolutionary conservation of biology, experimental knowledge captured from genetic studies in eukaryotic model organisms provides insight into human cellular pathways and ultimately physiology. Yeast chemogenomic profiling is a powerful approach for annotating cellular responses to small molecules. Using an optimized platform, we provide the relative sensitivities of the heterozygous and homozygous deletion collections for nearly 1800 biologically active compounds. The data quality enables unique insights into pathways that are sensitive and resistant to a given perturbation, as demonstrated with both known and novel compounds. We present examples of novel compounds that inhibit the therapeutically relevant fatty acid synthase and desaturase (Fas1p and Ole1p), and demonstrate how the individual profiles facilitate hypothesis-driven experiments to delineate compound mechanism of action. Importantly, the scale and diversity of tested compounds yields a dataset where the number of modulated pathways approaches saturation. This resource can be used to map novel biological connections, and also identify functions for unannotated genes. We validated hypotheses generated by global two-way hierarchical clustering of profiles for (i) novel compounds with a similar mechanism of action acting upon microtubules or vacuolar ATPases, and (ii) an un-annotated ORF, YIL060w, that plays a role in respiration in the mitochondria. Finally, we identify and characterize background mutations in the widely used yeast deletion collection which should improve the interpretation of past and future screens throughout the community. This comprehensive resource of cellular responses enables the expansion of our understanding of eukaryotic pathway biology.

Asunto(s)

Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/genética , Antifúngicos/farmacología , Vías Biosintéticas , Farmacorresistencia Fúngica , Regulación Fúngica de la Expresión Génica , Ensayos Analíticos de Alto Rendimiento , Datos de Secuencia Molecular , Filogenia , Saccharomyces cerevisiae/clasificación , Saccharomyces cerevisiae/efectos de los fármacos , Proteínas de Saccharomyces cerevisiae/metabolismo

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA