Search | BVS Violência e Saúde

VarSAn: associating pathways with a set of genomic variants using network analysis.

Xie, Xiaoman; Kendzior, Matthew C; Ge, Xiyu; Mainzer, Liudmila S; Sinha, Saurabh.

Nucleic Acids Res ; 49(15): 8471-8487, 2021 09 07.

Article in English | MEDLINE | ID: mdl-34313777

ABSTRACT

There is a pressing need today to mechanistically interpret sets of genomic variants associated with diseases. Here we present a tool called 'VarSAn' that uses a network analysis algorithm to identify pathways relevant to a given set of variants. VarSAn analyzes a configurable network whose nodes represent variants, genes and pathways, using a Random Walk with Restarts algorithm to rank pathways for relevance to the given variants, and reports P-values for pathway relevance. It treats non-coding and coding variants differently, properly accounts for the number of pathways impacted by each variant and identifies relevant pathways even if many variants do not directly impact genes of the pathway. We use VarSAn to identify pathways relevant to variants related to cancer and several other diseases, as well as drug response variation. We find VarSAn's pathway ranking to be complementary to the standard approach of enrichment tests on genes related to the query set. We adopt a novel benchmarking strategy to quantify its advantage over this baseline approach. Finally, we use VarSAn to discover key pathways, including the VEGFA-VEGFR2 pathway, related to de novo variants in patients of Hypoplastic Left Heart Syndrome, a rare and severe congenital heart defect.
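The abstract names a Random Walk with Restarts over a network of variants, genes and pathways but does not give the details. The following is only a minimal sketch of the general technique, not VarSAn's implementation: the toy network, the node names (`v1`, `g1`, `P1`, ...), the restart probability of 0.5 and the function name are all hypothetical, and the sketch omits the P-value computation and the separate handling of coding and non-coding variants that the abstract describes.

```python
def random_walk_with_restart(edges, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Return stationary visiting probabilities on an undirected, unweighted graph."""
    # Build an adjacency map from the edge list.
    neighbors = {}
    for u, v in edges:
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)
    nodes = sorted(neighbors)

    missing = [s for s in seeds if s not in neighbors]
    if not seeds or missing:
        raise ValueError(f"seeds must be non-empty and present in the graph: {missing}")

    # Restart distribution: uniform over the query (seed) nodes.
    seed_set = set(seeds)
    p0 = {n: (1.0 / len(seed_set) if n in seed_set else 0.0) for n in nodes}
    p = dict(p0)

    for _ in range(max_iter):
        # With probability `restart` jump back to a seed; otherwise
        # move to a uniformly chosen neighbor of the current node.
        nxt = {n: restart * p0[n] for n in nodes}
        for u in nodes:
            share = (1.0 - restart) * p[u] / len(neighbors[u])
            for v in neighbors[u]:
                nxt[v] += share
        delta = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if delta < tol:
            break
    return p


# Hypothetical toy network: variants (v*), genes (g*), pathways (P*).
toy_edges = [
    ("v1", "g1"), ("v2", "g2"),   # variant -> gene it affects
    ("g1", "P1"), ("g2", "P1"),   # gene -> pathway membership
    ("g3", "P2"), ("g2", "g3"),   # P2 is reachable only through a gene-gene link
]
scores = random_walk_with_restart(toy_edges, seeds=["v1", "v2"])
ranked = sorted(["P1", "P2"], key=lambda n: scores[n], reverse=True)
print(ranked)
```

In this toy network, P2 contains no gene directly hit by a query variant, yet it still receives a nonzero score through the gene-gene edge. This is the kind of indirect relevance the abstract refers to.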

Subjects

Genomics/methods , Single Nucleotide Polymorphism , Software , Algorithms , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Statistical Data Interpretation , Female , Genes , Humans , Hypoplastic Left Heart Syndrome/genetics , Hypoplastic Left Heart Syndrome/metabolism , Male , Prostatic Neoplasms/genetics , Prostatic Neoplasms/metabolism , Signal Transduction/genetics

Variant Detection in 3' Exons of PMS2 Using Exome Sequencing Data.

Mistry, Nipun A; Roellinger, Samantha E; Manninen, Matthew C; Gandham, Mallika; Koganti, Tejaswi; Balan, Jagadheshwar; Basu, Shubham; Blake, Emily J; Tandale, Pratyush P; Holdren, Megan A; Hoenig, Megan F; Urban, Rhianna M; Veith, Rebecca L; Kendzior, Matthew C; Wang, Chen; Gupta, Sounak; Shen, Wei.

J Mol Diagn ; 2024 Jun 24.

Article in English | MEDLINE | ID: mdl-38925456

ABSTRACT

PMS2 is one of the DNA-mismatch repair genes included in routine genetic testing for Lynch syndrome and colorectal, ovarian, and endometrial cancers. PMS2 is also included in the American College of Medical Genetics and Genomics' List of Secondary Findings Genes in the context of clinical exome and genome sequencing. However, sequencing of PMS2 by short-read-based next-generation sequencing technologies is complicated by the presence of the pseudogene PMS2CL, and is often supplemented by long-range-based approaches, such as long-range PCR or long-read-based next-generation sequencing, which increases the complexity and cost. This article describes a bioinformatics homology triage workflow that can eliminate the need for long-read-based testing for PMS2 in the vast majority of patients undergoing exome sequencing, thus simplifying PMS2 testing and reducing the associated cost.

Managing genomic variant calling workflows with Swift/T.

Ahmed, Azza E; Heldenbrand, Jacob; Asmann, Yan; Fadlelmola, Faisal M; Katz, Daniel S; Kendig, Katherine; Kendzior, Matthew C; Li, Tiffany; Ren, Yingxue; Rodriguez, Elliott; Weber, Matthew R; Wozniak, Justin M; Zermeno, Jennie; Mainzer, Liudmila S.

PLoS One ; 14(7): e0211608, 2019.

Article in English | MEDLINE | ID: mdl-31287816

ABSTRACT

Bioinformatics research is frequently performed using complex workflows with multiple steps, fans, merges, and conditionals. This complexity makes management of the workflow difficult on a computer cluster, especially when running in parallel on large batches of data: hundreds or thousands of samples at a time. Scientific workflow management systems could help with that. Many are now being proposed, but is there yet the "best" workflow management system for bioinformatics? Such a system would need to satisfy numerous, sometimes conflicting requirements: from ease of use, to seamless deployment at peta- and exa-scale, and portability to the cloud. We evaluated Swift/T as a candidate for such role by implementing a primary genomic variant calling workflow in the Swift/T language, focusing on workflow management, performance and scalability issues that arise from production-grade big data genomic analyses. In the process we introduced novel features into the language, which are now part of its open repository. Additionally, we formalized a set of design criteria for quality, robust, maintainable workflows that must function at-scale in a production setting, such as a large genomic sequencing facility or a major hospital system. The use of Swift/T conveys two key advantages. (1) It operates transparently in multiple cluster scheduling environments (PBS Torque, SLURM, Cray aprun environment, etc.), thus a single workflow is trivially portable across numerous clusters. (2) The leaf functions of Swift/T permit developers to easily swap executables in and out of the workflow, which makes it easy to maintain and to request resources optimal for each stage of the pipeline. While Swift/T's data-level parallelism eliminates the need to code parallel analysis of multiple samples, it does make debugging more difficult, as is common for implicitly parallel code. Nonetheless, the language gives users a powerful and portable way to scale up analyses in many computing architectures. 
The code for our implementation of a variant calling workflow using Swift/T can be found on GitHub at https://github.com/ncsa/Swift-T-Variant-Calling, with full documentation provided at http://swift-t-variant-calling.readthedocs.io/en/latest/.

Subjects

Computational Biology , Genomics , Software , Animals , Humans , Workflow
