Pesquisa | Biblioteca Virtual em Saúde

Investigation of somatic single nucleotide variations in human endogenous retrovirus elements and their potential association with cancer.

Chang, Ting-Chia; Goud, Santosh; Torcivia-Rodriguez, John; Hu, Yu; Pan, Qing; Kahsay, Robel; Blomberg, Jonas; Mazumder, Raja.

PLoS One ; 14(4): e0213770, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-30934003

RESUMO

Human endogenous retroviruses (HERVs) have been investigated for potential links with human cancer. However, the distribution of somatic nucleotide variations in HERV elements has not been explored in detail. This study aims to identify HERV elements with an over-representation of somatic mutations (hot spots) in cancer patients. Four HERV elements with mutation hotspots were identified that overlap with exons of four human protein coding genes. These hotspots were identified based on the significant over-representation (p<8.62e-4) of non-synonymous single-nucleotide variations (nsSNVs). These genes are TNN (HERV-9/LTR12), OR4K15 (HERV-IP10F/LTR10F), ZNF99 (HERV-W/HERV17/LTR17), and KIR2DL1 (MST/MaLR). In an effort to identify mutations that effect survival, all nsSNVs were further evaluated and it was found that kidney cancer patients with mutation C2270G in ZNF99 have a significantly lower survival rate (hazard ratio = 2.6) compared to those without it. Among HERV elements in the human non-protein coding regions, we found 788 HERVs with significantly elevated numbers of somatic single-nucleotide variations (SNVs) (p<1.60e-5). From this category the top three HERV elements with significantly over-represented SNVs are HERV-H/LTR7, HERV-9/LTR12 and HERV-L/MLT2. Majority of the SNVs in these 788 HERV elements are located in three DNA functional groups: long non-coding RNAs (lncRNAs) (60%), introns (22.2%) and transcriptional factor binding sites (TFBS) (14.8%). This study provides a list of mutational hotspots in HERVs, which could potentially be used as biomarkers and therapeutic targets.

Assuntos

Retrovirus Endógenos/genética , Genoma Humano/genética , Neoplasias Renais/genética , Polimorfismo de Nucleotídeo Único/genética , Éxons/genética , Regulação Neoplásica da Expressão Gênica , Humanos , Íntrons/genética , Neoplasias Renais/patologia , Mutação , RNA Longo não Codificante/genética , Receptores KIR2DL1/genética , Análise de Sobrevida , Tenascina/genética , Sequências Repetidas Terminais/genética

A Primer for Access to Repositories of Cancer-Related Genomic Big Data.

Torcivia-Rodriguez, John; Dingerdissen, Hayley; Chang, Ting-Chia; Mazumder, Raja.

Methods Mol Biol ; 1878: 1-37, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-30378067

RESUMO

The use of large datasets has become ubiquitous in biomedical sciences. Researchers in the field of cancer genomics have, in recent years, generated large volumes of data from their experiments. Those responsible for production of this data often analyze a narrow subset of this data based on the research question they are trying to address: this is the case whether or not they are acting independently or in conjunction with a large-scale cancer genomics project. The reality of this situation creates the opportunity for other researchers to repurpose this data for different hypotheses if the data is made easily and freely available. New insights in biology resulting from more researchers having access to data they otherwise would be unable to generate on their own are a boon for the field. The following chapter reviews several cancer genomics-related databases and outlines the type of data they contain, as well as the methods required to access each database. While this list is not comprehensive, it should provide a basis for cancer researchers to begin exploring some of the many large datasets that are available to them.

Assuntos

Neoplasias/genética , Bases de Dados Genéticas , Genômica/métodos , Humanos , Pesquisa

A complete Leishmania donovani reference genome identifies novel genetic variations associated with virulence.

Lypaczewski, Patrick; Hoshizaki, Johanna; Zhang, Wen-Wei; McCall, Laura-Isobel; Torcivia-Rodriguez, John; Simonyan, Vahan; Kaur, Amanpreet; Dewar, Ken; Matlashewski, Greg.

Sci Rep ; 8(1): 16549, 2018 11 08.

Artigo em Inglês | MEDLINE | ID: mdl-30409989

RESUMO

Leishmania donovani is responsible for visceral leishmaniasis, a neglected and lethal parasitic disease with limited treatment options and no vaccine. The study of L. donovani has been hindered by the lack of a high-quality reference genome and this can impact experimental outcomes including the identification of virulence genes, drug targets and vaccine development. We therefore generated a complete genome assembly by deep sequencing using a combination of second generation (Illumina) and third generation (PacBio) sequencing technologies. Compared to the current L. donovani assembly, the genome assembly reported within resulted in the closure over 2,000 gaps, the extension of several chromosomes up to telomeric repeats and the re-annotation of close to 15% of protein coding genes and the annotation of hundreds of non-coding RNA genes. It was possible to correctly assemble the highly repetitive A2 and Amastin virulence gene clusters. A comparative sequence analysis using the improved reference genome confirmed 70 published and identified 15 novel genomic differences between closely related visceral and atypical cutaneous disease-causing L. donovani strains providing a more complete map of genes associated with virulence and visceral organ tropism. Bioinformatic tools including protein variation effect analyzer and basic local alignment search tool were used to prioritize a list of potential virulence genes based on mutation severity, gene conservation and function. This complete genome assembly and novel information on virulence factors will support the identification of new drug targets and the development of a vaccine for L. donovani.

Assuntos

Leishmania donovani/patogenicidade , Fatores de Virulência/genética , Sequenciamento Completo do Genoma/métodos , Animais , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Leishmania donovani/genética , Leishmaniose Visceral/parasitologia , Anotação de Sequência Molecular , Sri Lanka , Tropismo

BioMuta and BioXpress: mutation and expression knowledgebases for cancer biomarker discovery.

Dingerdissen, Hayley M; Torcivia-Rodriguez, John; Hu, Yu; Chang, Ting-Chia; Mazumder, Raja; Kahsay, Robel.

Nucleic Acids Res ; 46(D1): D1128-D1136, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-30053270

RESUMO

Single-nucleotide variation and gene expression of disease samples represent important resources for biomarker discovery. Many databases have been built to host and make available such data to the community, but these databases are frequently limited in scope and/or content. BioMuta, a database of cancer-associated single-nucleotide variations, and BioXpress, a database of cancer-associated differentially expressed genes and microRNAs, differ from other disease-associated variation and expression databases primarily through the aggregation of data across many studies into a single source with a unified representation and annotation of functional attributes. Early versions of these resources were initiated by pilot funding for specific research applications, but newly awarded funds have enabled hardening of these databases to production-level quality and will allow for sustained development of these resources for the next few years. Because both resources were developed using a similar methodology of integration, curation, unification, and annotation, we present BioMuta and BioXpress as allied databases that will facilitate a more comprehensive view of gene associations in cancer. BioMuta and BioXpress are hosted on the High-performance Integrated Virtual Environment (HIVE) server at the George Washington University at https://hive.biochemistry.gwu.edu/biomuta and https://hive.biochemistry.gwu.edu/bioxpress, respectively.

Assuntos

Biomarcadores Tumorais/genética , Bases de Dados Genéticas , Bases de Conhecimento , Mutação , Neoplasias/genética , Regulação Neoplásica da Expressão Gênica , Humanos , MicroRNAs , Interface Usuário-Computador

Distribution bias analysis of germline and somatic single-nucleotide variations that impact protein functional site and neighboring amino acids.

Pan, Yang; Yan, Cheng; Hu, Yu; Fan, Yu; Pan, Qing; Wan, Quan; Torcivia-Rodriguez, John; Mazumder, Raja.

Sci Rep ; 7: 42169, 2017 02 08.

Artigo em Inglês | MEDLINE | ID: mdl-28176830

RESUMO

Single nucleotide variations (SNVs) can result in loss or gain of protein functional sites. We analyzed the effects of SNVs on enzyme active sites, ligand binding sites, and various types of post translational modification (PTM) sites. We found that, for most types of protein functional sites, the SNV pattern differs between germline and somatic mutations as well as between synonymous and non-synonymous mutations. From a total of 51,138 protein functional site affecting SNVs (pfsSNVs), a pan-cancer analysis revealed 142 somatic pfsSNVs in five or more cancer types. By leveraging patient information for somatic pfsSNVs, we identified 17 loss of functional site SNVs and 60 gain of functional site SNVs which are significantly enriched in patients with specific cancer types. Of the key pfsSNVs identified in our analysis above, we highlight 132 key pfsSNVs within 17 genes that are found in well-established cancer associated gene lists. For illustrating how key pfsSNVs can be prioritized further, we provide a use case where we performed survival analysis showing that a loss of phosphorylation site pfsSNV at position 105 in MEF2A is significantly associated with decreased pancreatic cancer patient survival rate. These 132 pfsSNVs can be used in developing genetic testing pipelines.

Assuntos

Regulação Neoplásica da Expressão Gênica , Mutação em Linhagem Germinativa , Proteínas de Neoplasias/genética , Neoplasias/genética , Polimorfismo de Nucleotídeo Único , Processamento de Proteína Pós-Traducional , Acetilação , Domínio Catalítico , Bases de Dados Genéticas , Perfilação da Expressão Gênica , Ontologia Genética , Glicosilação , Humanos , Metilação , Anotação de Sequência Molecular , Proteínas de Neoplasias/química , Proteínas de Neoplasias/metabolismo , Neoplasias/metabolismo , Neoplasias/mortalidade , Neoplasias/patologia , Fosforilação , Análise de Sobrevida , Ubiquitinação

Pubcast and Genecast: Browsing and Exploring Publications and Associated Curated Content in Biology Through Mobile Devices.

Goldweber, Scott; Theodore, Jamal; Torcivia-Rodriguez, John; Simonyan, Vahan; Mazumder, Raja.

IEEE/ACM Trans Comput Biol Bioinform ; 14(2): 498-500, 2017.

Artigo em Inglês | MEDLINE | ID: mdl-28113865

RESUMO

Services such as Facebook, Amazon, and eBay were once solely accessed from stationary computers. These web services are now being used increasingly on mobile devices. We acknowledge this new reality by providing users a way to access publications and a curated cancer mutation database on their mobile device with daily automated updates. AVAILABILITY: http://hive. biochemistry.gwu.edu/tools/HivePubcast.

Assuntos

Mineração de Dados/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Publicações Periódicas como Assunto , Smartphone , Interface Usuário-Computador , Curadoria de Dados , Internet

High-performance integrated virtual environment (HIVE): a robust infrastructure for next-generation sequence data analysis.

Simonyan, Vahan; Chumakov, Konstantin; Dingerdissen, Hayley; Faison, William; Goldweber, Scott; Golikov, Anton; Gulzar, Naila; Karagiannis, Konstantinos; Vinh Nguyen Lam, Phuc; Maudru, Thomas; Muravitskaja, Olesja; Osipova, Ekaterina; Pan, Yang; Pschenichnov, Alexey; Rostovtsev, Alexandre; Santana-Quintero, Luis; Smith, Krista; Thompson, Elaine E; Tkachenko, Valery; Torcivia-Rodriguez, John; Voskanian, Alin; Wan, Quan; Wang, Jing; Wu, Tsung-Jung; Wilson, Carolyn; Mazumder, Raja.

Database (Oxford) ; 20162016.

Artigo em Inglês | MEDLINE | ID: mdl-26989153

RESUMO

The High-performance Integrated Virtual Environment (HIVE) is a distributed storage and compute environment designed primarily to handle next-generation sequencing (NGS) data. This multicomponent cloud infrastructure provides secure web access for authorized users to deposit, retrieve, annotate and compute on NGS data, and to analyse the outcomes using web interface visual environments appropriately built in collaboration with research and regulatory scientists and other end users. Unlike many massively parallel computing environments, HIVE uses a cloud control server which virtualizes services, not processes. It is both very robust and flexible due to the abstraction layer introduced between computational requests and operating system processes. The novel paradigm of moving computations to the data, instead of moving data to computational nodes, has proven to be significantly less taxing for both hardware and network infrastructure.The honeycomb data model developed for HIVE integrates metadata into an object-oriented model. Its distinction from other object-oriented databases is in the additional implementation of a unified application program interface to search, view and manipulate data of all types. This model simplifies the introduction of new data types, thereby minimizing the need for database restructuring and streamlining the development of new integrated information systems. The honeycomb model employs a highly secure hierarchical access control and permission system, allowing determination of data access privileges in a finely granular manner without flooding the security subsystem with a multiplicity of rules. HIVE infrastructure will allow engineers and scientists to perform NGS analysis in a manner that is both efficient and secure. HIVE is actively supported in public and private domains, and project collaborations are welcomed. Database URL: https://hive.biochemistry.gwu.edu.

Assuntos

Sequenciamento de Nucleotídeos em Larga Escala/métodos , Interface Usuário-Computador , Biologia Computacional , Mutação/genética , Poliovirus/genética , Vacinas contra Poliovirus/imunologia , Proteômica , Recombinação Genética , Alinhamento de Sequência , Estatística como Assunto

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA