Pesquisa | BVS Violência e Saúde

EPIC-CoGe: managing and analyzing genomic data.

Nelson, Andrew D L; Haug-Baltzell, Asher K; Davey, Sean; Gregory, Brian D; Lyons, Eric.

Bioinformatics ; 34(15): 2651-2653, 2018 08 01.

Artigo em Inglês | MEDLINE | ID: mdl-29474529

RESUMO

Summary: The EPIC-CoGe browser is a web-based genome visualization utility that integrates the GMOD JBrowse genome browser with the extensive CoGe genome database (currently containing over 30 000 genomes). In addition, the EPIC-CoGe browser boasts many additional features over basic JBrowse, including enhanced search capability and on-the-fly analyses for comparisons and analyses between all types of functional and diversity genomics data. There is no installation required and data (genome, annotation, functional genomic and diversity data) can be loaded by following a simple point and click wizard, or using a REST API, making the browser widely accessible and easy to use by researchers of all computational skill levels. In addition, EPIC-CoGe and data tracks are easily embedded in other websites and JBrowse instances. Availability and implementation: EPIC-CoGe Browser is freely available for use online through CoGe (https://genomevolution.org). Source code (MIT open source) is available: https://github.com/LyonsLab/coge. Supplementary information: Supplementary data are available at Bioinformatics online.

Assuntos

Visualização de Dados , Genoma , Anotação de Sequência Molecular , Análise de Sequência de DNA/métodos , Software , Genômica/métodos

COVID-19 susceptibility and severity risks in a cross-sectional survey of over 500 000 US adults.

Knight, Spencer C; McCurdy, Shannon R; Rhead, Brooke; Coignet, Marie V; Park, Danny S; Roberts, Genevieve H L; Berkowitz, Nathan D; Zhang, Miao; Turissini, David; Delgado, Karen; Pavlovic, Milos; Haug Baltzell, Asher K; Guturu, Harendra; Rand, Kristin A; Girshick, Ahna R; Hong, Eurie L; Ball, Catherine A.

BMJ Open ; 12(10): e049657, 2022 10 12.

Artigo em Inglês | MEDLINE | ID: mdl-36223959

RESUMO

OBJECTIVES: The enormous toll of the COVID-19 pandemic has heightened the urgency of collecting and analysing population-scale datasets in real time to monitor and better understand the evolving pandemic. The objectives of this study were to examine the relationship of risk factors to COVID-19 susceptibility and severity and to develop risk models to accurately predict COVID-19 outcomes using rapidly obtained self-reported data. DESIGN: A cross-sectional study. SETTING: AncestryDNA customers in the USA who consented to research. PARTICIPANTS: The AncestryDNA COVID-19 Study collected self-reported survey data on symptoms, outcomes, risk factors and exposures for over 563 000 adult individuals in the USA in just under 4 months, including over 4700 COVID-19 cases as measured by a self-reported positive test. RESULTS: We replicated previously reported associations between several risk factors and COVID-19 susceptibility and severity outcomes, and additionally found that differences in known exposures accounted for many of the susceptibility associations. A notable exception was elevated susceptibility for men even after adjusting for known exposures and age (adjusted OR=1.36, 95% CI=1.19 to 1.55). We also demonstrated that self-reported data can be used to build accurate risk models to predict individualised COVID-19 susceptibility (area under the curve (AUC)=0.84) and severity outcomes including hospitalisation and critical illness (AUC=0.87 and 0.90, respectively). The risk models achieved robust discriminative performance across different age, sex and genetic ancestry groups within the study. CONCLUSIONS: The results highlight the value of self-reported epidemiological data to rapidly provide public health insights into the evolving COVID-19 pandemic.

Assuntos

COVID-19 , Adulto , COVID-19/epidemiologia , Estudos Transversais , Humanos , Masculino , Pandemias , Fatores de Risco , SARS-CoV-2

Expanded COVID-19 phenotype definitions reveal distinct patterns of genetic association and protective effects.

Roberts, Genevieve H L; Partha, Raghavendran; Rhead, Brooke; Knight, Spencer C; Park, Danny S; Coignet, Marie V; Zhang, Miao; Berkowitz, Nathan; Turrisini, David A; Gaddis, Michael; McCurdy, Shannon R; Pavlovic, Milos; Ruiz, Luong; Sass, Chodon; Haug Baltzell, Asher K; Guturu, Harendra; Girshick, Ahna R; Ball, Catherine A; Hong, Eurie L; Rand, Kristin A.

Nat Genet ; 54(4): 374-381, 2022 04.

Artigo em Inglês | MEDLINE | ID: mdl-35410379

RESUMO

Multiple COVID-19 genome-wide association studies (GWASs) have identified reproducible genetic associations indicating that there is a genetic component to susceptibility and severity risk. To complement these studies, we collected deep coronavirus disease 2019 (COVID-19) phenotype data from a survey of 736,723 AncestryDNA research participants. With these data, we defined eight phenotypes related to COVID-19 outcomes: four phenotypes that align with previously studied COVID-19 definitions and four 'expanded' phenotypes that focus on susceptibility given exposure, mild clinical manifestations and an aggregate score of symptom severity. We performed a replication analysis of 12 previously reported COVID-19 genetic associations with all eight phenotypes in a trans-ancestry meta-analysis of AncestryDNA research participants. In this analysis, we show distinct patterns of association at the 12 loci with the eight outcomes that we assessed. We also performed a genome-wide discovery analysis of all eight phenotypes, which did not yield new genome-wide significant loci but did suggest that three of the four 'ï»¿expanded'ï»¿ COVID-19 phenotypes have enhanced power to capture protective genetic associations relative to the previously studied phenotypes. Thus, we conclude that continued large-scale ascertainment of deep COVID-19 phenotype data would likely represent a boon for COVID-19 therapeutic target identification.

Assuntos

COVID-19 , Estudo de Associação Genômica Ampla , COVID-19/genética , Predisposição Genética para Doença , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética

Evolinc: A Tool for the Identification and Evolutionary Comparison of Long Intergenic Non-coding RNAs.

Nelson, Andrew D L; Devisetty, Upendra K; Palos, Kyle; Haug-Baltzell, Asher K; Lyons, Eric; Beilstein, Mark A.

Front Genet ; 8: 52, 2017.

Artigo em Inglês | MEDLINE | ID: mdl-28536600

RESUMO

Long intergenic non-coding RNAs (lincRNAs) are an abundant and functionally diverse class of eukaryotic transcripts. Reported lincRNA repertoires in mammals vary, but are commonly in the thousands to tens of thousands of transcripts, covering ~90% of the genome. In addition to elucidating function, there is particular interest in understanding the origin and evolution of lincRNAs. Aside from mammals, lincRNA populations have been sparsely sampled, precluding evolutionary analyses focused on their emergence and persistence. Here we present Evolinc, a two-module pipeline designed to facilitate lincRNA discovery and characterize aspects of lincRNA evolution. The first module (Evolinc-I) is a lincRNA identification workflow that also facilitates downstream differential expression analysis and genome browser visualization of identified lincRNAs. The second module (Evolinc-II) is a genomic and transcriptomic comparative analysis workflow that determines the phylogenetic depth to which a lincRNA locus is conserved within a user-defined group of related species. Here we validate lincRNA catalogs generated with Evolinc-I against previously annotated Arabidopsis and human lincRNA data. Evolinc-I recapitulated earlier findings and uncovered an additional 70 Arabidopsis and 43 human lincRNAs. We demonstrate the usefulness of Evolinc-II by examining the evolutionary histories of a public dataset of 5,361 Arabidopsis lincRNAs. We used Evolinc-II to winnow this dataset to 40 lincRNAs conserved across species in Brassicaceae. Finally, we show how Evolinc-II can be used to recover the evolutionary history of a known lincRNA, the human telomerase RNA (TERC). These latter analyses revealed unexpected duplication events as well as the loss and subsequent acquisition of a novel TERC locus in the lineage leading to mice and rats. The Evolinc pipeline is currently integrated in CyVerse's Discovery Environment and is free for use by researchers.

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved (Non-model) Organisms.

Joyce, Blake L; Haug-Baltzell, Asher K; Hulvey, Jonathan P; McCarthy, Fiona; Devisetty, Upendra Kumar; Lyons, Eric.

J Vis Exp ; (123)2017 05 09.

Artigo em Inglês | MEDLINE | ID: mdl-28518075

RESUMO

This workflow allows novice researchers to leverage advanced computational resources such as cloud computing to carry out pairwise comparative transcriptomics. It also serves as a primer for biologists to develop data scientist computational skills, e.g. executing bash commands, visualization and management of large data sets. All command line code and further explanations of each command or step can be found on the wiki (https://wiki.cyverse.org/wiki/x/dgGtAQ). The Discovery Environment and Atmosphere platforms are connected together through the CyVerse Data Store. As such, once the initial raw sequencing data has been uploaded there is no more need to transfer large data files over an Internet connection, minimizing the amount of time needed to conduct analyses. This protocol is designed to analyze only two experimental treatments or conditions. Differential gene expression analysis is conducted through pairwise comparisons, and will not be suitable to test multiple factors. This workflow is also designed to be manual rather than automated. Each step must be executed and investigated by the user, yielding a better understanding of data and analytical outputs, and therefore better results for the user. Once complete, this protocol will yield de novo assembled transcriptome(s) for underserved (non-model) organisms without the need to map to previously assembled reference genomes (which are usually not available in underserved organism). These de novo transcriptomes are further used in pairwise differential gene expression analysis to investigate genes differing between two experimental conditions. Differentially expressed genes are then functionally annotated to understand the genetic response organisms have to experimental conditions. In total, the data derived from this protocol is used to test hypotheses about biological responses of underserved organisms.

Assuntos

Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Software , Animais , Biologia Computacional/educação , Internet , Análise de Sequência de RNA/métodos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA