Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
iScience ; 26(4): 106408, 2023 Apr 21.
Artículo en Inglés | MEDLINE | ID: mdl-36974157

RESUMEN

Identification of proteins dysregulated by COVID-19 infection is critically important for better understanding of its pathophysiology, building prognostic models, and identifying new targets. Plasma proteomic profiling of 4,301 proteins was performed in two independent datasets and tested for the association for three COVID-19 outcomes (infection, ventilation, and death). We identified 1,449 proteins consistently associated in both datasets with any of these three outcomes. We subsequently created highly accurate models that distinctively predict infection, ventilation, and death. These proteins were enriched in specific biological processes including cytokine signaling, Alzheimer's disease, and coronary artery disease. Mendelian randomization and gene network analyses identified eight causal proteins and 141 highly connected hub proteins including 35 with known drug targets. Our findings provide distinctive prognostic biomarkers for two severe COVID-19 outcomes, reveal their relationship to Alzheimer's disease and coronary artery disease, and identify potential therapeutic targets for COVID-19 outcomes.

2.
HGG Adv ; 4(1): 100150, 2023 01 12.
Artículo en Inglés | MEDLINE | ID: mdl-36340933

RESUMEN

The heritability of autism spectrum disorder (ASD), based on 680,000 families and five countries, is estimated to be nearly 80%, yet heritability reported from SNP-based studies are consistently lower, and few significant loci have been identified with genome-wide association studies. This gap in genomic information may reside in rare variants, interaction among variants (epistasis), or cryptic structural variation (SV) and may provide mechanisms that underlie ASD. Here we use a method to identify potential SVs based on non-Mendelian inheritance patterns in pedigrees using parent-child genotypes from ASD families and demonstrate that they are enriched in ASD-risk genes. Most are in non-coding genic space and are over-represented in expression quantitative trait loci, suggesting that they affect gene regulation, which we confirm with their overlap of differentially expressed genes in postmortem brain tissue of ASD individuals. We then identify an SV in the GRIK2 gene that alters RNA splicing and a regulatory region of the ACMSD gene in the kynurenine pathway as significantly associated with a non-verbal ASD phenotype, supporting our hypothesis that these currently excluded loci can provide a clearer mechanistic understanding of ASD. Finally, we use an explainable artificial intelligence approach to define subgroups demonstrating their use in the context of precision medicine.


Asunto(s)
Trastorno del Espectro Autista , Humanos , Trastorno del Espectro Autista/genética , Estudio de Asociación del Genoma Completo/métodos , Inteligencia Artificial , Sitios de Carácter Cuantitativo/genética , Patrón de Herencia/genética
3.
medRxiv ; 2022 Jul 25.
Artículo en Inglés | MEDLINE | ID: mdl-35923315

RESUMEN

Identification of the plasma proteomic changes of Coronavirus disease 2019 (COVID-19) is essential to understanding the pathophysiology of the disease and developing predictive models and novel therapeutics. We performed plasma deep proteomic profiling from 332 COVID-19 patients and 150 controls and pursued replication in an independent cohort (297 cases and 76 controls) to find potential biomarkers and causal proteins for three COVID-19 outcomes (infection, ventilation, and death). We identified and replicated 1,449 proteins associated with any of the three outcomes (841 for infection, 833 for ventilation, and 253 for death) that can be query on a web portal ( https://covid.proteomics.wustl.edu/ ). Using those proteins and machine learning approached we created and validated specific prediction models for ventilation (AUC>0.91), death (AUC>0.95) and either outcome (AUC>0.80). These proteins were also enriched in specific biological processes, including immune and cytokine signaling (FDR ≤ 3.72×10 -14 ), Alzheimer's disease (FDR ≤ 5.46×10 -10 ) and coronary artery disease (FDR ≤ 4.64×10 -2 ). Mendelian randomization using pQTL as instrumental variants nominated BCAT2 and GOLM1 as a causal proteins for COVID-19. Causal gene network analyses identified 141 highly connected key proteins, of which 35 have known drug targets with FDA-approved compounds. Our findings provide distinctive prognostic biomarkers for two severe COVID-19 outcomes (ventilation and death), reveal their relationship to Alzheimer's disease and coronary artery disease, and identify potential therapeutic targets for COVID-19 outcomes.

4.
Patterns (N Y) ; 2(12): 100374, 2021 Dec 10.
Artículo en Inglés | MEDLINE | ID: mdl-34950902

RESUMEN

Network modeling transforms data into a structure of nodes and edges such that edges represent relationships between pairs of objects, then extracts clusters of densely connected nodes in order to capture high-dimensional relationships hidden in the data. This efficient and flexible strategy holds potential for unveiling complex patterns concealed within massive datasets, but standard implementations overlook several key issues that can undermine research efforts. These issues range from data imputation and discretization to correlation metrics, clustering methods, and validation of results. Here, we enumerate these pitfalls and provide practical strategies for alleviating their negative effects. These guidelines increase prospects for future research endeavors as they reduce type I and type II (false-positive and false-negative) errors and are generally applicable for network modeling applications across diverse domains.

5.
Patterns (N Y) ; 2(5): 100260, 2021 May 14.
Artículo en Inglés | MEDLINE | ID: mdl-33880455

RESUMEN

The conundrums of choosing candidate genes, via differential expression between treated and mock specimens, are tackled by Ghandikota et al. in this issue of Patterns in their efforts to tease out genetic patterns that are characteristic of coronavirus disease 2019 (COVID-19) outcomes.

6.
Methods Mol Biol ; 2096: 197-215, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-32720156

RESUMEN

We demonstrate a selection of network and machine learning techniques useful in the analysis of complex datasets, including 2-way similarity networks, Markov clustering, enrichment statistical networks, FCROS differential analysis, and random forests. We demonstrate each of these techniques on the Populus trichocarpa gene expression atlas.


Asunto(s)
Bases de Datos como Asunto , Redes Reguladoras de Genes , Populus/genética , Algoritmos , Análisis por Conglomerados , Regulación de la Expresión Génica de las Plantas , Programas Informáticos
7.
BMC Plant Biol ; 20(1): 120, 2020 Mar 18.
Artículo en Inglés | MEDLINE | ID: mdl-32183694

RESUMEN

BACKGROUND: Potato is the third most consumed crop in the world. Breeding for traits such as yield, product quality and pathogen resistance are main priorities. Identifying molecular signatures of these and other important traits is important in future breeding efforts. In this study, a progeny population from a cross between a breeding line, SW93-1015, and a cultivar, Désirée, was studied by trait analysis and RNA-seq in order to develop understanding of segregating traits at the molecular level and identify transcripts with expressional correlation to these traits. Transcript markers with predictive value for field performance applicable under controlled environments would be of great value for plant breeding. RESULTS: A total of 34 progeny lines from SW93-1015 and Désirée were phenotyped for 17 different traits in a field in Nordic climate conditions and controlled climate settings. A master transcriptome was constructed with all 34 progeny lines and the parents through a de novo assembly of RNA-seq reads. Gene expression data obtained in a controlled environment from the 34 lines was correlated to traits by different similarity indices, including Pearson and Spearman, as well as DUO, which calculates the co-occurrence between high and low values for gene expression and trait. Our study linked transcripts to traits such as yield, growth rate, high laying tubers, late and tuber blight, tuber greening and early flowering. We found several transcripts associated to late blight resistance and transcripts encoding receptors were associated to Dickeya solani susceptibility. Transcript levels of a UBX-domain protein was negatively associated to yield and a GLABRA2 expression modulator was negatively associated to growth rate. CONCLUSION: In our study, we identify 100's of transcripts, putatively linked based on expression with 17 traits of potato, representing both well-known and novel associations. This approach can be used to link the transcriptome to traits. We explore the possibility of associating the level of transcript expression from controlled, optimal environments to traits in a progeny population with different methods introducing the application of DUO for the first time on transcriptome data. We verify the expression pattern for five of the putative transcript markers in another progeny population.


Asunto(s)
Rasgos de la Historia de Vida , Fenotipo , Solanum tuberosum/genética , Transcriptoma , Tetraploidía
8.
G3 (Bethesda) ; 6(5): 1251-66, 2016 05 03.
Artículo en Inglés | MEDLINE | ID: mdl-26921301

RESUMEN

The well-documented latitudinal clines of genes affecting human skin color presumably arise from the need for protection from intense ultraviolet radiation (UVR) vs. the need to use UVR for vitamin D synthesis. Sampling 751 subjects from a broad range of latitudes and skin colors, we investigated possible multilocus correlated adaptation of skin color genes with the vitamin D receptor gene (VDR), using a vector correlation metric and network method called BlocBuster. We discovered two multilocus networks involving VDR promoter and skin color genes that display strong latitudinal clines as multilocus networks, even though many of their single gene components do not. Considered one by one, the VDR components of these networks show diverse patterns: no cline, a weak declining latitudinal cline outside of Africa, and a strong in- vs. out-of-Africa frequency pattern. We confirmed these results with independent data from HapMap. Standard linkage disequilibrium analyses did not detect these networks. We applied BlocBuster across the entire genome, showing that our networks are significant outliers for interchromosomal disequilibrium that overlap with environmental variation relevant to the genes' functions. These results suggest that these multilocus correlations most likely arose from a combination of parallel selective responses to a common environmental variable and coadaptation, given the known Mendelian epistasis among VDR and the skin color genes.


Asunto(s)
Altitud , Interacción Gen-Ambiente , Receptores de Calcitriol/genética , Pigmentación de la Piel/genética , Adaptación Biológica/genética , Alelos , Biología Computacional/métodos , Epistasis Genética , Frecuencia de los Genes , Redes Reguladoras de Genes , Ligamiento Genético , Genoma Humano , Genómica/métodos , Genotipo , Humanos , Desequilibrio de Ligamiento , Polimorfismo de Nucleótido Simple
9.
Front Genet ; 6: 301, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26500678

RESUMEN

The substantial progress in the last few years toward uncovering genetic causes and risk factors for autism spectrum disorders (ASDs) has opened new experimental avenues for identifying the underlying neurobiological mechanism of the condition. The bounty of genetic findings has led to a variety of data-driven exploratory analyses aimed at deriving new insights about the shared features of these genes. These approaches leverage data from a variety of different sources such as co-expression in transcriptomic studies, protein-protein interaction networks, gene ontologies (GOs) annotations, or multi-level combinations of all of these. Here, we review the recurrent themes emerging from these analyses and highlight some of the challenges going forward. Themes include findings that ASD associated genes discovered by a variety of methods have been shown to contain disproportionate amounts of neurite outgrowth/cytoskeletal, synaptic, and more recently Wnt-related and chromatin modifying genes. Expression studies have highlighted a disproportionate expression of ASD gene sets during mid fetal cortical development, particularly for rare variants, with multiple analyses highlighting the striatum and cortical projection and interneurons as well. While these explorations have highlighted potentially interesting relationships among these ASD-related genes, there are challenges in how to best transition these insights into empirically testable hypotheses. Nonetheless, defining shared molecular or cellular pathology downstream of the diverse genes associated with ASDs could provide the cornerstones needed to build toward broadly applicable therapeutic approaches.

10.
Nat Commun ; 6: 6534, 2015 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-25813846

RESUMEN

Gephyrin is a highly conserved gene that is vital for the organization of proteins at inhibitory receptors, molybdenum cofactor biosynthesis and other diverse functions. Its specific function is intricately regulated and its aberrant activities have been observed for a number of human diseases. Here we report a remarkable yin-yang haplotype pattern encompassing gephyrin. Yin-yang haplotypes arise when a stretch of DNA evolves to present two disparate forms that bear differing states for nucleotide variations along their lengths. The gephyrin yin-yang pair consists of 284 divergent nucleotide states and both variants vary drastically from their mutual ancestral haplotype, suggesting rapid evolution. Several independent lines of evidence indicate strong positive selection on the region and suggest these high-frequency haplotypes represent two distinct functional mechanisms. This discovery holds potential to deepen our understanding of variable human-specific regulation of gephyrin while providing clues for rapid evolutionary events and allelic migrations buried within human history.


Asunto(s)
Proteínas Portadoras/genética , Evolución Molecular , Haplotipos/genética , Proteínas de la Membrana/genética , Secuencia de Bases , Bases de Datos Genéticas , Humanos
11.
PLoS Comput Biol ; 10(9): e1003766, 2014 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-25233071

RESUMEN

Hundreds of genetic markers have shown associations with various complex diseases, yet the "missing heritability" remains alarmingly elusive. Combinatorial interactions may account for a substantial portion of this missing heritability, but their discoveries have been impeded by computational complexity and genetic heterogeneity. We present BlocBuster, a novel systems-level approach that efficiently constructs genome-wide, allele-specific networks that accurately segregate homogenous combinations of genetic factors, tests the associations of these combinations with the given phenotype, and rigorously validates the results using a series of unbiased validation methods. BlocBuster employs a correlation measure that is customized for single nucleotide polymorphisms and returns a multi-faceted collection of values that captures genetic heterogeneity. We applied BlocBuster to analyze psoriasis, discovering a combinatorial pattern with an odds ratio of 3.64 and Bonferroni-corrected p-value of 5.01×10(-16). This pattern was replicated in independent data, reflecting robustness of the method. In addition to improving prediction of disease susceptibility and broadening our understanding of the pathogenesis underlying psoriasis, these results demonstrate BlocBuster's potential for discovering combinatorial genetic associations within heterogeneous genome-wide data, thereby transcending the limiting "small effects" produced by individual markers examined in isolation.


Asunto(s)
Marcadores Genéticos/genética , Estudio de Asociación del Genoma Completo/métodos , Psoriasis/genética , Alelos , Biología Computacional , Humanos , Polimorfismo de Nucleótido Simple/genética
12.
Genet Epidemiol ; 38(7): 610-21, 2014 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-25168954

RESUMEN

Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of custom correlation coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (six genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Algoritmos , Estudios de Casos y Controles , Análisis por Conglomerados , Simulación por Computador , Epistasis Genética , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Genoma Humano , Genotipo , Cardiopatías/genética , Humanos , Modelos Genéticos
13.
Bioinformatics ; 25(1): 68-74, 2009 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-18987010

RESUMEN

MOTIVATION: Inference of haplotypes from genotype data is crucial and challenging for many vitally important studies. The first, and most critical step, is the ascertainment of a biologically sound model to be optimized. Many models that have been proposed rely partially or entirely on reducing the number of unique haplotypes in the solution. RESULTS: This article examines the parsimony of haplotypes using known haplotypes as well as genotypes from the HapMap project. Our study reveals that there are relatively few unique haplotypes, but not always the least possible, for the datasets with known solutions. Furthermore, we show that there are frequently very large numbers of parsimonious solutions, and the number increases exponentially with increasing cardinality. Moreover, these solutions are quite varied, most of which are not consistent with the true solutions. These results quantify the limitations of the Pure Parsimony model and demonstrate the imperative need to consider additional properties for haplotype inference models. At a higher level, and with broad applicability, this article illustrates the power of combinatorial methods to tease out imperfections in a given biological model.


Asunto(s)
Haplotipos , Naturaleza , Bases de Datos Genéticas , Heterocigoto , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...