Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
1.
PLoS Comput Biol ; 20(3): e1011814, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38527092

RESUMEN

As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation of such data. Current multi-omics integration methods typically output lists, clusters, or subnetworks of molecules related to an outcome. Even with expert domain knowledge, discerning the biological processes involved is a time-consuming activity. Here we propose PathIntegrate, a method for integrating multi-omics datasets based on pathways, designed to exploit knowledge of biological systems and thus provide interpretable models for such studies. PathIntegrate employs single-sample pathway analysis to transform multi-omics datasets from the molecular to the pathway-level, and applies a predictive single-view or multi-view model to integrate the data. Model outputs include multi-omics pathways ranked by their contribution to the outcome prediction, the contribution of each omics layer, and the importance of each molecule in a pathway. Using semi-synthetic data we demonstrate the benefit of grouping molecules into pathways to detect signals in low signal-to-noise scenarios, as well as the ability of PathIntegrate to precisely identify important pathways at low effect sizes. Finally, using COPD and COVID-19 data we showcase how PathIntegrate enables convenient integration and interpretation of complex high-dimensional multi-omics datasets. PathIntegrate is available as an open-source Python package.


Asunto(s)
Genómica , Multiómica , Genómica/métodos
2.
medRxiv ; 2024 Feb 28.
Artículo en Inglés | MEDLINE | ID: mdl-38464285

RESUMEN

Background: Studies have identified individual blood biomarkers associated with chronic obstructive pulmonary disease (COPD) and related phenotypes. However, complex diseases such as COPD typically involve changes in multiple molecules with interconnections that may not be captured when considering single molecular features. Methods: Leveraging proteomic data from 3,173 COPDGene Non-Hispanic White (NHW) and African American (AA) participants, we applied sparse multiple canonical correlation network analysis (SmCCNet) to 4,776 proteins assayed on the SomaScan v4.0 platform to derive sparse networks of proteins associated with current vs. former smoking status, airflow obstruction, and emphysema quantitated from high-resolution computed tomography scans. We then used NetSHy, a dimension reduction technique leveraging network topology, to produce summary scores of each proteomic network, referred to as NetSHy scores. We next performed genome-wide association study (GWAS) to identify variants associated with the NetSHy scores, or network quantitative trait loci (nQTLs). Finally, we evaluated the replicability of the networks in an independent cohort, SPIROMICS. Results: We identified networks of 13 to 104 proteins for each phenotype and exposure in NHW and AA, and the derived NetSHy scores significantly associated with the variable of interests. Networks included known (sRAGE, ALPP, MIP1) and novel molecules (CA10, CPB1, HIS3, PXDN) and interactions involved in COPD pathogenesis. We observed 7 nQTL loci associated with NetSHy scores, 4 of which remained after conditional analysis. Networks for smoking status and emphysema, but not airflow obstruction, demonstrated a high degree of replicability across race groups and cohorts. Conclusions: In this work, we apply state-of-the-art molecular network generation and summarization approaches to proteomic data from COPDGene participants to uncover protein networks associated with COPD phenotypes. We further identify genetic associations with networks. This work discovers protein networks containing known and novel proteins and protein interactions associated with clinically relevant COPD phenotypes across race groups and cohorts.

3.
bioRxiv ; 2024 Jan 25.
Artículo en Inglés | MEDLINE | ID: mdl-38328226

RESUMEN

Multiple -omics (genomics, proteomics, etc.) profiles are commonly generated to gain insight into a disease or physiological system. Constructing multi-omics networks with respect to the trait(s) of interest provides an opportunity to understand relationships between molecular features but integration is challenging due to multiple data sets with high dimensionality. One approach is to use canonical correlation to integrate one or two omics types and a single trait of interest. However, these types of methods may be limited due to (1) not accounting for higher-order correlations existing among features, (2) computational inefficiency when extending to more than two omics data when using a penalty term-based sparsity method, and (3) lack of flexibility for focusing on specific correlations (e.g., omics-to-phenotype correlation versus omics-to-omics correlations). In this work, we have developed a novel multi-omics network analysis pipeline called Sparse Generalized Tensor Canonical Correlation Analysis Network Inference (SGTCCA-Net) that can effectively overcome these limitations. We also introduce an implementation to improve the summarization of networks for downstream analyses. Simulation and real-data experiments demonstrate the effectiveness of our novel method for inferring omics networks and features of interest.

4.
bioRxiv ; 2024 Jan 09.
Artículo en Inglés | MEDLINE | ID: mdl-38260498

RESUMEN

As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation of such data. Current multi-omics integration methods typically output lists, clusters, or subnetworks of molecules related to an outcome. Even with expert domain knowledge, discerning the biological processes involved is a time-consuming activity. Here we propose PathIntegrate, a method for integrating multi-omics datasets based on pathways, designed to exploit knowledge of biological systems and thus provide interpretable models for such studies. PathIntegrate employs single-sample pathway analysis to transform multi-omics datasets from the molecular to the pathway-level, and applies a predictive single-view or multi-view model to integrate the data. Model outputs include multi-omics pathways ranked by their contribution to the outcome prediction, the contribution of each omics layer, and the importance of each molecule in a pathway. Using semi-synthetic data we demonstrate the benefit of grouping molecules into pathways to detect signals in low signal-to-noise scenarios, as well as the ability of PathIntegrate to precisely identify important pathways at low effect sizes. Finally, using COPD and COVID-19 data we showcase how PathIntegrate enables convenient integration and interpretation of complex high-dimensional multi-omics datasets. The PathIntegrate Python package is available at https://github.com/cwieder/PathIntegrate.

5.
Obesity (Silver Spring) ; 31(8): 2090-2102, 2023 08.
Artículo en Inglés | MEDLINE | ID: mdl-37475691

RESUMEN

OBJECTIVE: Fat content of adipocytes derived from infant umbilical cord mesenchymal stem cells (MSCs) predicts adiposity in children through 4 to 6 years of age. This study tested the hypothesis that MSCs from infants born to mothers with obesity (Ob-MSCs) exhibit adipocyte hypertrophy and perturbations in genes regulating adipogenesis compared with MSCs from infants of mothers with normal weight (NW-MSCs). METHODS: Adipogenesis was induced in MSCs embedded in three-dimensional hydrogel structures, and cell size and number were measured by three-dimensional imaging. Proliferation and protein markers of proliferation and adipogenesis in undifferentiated and adipocyte differentiating cells were measured. RNA sequencing was performed to determine pathways linked to adipogenesis phenotype. RESULTS: In undifferentiated MSCs, greater zinc finger protein (Zfp)423 protein content was observed in Ob- versus NW-MSCs. Adipocytes from Ob-MSCs were larger but fewer than adipocytes from NW-MSCs. RNA sequencing analysis showed that Zfp423 protein correlated with mRNA expression of genes enriched for cell cycle, MSC lineage specification, inflammation, and metabolism pathways. MSC proliferation was not different before differentiation but declined faster in Ob-MSCs upon adipogenic induction. CONCLUSIONS: Ob-MSCs have an intrinsic propensity for adipocyte hypertrophy and reduced hyperplasia during adipogenesis in vitro, perhaps linked to greater Zfp423 content and changes in cell cycle pathway gene expression.


Asunto(s)
Células Madre Mesenquimatosas , Madres , Femenino , Humanos , Obesidad/genética , Obesidad/metabolismo , Diferenciación Celular/genética , Adipogénesis/genética , Células Madre Mesenquimatosas/metabolismo , Factores de Transcripción/metabolismo , Adipocitos/metabolismo , Hipertrofia/metabolismo
6.
Sci Rep ; 13(1): 9254, 2023 06 07.
Artículo en Inglés | MEDLINE | ID: mdl-37286633

RESUMEN

Privacy protection is a core principle of genomic but not proteomic research. We identified independent single nucleotide polymorphism (SNP) quantitative trait loci (pQTL) from COPDGene and Jackson Heart Study (JHS), calculated continuous protein level genotype probabilities, and then applied a naïve Bayesian approach to link SomaScan 1.3K proteomes to genomes for 2812 independent subjects from COPDGene, JHS, SubPopulations and InteRmediate Outcome Measures In COPD Study (SPIROMICS) and Multi-Ethnic Study of Atherosclerosis (MESA). We correctly linked 90-95% of proteomes to their correct genome and for 95-99% we identify the 1% most likely links. The linking accuracy in subjects with African ancestry was lower (~ 60%) unless training included diverse subjects. With larger profiling (SomaScan 5K) in the Atherosclerosis Risk Communities (ARIC) correct identification was > 99% even in mixed ancestry populations. We also linked proteomes-to-proteomes and used the proteome only to determine features such as sex, ancestry, and first-degree relatives. When serial proteomes are available, the linking algorithm can be used to identify and correct mislabeled samples. This work also demonstrates the importance of including diverse populations in omics research and that large proteomic datasets (> 1000 proteins) can be accurately linked to a specific genome through pQTL knowledge and should not be considered unidentifiable.


Asunto(s)
Aterosclerosis , Proteoma , Humanos , Proteoma/genética , Teorema de Bayes , Privacidad , Estudio de Asociación del Genoma Completo , Aterosclerosis/genética , Polimorfismo de Nucleótido Simple
7.
Bioinformatics ; 39(1)2023 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-36548341

RESUMEN

MOTIVATION: Biological networks can provide a system-level understanding of underlying processes. In many contexts, networks have a high degree of modularity, i.e. they consist of subsets of nodes, often known as subnetworks or modules, which are highly interconnected and may perform separate functions. In order to perform subsequent analyses to investigate the association between the identified module and a variable of interest, a module summarization, that best explains the module's information and reduces dimensionality is often needed. Conventional approaches for obtaining network representation typically rely only on the profiles of the nodes within the network while disregarding the inherent network topological information. RESULTS: In this article, we propose NetSHy, a hybrid approach which is capable of reducing the dimension of a network while incorporating topological properties to aid the interpretation of the downstream analyses. In particular, NetSHy applies principal component analysis (PCA) on a combination of the node profiles and the well-known Laplacian matrix derived directly from the network similarity matrix to extract a summarization at a subject level. Simulation scenarios based on random and empirical networks at varying network sizes and sparsity levels show that NetSHy outperforms the conventional PCA approach applied directly on node profiles, in terms of recovering the true correlation with a phenotype of interest and maintaining a higher amount of explained variation in the data when networks are relatively sparse. The robustness of NetSHy is also demonstrated by a more consistent correlation with the observed phenotype as the sample size decreases. Lastly, a genome-wide association study is performed as an application of a downstream analysis, where NetSHy summarization scores on the biological networks identify more significant single nucleotide polymorphisms than the conventional network representation. AVAILABILITY AND IMPLEMENTATION: R code implementation of NetSHy is available at https://github.com/thaovu1/NetSHy. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Simulación por Computador , Análisis de Componente Principal , Tamaño de la Muestra
8.
Front Big Data ; 5: 894632, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35811829

RESUMEN

Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death in the United States. COPD represents one of many areas of research where identifying complex pathways and networks of interacting biomarkers is an important avenue toward studying disease progression and potentially discovering cures. Recently, sparse multiple canonical correlation network analysis (SmCCNet) was developed to identify complex relationships between omics associated with a disease phenotype, such as lung function. SmCCNet uses two sets of omics datasets and an associated output phenotypes to generate a multi-omics graph, which can then be used to explore relationships between omics in the context of a disease. Detecting significant subgraphs within this multi-omics network, i.e., subgraphs which exhibit high correlation to a disease phenotype and high inter-connectivity, can help clinicians identify complex biological relationships involved in disease progression. The current approach to identifying significant subgraphs relies on hierarchical clustering, which can be used to inform clinicians about important pathways involved in the disease or phenotype of interest. The reliance on a hierarchical clustering approach can hinder subgraph quality by biasing toward finding more compact subgraphs and removing larger significant subgraphs. This study aims to introduce new significant subgraph detection techniques. In particular, we introduce two subgraph detection methods, dubbed Correlated PageRank and Correlated Louvain, by extending the Personalized PageRank Clustering and Louvain algorithms, as well as a hybrid approach combining the two proposed methods, and compare them to the hierarchical method currently in use. The proposed methods show significant improvement in the quality of the subgraphs produced when compared to the current state of the art.

9.
Artículo en Inglés | MEDLINE | ID: mdl-36776768

RESUMEN

The study of complex behavior of biological systems has become increasingly dependent on evolutionary network modeling. In particular, multi-omics networks capture interactions between biomolecules such as proteins and metabolites, providing a basis for predicting relationships between such biomolecules and various phenotypic traits of complex diseases. In this paper, we introduce an integrative framework that given a multi-omics network representing a cohort of subjects, learns expressive representations for network nodes, and combines the learned nodes representations with the biological profiles of individual subjects for enriched representation of the subjects. With extensive empirical evaluation using real-world multi-omics networks, we show that our proposed framework significantly outperforms existing and baseline methods in terms of subject representation accuracy, particularly when the multi-omics network representing the cohort is sparse and structured and therefore, more informative.

10.
PLoS One ; 16(8): e0255337, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34432807

RESUMEN

Chronic Obstructive Pulmonary Disease (COPD) is the third leading cause of mortality in the United States; however, COPD has heterogeneous clinical phenotypes. This is the first large scale attempt which uses transcriptomics, proteomics, and metabolomics (multi-omics) to determine whether there are molecularly defined clusters with distinct clinical phenotypes that may underlie the clinical heterogeneity. Subjects included 3,278 subjects from the COPDGene cohort with at least one of the following profiles: whole blood transcriptomes (2,650 subjects); plasma proteomes (1,013 subjects); and plasma metabolomes (1,136 subjects). 489 subjects had all three contemporaneous -omics profiles. Autoencoder embeddings were performed individually for each -omics dataset. Embeddings underwent subspace clustering using MineClus, either individually by -omics or combined, followed by recursive feature selection based on Support Vector Machines. Clusters were tested for associations with clinical variables. Optimal single -omics clustering typically resulted in two clusters. Although there was overlap for individual -omics cluster membership, each -omics cluster tended to be defined by unique molecular pathways. For example, prominent molecular features of the metabolome-based clustering included sphingomyelin, while key molecular features of the transcriptome-based clusters were related to immune and bacterial responses. We also found that when we integrated the -omics data at a later stage, we identified subtypes that varied based on age, severity of disease, in addition to diffusing capacity of the lungs for carbon monoxide, and precent on atrial fibrillation. In contrast, when we integrated the -omics data at an earlier stage by treating all data sets equally, there were no clinical differences between subtypes. Similar to clinical clustering, which has revealed multiple heterogenous clinical phenotypes, we show that transcriptomics, proteomics, and metabolomics tend to define clusters of COPD patients with different clinical characteristics. Thus, integrating these different -omics data sets affords additional insight into the molecular nature of COPD and its heterogeneity.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Metabolómica/métodos , Proteómica/métodos , Enfermedad Pulmonar Obstructiva Crónica/clasificación , Factores de Edad , Anciano , Análisis por Conglomerados , Bases de Datos Factuales , Femenino , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Humanos , Masculino , Persona de Mediana Edad , Fenotipo , Enfermedad Pulmonar Obstructiva Crónica/sangre , Enfermedad Pulmonar Obstructiva Crónica/genética , Máquina de Vectores de Soporte
11.
Metabolites ; 11(3)2021 Mar 11.
Artículo en Inglés | MEDLINE | ID: mdl-33799786

RESUMEN

Susceptibility and progression of lung disease, as well as response to treatment, often differ by sex, yet the metabolic mechanisms driving these sex-specific differences are still poorly understood. Women with chronic obstructive pulmonary disease (COPD) have less emphysema and more small airway disease on average than men, though these differences become less pronounced with more severe airflow limitation. While small studies of targeted metabolites have identified compounds differing by sex and COPD status, the sex-specific effect of COPD on systemic metabolism has yet to be interrogated. Significant sex differences were observed in 9 of the 11 modules identified in COPDGene. Sex-specific associations by COPD status and emphysema were observed in 3 modules for each phenotype. Sex stratified individual metabolite associations with COPD demonstrated male-specific associations in sphingomyelins and female-specific associations in acyl carnitines and phosphatidylethanolamines. There was high preservation of module assignments in SPIROMICS (SubPopulations and InteRmediate Outcome Measures In COPD Study) and similar female-specific shift in acyl carnitines. Several COPD associated metabolites differed by sex. Acyl carnitines and sphingomyelins demonstrate sex-specific abundances and may represent important metabolic signatures of sex differences in COPD. Accurately characterizing the sex-specific molecular differences in COPD is vital for personalized diagnostics and therapeutics.

12.
PLoS Negl Trop Dis ; 15(1): e0009020, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33406094

RESUMEN

Genomic approaches hold great promise for resolving unanswered questions about transmission patterns and responses to control efforts for schistosomiasis and other neglected tropical diseases. However, the cost of generating genomic data and the challenges associated with obtaining sufficient DNA from individual schistosome larvae (miracidia) from mammalian hosts have limited the application of genomic data for studying schistosomes and other complex macroparasites. Here, we demonstrate the feasibility of utilizing whole genome amplification and sequencing (WGS) to analyze individual archival miracidia. As an example, we sequenced whole genomes of 22 miracidia from 11 human hosts representing two villages in rural Sichuan, China, and used these data to evaluate patterns of relatedness and genetic diversity. We also down-sampled our dataset to test how lower coverage sequencing could increase the cost effectiveness of WGS while maintaining power to accurately infer relatedness. Collectively, our results illustrate that population-level WGS datasets are attainable for individual miracidia and represent a powerful tool for ultimately providing insight into overall genetic diversity, parasite relatedness, and transmission patterns for better design and evaluation of disease control efforts.


Asunto(s)
Schistosoma japonicum/genética , Secuenciación Completa del Genoma/métodos , Adulto , Anciano , Anciano de 80 o más Años , Animales , Femenino , Variación Genética , Humanos , Masculino , Persona de Mediana Edad , Esquistosomiasis Japónica/transmisión
13.
Diabetes ; 70(3): 745-751, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33414248

RESUMEN

An adverse intrauterine environment is associated with the future risk of obesity and type 2 diabetes. Changes in placental function may underpin the intrauterine origins of adult disease, but longitudinal studies linking placental function with childhood outcomes are rare. Here, we determined the abundance and phosphorylation of protein intermediates involved in insulin signaling, inflammation, cortisol metabolism, protein glycosylation, and mitochondrial biogenesis in placental villus samples from healthy mothers from the Healthy Start cohort. Using MANOVA, we tested the association between placental proteins and offspring adiposity (fat mass percentage) at birth (n = 109) and infancy (4-6 months, n = 104), and adiposity, skinfold thickness, triglycerides, and insulin in children (4-6 years, n = 66). Placental IGF-1 receptor protein was positively associated with serum triglycerides in children. GSK3ß phosphorylation at serine 9, a readout of insulin and growth factor signaling, and the ratio of phosphorylated to total JNK2 were both positively associated with midthigh skinfold thickness in children. Moreover, peroxisome proliferator-activated receptor γ coactivator (PGC)-1α abundance was positively associated with insulin in children. In conclusion, placental insulin/IGF-1 signaling, PGC-1α, and inflammation pathways were positively associated with metabolic outcomes in 4- to 6-year-old children, identifying a novel link between placental function and long-term metabolic outcomes.


Asunto(s)
11-beta-Hidroxiesteroide Deshidrogenasa de Tipo 2/metabolismo , Factor I del Crecimiento Similar a la Insulina/metabolismo , Insulina/metabolismo , Proteína Quinasa 9 Activada por Mitógenos/metabolismo , Coactivador 1-alfa del Receptor Activado por Proliferadores de Peroxisomas gamma/metabolismo , Placenta/metabolismo , Proteínas Quinasas p38 Activadas por Mitógenos/metabolismo , 11-beta-Hidroxiesteroide Deshidrogenasa de Tipo 2/genética , Índice de Masa Corporal , Niño , Preescolar , Femenino , Humanos , Técnicas In Vitro , Lactante , Proteína Quinasa 9 Activada por Mitógenos/genética , Coactivador 1-alfa del Receptor Activado por Proliferadores de Peroxisomas gamma/genética , Fosforilación , Embarazo , Proteínas Quinasas p38 Activadas por Mitógenos/genética
14.
Netw Syst Med ; 3(1): 159-181, 2020 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-33987620

RESUMEN

Background: Small studies have recently suggested that there are specific plasma metabolic signatures in chronic obstructive pulmonary disease (COPD), but there have been no large comprehensive study of metabolomic signatures in COPD that also integrate genetic variants. Materials and Methods: Fresh frozen plasma from 957 non-Hispanic white subjects in COPDGene was used to quantify 995 metabolites with Metabolon's global metabolomics platform. Metabolite associations with five COPD phenotypes (chronic bronchitis, exacerbation frequency, percent emphysema, post-bronchodilator forced expiratory volume at one second [FEV1]/forced vital capacity [FVC], and FEV1 percent predicted) were assessed. A metabolome-wide association study was performed to find genetic associations with metabolite levels. Significantly associated single-nucleotide polymorphisms were tested for replication with independent metabolomic platforms and independent cohorts. COPD phenotype-driven modules were identified in network analysis integrated with genetic associations to assess gene-metabolite-phenotype interactions. Results: Of metabolites tested, 147 (14.8%) were significantly associated with at least 1 COPD phenotype. Associations with airflow obstruction were enriched for diacylglycerols and branched chain amino acids. Genetic associations were observed with 109 (11%) metabolites, 72 (66%) of which replicated in an independent cohort. For 20 metabolites, more than 20% of variance was explained by genetics. A sparse network of COPD phenotype-driven modules was identified, often containing metabolites missed in previous testing. Of the 26 COPD phenotype-driven modules, 6 contained metabolites with significant met-QTLs, although little module variance was explained by genetics. Conclusion: A dysregulation of systemic metabolism was predominantly found in COPD phenotypes characterized by airflow obstruction, where we identified robust heritable effects on individual metabolite abundances. However, network analysis, which increased the statistical power to detect associations missed previously in classic regression analyses, revealed that the genetic influence on COPD phenotype-driven metabolomic modules was modest when compared with clinical and environmental factors.

15.
Hypertension ; 74(2): 375-383, 2019 08.
Artículo en Inglés | MEDLINE | ID: mdl-31230546

RESUMEN

Hypertensive disorders of pregnancy (HDP) are associated with low birth weight, shorter gestational age, and increased risk of maternal and offspring cardiovascular diseases later in life. The mechanisms involved are poorly understood, but epigenetic regulation of gene expression may play a part. We performed meta-analyses in the Pregnancy and Childhood Epigenetics Consortium to test the association between either maternal HDP (10 cohorts; n=5242 [cases=476]) or preeclampsia (3 cohorts; n=2219 [cases=135]) and epigenome-wide DNA methylation in cord blood using the Illumina HumanMethylation450 BeadChip. In models adjusted for confounders, and with Bonferroni correction, HDP and preeclampsia were associated with DNA methylation at 43 and 26 CpG sites, respectively. HDP was associated with higher methylation at 27 (63%) of the 43 sites, and across all 43 sites, the mean absolute difference in methylation was between 0.6% and 2.6%. Epigenome-wide associations of HDP with offspring DNA methylation were modestly consistent with the equivalent epigenome-wide associations of preeclampsia with offspring DNA methylation (R2=0.26). In longitudinal analyses conducted in 1 study (n=108 HDP cases; 550 controls), there were similar changes in DNA methylation in offspring of those with and without HDP up to adolescence. Pathway analysis suggested that genes located at/near HDP-associated sites may be involved in developmental, embryogenesis, or neurological pathways. HDP is associated with offspring DNA methylation with potential relevance to development.


Asunto(s)
Metilación de ADN/genética , Proteínas de Unión al ADN/genética , Estudio de Asociación del Genoma Completo , Hipertensión Inducida en el Embarazo/genética , Recien Nacido Prematuro , Resultado del Embarazo , Adulto , Estudios de Cohortes , Epigénesis Genética , Femenino , Sangre Fetal , Edad Gestacional , Humanos , Hipertensión Inducida en el Embarazo/diagnóstico , Recién Nacido , Embarazo
16.
PLoS One ; 12(10): e0185682, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-29016655

RESUMEN

Chronic obstructive pulmonary disease (COPD) occurs typically in current or former smokers, but only a minority of people with smoking history develops the disease. Besides environmental factors, genetics is an important risk factor for COPD. However, the relationship between genetics, environment and phenotypes is not well understood. Sample sizes for genome-wide expression studies based on lung tissue have been small due to the invasive nature of sample collection. Increasing evidence for the systemic nature of the disease makes blood a good alternative source to study the disease, but there have also been few large-scale blood genomic studies in COPD. Due to the complexity and heterogeneity of COPD, examining groups of interacting genes may have more relevance than identifying individual genes. Therefore, we used Weighted Gene Co-expression Network Analysis to find groups of genes (modules) that are highly connected. However, module definitions may vary between individual data sets. To alleviate this problem, we used a consensus module definition based on two cohorts, COPDGene and ECLIPSE. We studied the relationship between the consensus modules and COPD phenotypes airflow obstruction and emphysema. We also used these consensus module definitions on an independent cohort (TESRA) and performed a meta analysis involving all data sets. We found several modules that are associated with COPD phenotypes, are enriched in functional categories and are overrepresented for cell-type specific genes. Of the 14 consensus modules, three were strongly associated with airflow obstruction (meta p ≤ 0.0002), and two had some association with emphysema (meta p ≤ 0.06); some associations were stronger in the case-control cohorts, and others in the cases-only subcohorts. Gene Ontology terms that were overrepresented included "immune response" and "defense response." The cell types whose type-specific genes were overrepresented in modules (p < 0.05) included natural killer cells, dendritic cells, and neutrophils. Together, this is the largest investigation of gene blood expression in COPD with 469 cases in COPDGene, ECLIPSE and TESRA combined, with 6267 genes common to all data sets. Additional, we have 42 and 83 controls in COPDGene and ECLIPSE, respectively.


Asunto(s)
Interacción Gen-Ambiente , Genómica , Enfermedad Pulmonar Obstructiva Crónica/sangre , Enfisema Pulmonar/sangre , Epistasis Genética , Expresión Génica/genética , Ontología de Genes , Humanos , Monocitos/metabolismo , Monocitos/patología , Neutrófilos/metabolismo , Neutrófilos/patología , Enfermedad Pulmonar Obstructiva Crónica/genética , Enfermedad Pulmonar Obstructiva Crónica/fisiopatología , Enfisema Pulmonar/genética , Factores de Riesgo , Espirometría
17.
Sci Rep ; 7(1): 3633, 2017 06 16.
Artículo en Inglés | MEDLINE | ID: mdl-28623356

RESUMEN

Human papillomavirus (HPV) infection distinctly alters methylation patterns in HPV-associated cancer. We have recently reported that HPV E7-dependent promoter hypermethylation leads to downregulation of the chemokine CXCL14 and suppression of antitumor immune responses. To investigate the extent of gene expression dysregulated by HPV E7-induced DNA methylation, we analyzed parallel global gene expression and DNA methylation using normal immortalized keratinocyte lines, NIKS, NIKS-16, NIKS-18, and NIKS-16∆E7. We show that expression of the MHC class I genes is downregulated in HPV-positive keratinocytes in an E7-dependent manner. Methylome analysis revealed hypermethylation at a distal CpG island (CGI) near the HLA-E gene in NIKS-16 cells compared to either NIKS cells or NIKS-16∆E7 cells, which lack E7 expression. The HLA-E CGI functions as an active promoter element which is dramatically repressed by DNA methylation. HLA-E protein expression on cell surface is downregulated by high-risk HPV16 and HPV18 E7 expression, but not by low-risk HPV6 and HPV11 E7 expression. Conversely, demethylation at the HLA-E CGI restores HLA-E protein expression in HPV-positive keratinocytes. Because HLA-E plays an important role in antiviral immunity by regulating natural killer and CD8+ T cells, epigenetic downregulation of HLA-E by high-risk HPV E7 may contribute to virus-induced immune evasion during HPV persistence.


Asunto(s)
Metilación de ADN , Regulación de la Expresión Génica , Antígenos de Histocompatibilidad Clase I/genética , Queratinocitos/metabolismo , Proteínas E7 de Papillomavirus/metabolismo , Transcriptoma , Presentación de Antígeno/genética , Transformación Celular Viral , Islas de CpG , Epigénesis Genética , Perfilación de la Expresión Génica , Humanos , Queratinocitos/virología , Modelos Biológicos , Regiones Promotoras Genéticas , Antígenos HLA-E
18.
Mamm Genome ; 27(9-10): 469-84, 2016 10.
Artículo en Inglés | MEDLINE | ID: mdl-27401171

RESUMEN

Gene co-expression analysis has proven to be a powerful tool for ascertaining the organization of gene products into networks that are important for organ function. An organ, such as the liver, engages in a multitude of functions important for the survival of humans, rats, and other animals; these liver functions include energy metabolism, metabolism of xenobiotics, immune system function, and hormonal homeostasis. With the availability of organ-specific transcriptomes, we can now examine the role of RNA transcripts (both protein-coding and non-coding) in these functions. A systems genetic approach for identifying and characterizing liver gene networks within a recombinant inbred panel of rats was used to identify genetically regulated transcriptional networks (modules). For these modules, biological consensus was found between functional enrichment analysis and publicly available phenotypic quantitative trait loci (QTL). In particular, the biological function of two liver modules could be linked to immune response. The eigengene QTLs for these co-expression modules were located at genomic regions coincident with highly significant phenotypic QTLs; these phenotypes were related to rheumatoid arthritis, food preference, and basal corticosterone levels in rats. Our analysis illustrates that genetically and biologically driven RNA-based networks, such as the ones identified as part of this research, provide insight into the genetic influences on organ functions. These networks can pinpoint phenotypes that manifest through the interaction of many organs/tissues and can identify unannotated or under-annotated RNA transcripts that play a role in these phenotypes.


Asunto(s)
Hígado/metabolismo , ARN/metabolismo , Animales , Femenino , Ontología de Genes , Sistema Inmunológico/metabolismo , Desequilibrio de Ligamiento , Hígado/inmunología , Escala de Lod , Masculino , Sitios de Carácter Cuantitativo , ARN/genética , Ratas Endogámicas SHR , Análisis de Secuencia de ARN , Transcriptoma
19.
J Invest Dermatol ; 135(8): 2068-2076, 2015 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-25822579

RESUMEN

In vitiligo, the autoimmune destruction of epidermal melanocytes produces white spots that can be repigmented by melanocyte precursors from the hair follicles, following stimulation with UV light. We examined by immunofluorescence the distribution of melanocyte markers (C-KIT, DCT, PAX3, and TYR) coupled with markers of proliferation (KI-67) and migration (MCAM) in precursors and mature melanocytes from the hair follicle and the epidermis of untreated and narrow band UVB (NBUVB)-treated human vitiligo skin. NBUVB was associated with a significant increase in the number of melanocytes in the infundibulum and with restoration of the normal melanocyte population in the epidermis, which was lacking in the untreated vitiligo. We identified several precursor populations (melanocyte stem cells, melanoblasts, and other immature phenotypes), and progressively differentiating melanocytes, some with putative migratory and/or proliferative abilities. The primary melanocyte germ was present in the untreated and treated hair follicle bulge, whereas a possible secondary melanocyte germ composed of C-KIT+ melanocytes was found in the infundibulum and interfollicular epidermis of UV-treated vitiligo. This is an exceptional model for studying the mobilization of melanocyte stem cells in human skin. Improved understanding of this process is essential for designing better treatments for vitiligo, ultimately based on melanocyte stem cell activation and mobilization.


Asunto(s)
Melanocitos/patología , Células Madre/patología , Rayos Ultravioleta , Terapia Ultravioleta , Vitíligo/patología , Vitíligo/radioterapia , Diferenciación Celular/efectos de la radiación , Movimiento Celular/efectos de la radiación , Proliferación Celular/efectos de la radiación , Epidermis/metabolismo , Epidermis/patología , Epidermis/efectos de la radiación , Folículo Piloso/metabolismo , Folículo Piloso/patología , Folículo Piloso/efectos de la radiación , Humanos , Oxidorreductasas Intramoleculares/metabolismo , Melanocitos/metabolismo , Melanocitos/efectos de la radiación , Factor de Transcripción PAX3 , Factores de Transcripción Paired Box/metabolismo , Fenotipo , Proteínas Proto-Oncogénicas c-kit/metabolismo , Células Madre/metabolismo , Células Madre/efectos de la radiación , Vitíligo/metabolismo
20.
Nucleic Acids Res ; 42(17): e133, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25063298

RESUMEN

microRNAs (miRNAs) regulate expression by promoting degradation or repressing translation of target transcripts. miRNA target sites have been catalogued in databases based on experimental validation and computational prediction using various algorithms. Several online resources provide collections of multiple databases but need to be imported into other software, such as R, for processing, tabulation, graphing and computation. Currently available miRNA target site packages in R are limited in the number of databases, types of databases and flexibility. We present multiMiR, a new miRNA-target interaction R package and database, which includes several novel features not available in existing R packages: (i) compilation of nearly 50 million records in human and mouse from 14 different databases, more than any other collection; (ii) expansion of databases to those based on disease annotation and drug microRNAresponse, in addition to many experimental and computational databases; and (iii) user-defined cutoffs for predicted binding strength to provide the most confident selection. Case studies are reported on various biomedical applications including mouse models of alcohol consumption, studies of chronic obstructive pulmonary disease in human subjects, and human cell line models of bladder cancer metastasis. We also demonstrate how multiMiR was used to generate testable hypotheses that were pursued experimentally.


Asunto(s)
Regiones no Traducidas 3' , Bases de Datos de Ácidos Nucleicos , MicroARNs/metabolismo , Programas Informáticos , Consumo de Bebidas Alcohólicas/genética , Animales , Antineoplásicos/farmacología , Línea Celular Tumoral , Humanos , Ratones , Metástasis de la Neoplasia , Enfermedad Pulmonar Obstructiva Crónica/genética , Neoplasias de la Vejiga Urinaria/genética , Neoplasias de la Vejiga Urinaria/patología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...