Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
1.
Sci Data ; 11(1): 363, 2024 Apr 11.
Article in English | MEDLINE | ID: mdl-38605048

ABSTRACT

Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data, but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to construct them automatically. However, tackling complex biomedical integration problems requires flexibility in the way knowledge is modeled. Moreover, existing KG construction methods provide robust tooling at the cost of fixed or limited choices among knowledge representation models. PheKnowLator (Phenotype Knowledge Translator) is a semantic ecosystem for automating the FAIR (Findable, Accessible, Interoperable, and Reusable) construction of ontologically grounded KGs with fully customizable knowledge representation. The ecosystem includes KG construction resources (e.g., data preparation APIs), analysis tools (e.g., SPARQL endpoint resources and abstraction algorithms), and benchmarks (e.g., prebuilt KGs). We evaluated the ecosystem by systematically comparing it to existing open-source KG construction methods and by analyzing its computational performance when used to construct 12 different large-scale KGs. With flexible knowledge representation, PheKnowLator enables fully customizable KGs without compromising performance or usability.


Subject(s)
Biological Science Disciplines , Knowledge Bases , Pattern Recognition, Automated , Algorithms , Translational Research, Biomedical
2.
Front Bioinform ; 2: 1054578, 2022.
Article in English | MEDLINE | ID: mdl-36568701

ABSTRACT

Molecular "cartoons," such as pathway diagrams, provide a visual summary of biomedical research results and hypotheses. Their ubiquitous appearance within the literature indicates their universal application in mechanistic communication. A recent survey of pathway diagrams identified 64,643 pathway figures published between 1995 and 2019 with 1,112,551 mentions of 13,464 unique human genes participating in a wide variety of biological processes. Researchers generally create these diagrams using generic diagram editing software that does not itself embody any biomedical knowledge. Biomedical knowledge graphs (KGs) integrate and represent knowledge in a semantically consistent way, systematically capturing biomedical knowledge similar to that in molecular cartoons. KGs have the potential to provide context and precise details useful in drawing such figures. However, KGs cannot generally be translated directly into figures. They include substantial material irrelevant to the scientific point of a given figure and are often more detailed than is appropriate. How could KGs be used to facilitate the creation of molecular diagrams? Here we present a new approach towards cartoon image creation that utilizes the semantic structure of knowledge graphs to aid the production of molecular diagrams. We introduce a set of "semantic graphical actions" that select and transform the relational information between heterogeneous entities (e.g., genes, proteins, pathways, diseases) in a KG to produce diagram schematics that meet the scientific communication needs of the user. These semantic actions search, select, filter, transform, group, arrange, connect and extract relevant subgraphs from KGs based on meaning in biological terms, e.g., a protein upstream of a target in a pathway. To demonstrate the utility of this approach, we show how semantic graphical actions on KGs could have been used to produce three existing pathway diagrams in diverse biomedical domains: Down Syndrome, COVID-19, and neuroinflammation. Our focus is on recapitulating the semantic content of the figures, not the layout, glyphs, or other aesthetic aspects. Our results suggest that the use of KGs and semantic graphical actions to produce biomedical diagrams will reduce the effort required and improve the quality of this visual form of scientific communication.

3.
PLoS One ; 16(8): e0255337, 2021.
Article in English | MEDLINE | ID: mdl-34432807

ABSTRACT

Chronic Obstructive Pulmonary Disease (COPD) is the third leading cause of mortality in the United States; however, COPD has heterogeneous clinical phenotypes. This is the first large scale attempt which uses transcriptomics, proteomics, and metabolomics (multi-omics) to determine whether there are molecularly defined clusters with distinct clinical phenotypes that may underlie the clinical heterogeneity. Subjects included 3,278 subjects from the COPDGene cohort with at least one of the following profiles: whole blood transcriptomes (2,650 subjects); plasma proteomes (1,013 subjects); and plasma metabolomes (1,136 subjects). 489 subjects had all three contemporaneous -omics profiles. Autoencoder embeddings were performed individually for each -omics dataset. Embeddings underwent subspace clustering using MineClus, either individually by -omics or combined, followed by recursive feature selection based on Support Vector Machines. Clusters were tested for associations with clinical variables. Optimal single -omics clustering typically resulted in two clusters. Although there was overlap for individual -omics cluster membership, each -omics cluster tended to be defined by unique molecular pathways. For example, prominent molecular features of the metabolome-based clustering included sphingomyelin, while key molecular features of the transcriptome-based clusters were related to immune and bacterial responses. We also found that when we integrated the -omics data at a later stage, we identified subtypes that varied based on age, severity of disease, in addition to diffusing capacity of the lungs for carbon monoxide, and precent on atrial fibrillation. In contrast, when we integrated the -omics data at an earlier stage by treating all data sets equally, there were no clinical differences between subtypes. Similar to clinical clustering, which has revealed multiple heterogenous clinical phenotypes, we show that transcriptomics, proteomics, and metabolomics tend to define clusters of COPD patients with different clinical characteristics. Thus, integrating these different -omics data sets affords additional insight into the molecular nature of COPD and its heterogeneity.


Subject(s)
Gene Expression Profiling/methods , Metabolomics/methods , Proteomics/methods , Pulmonary Disease, Chronic Obstructive/classification , Age Factors , Aged , Cluster Analysis , Databases, Factual , Female , Gene Expression Regulation , Gene Regulatory Networks , Humans , Male , Middle Aged , Phenotype , Pulmonary Disease, Chronic Obstructive/blood , Pulmonary Disease, Chronic Obstructive/genetics , Support Vector Machine
4.
Respir Res ; 22(1): 127, 2021 Apr 27.
Article in English | MEDLINE | ID: mdl-33906653

ABSTRACT

BACKGROUND: Soluble receptor for advanced glycation end products (sRAGE) is a proposed emphysema and airflow obstruction biomarker; however, previous publications have shown inconsistent associations and only one study has investigate the association between sRAGE and emphysema. No cohorts have examined the association between sRAGE and progressive decline of lung function. There have also been no evaluation of assay compatibility, receiver operating characteristics, and little examination of the effect of genetic variability in non-white population. This manuscript addresses these deficiencies and introduces novel data from Pittsburgh COPD SCCOR and as well as novel work on airflow obstruction. A meta-analysis is used to quantify sRAGE associations with clinical phenotypes. METHODS: sRAGE was measured in four independent longitudinal cohorts on different analytic assays: COPDGene (n = 1443); SPIROMICS (n = 1623); ECLIPSE (n = 2349); Pittsburgh COPD SCCOR (n = 399). We constructed adjusted linear mixed models to determine associations of sRAGE with baseline and follow up forced expiratory volume at one second (FEV1) and emphysema by quantitative high-resolution CT lung density at the 15th percentile (adjusted for total lung capacity). RESULTS: Lower plasma or serum sRAGE values were associated with a COPD diagnosis (P < 0.001), reduced FEV1 (P < 0.001), and emphysema severity (P < 0.001). In an inverse-variance weighted meta-analysis, one SD lower log10-transformed sRAGE was associated with 105 ± 22 mL lower FEV1 and 4.14 ± 0.55 g/L lower adjusted lung density. After adjusting for covariates, lower sRAGE at baseline was associated with greater FEV1 decline and emphysema progression only in the ECLIPSE cohort. Non-Hispanic white subjects carrying the rs2070600 minor allele (A) and non-Hispanic African Americans carrying the rs2071288 minor allele (A) had lower sRAGE measurements compare to those with the major allele, but their emphysema-sRAGE regression slopes were similar. CONCLUSIONS: Lower blood sRAGE is associated with more severe airflow obstruction and emphysema, but associations with progression are inconsistent in the cohorts analyzed. In these cohorts, genotype influenced sRAGE measurements and strengthened variance modelling. Thus, genotype should be included in sRAGE evaluations.


Subject(s)
Lung/physiopathology , Pulmonary Disease, Chronic Obstructive/blood , Pulmonary Emphysema/blood , Receptor for Advanced Glycation End Products/blood , Aged , Biomarkers/blood , Female , Forced Expiratory Volume , Humans , Longitudinal Studies , Lung/diagnostic imaging , Male , Middle Aged , Phenotype , Pulmonary Disease, Chronic Obstructive/diagnosis , Pulmonary Disease, Chronic Obstructive/physiopathology , Pulmonary Emphysema/diagnosis , Pulmonary Emphysema/physiopathology , Severity of Illness Index , Spirometry , Tomography, X-Ray Computed , Vital Capacity
5.
Metabolites ; 11(3)2021 Mar 11.
Article in English | MEDLINE | ID: mdl-33799786

ABSTRACT

Susceptibility and progression of lung disease, as well as response to treatment, often differ by sex, yet the metabolic mechanisms driving these sex-specific differences are still poorly understood. Women with chronic obstructive pulmonary disease (COPD) have less emphysema and more small airway disease on average than men, though these differences become less pronounced with more severe airflow limitation. While small studies of targeted metabolites have identified compounds differing by sex and COPD status, the sex-specific effect of COPD on systemic metabolism has yet to be interrogated. Significant sex differences were observed in 9 of the 11 modules identified in COPDGene. Sex-specific associations by COPD status and emphysema were observed in 3 modules for each phenotype. Sex stratified individual metabolite associations with COPD demonstrated male-specific associations in sphingomyelins and female-specific associations in acyl carnitines and phosphatidylethanolamines. There was high preservation of module assignments in SPIROMICS (SubPopulations and InteRmediate Outcome Measures In COPD Study) and similar female-specific shift in acyl carnitines. Several COPD associated metabolites differed by sex. Acyl carnitines and sphingomyelins demonstrate sex-specific abundances and may represent important metabolic signatures of sex differences in COPD. Accurately characterizing the sex-specific molecular differences in COPD is vital for personalized diagnostics and therapeutics.

6.
JCI Insight ; 5(14)2020 07 23.
Article in English | MEDLINE | ID: mdl-32573489

ABSTRACT

Cigarette smoking (CS) and genetic susceptibility determine the risk for development, progression, and severity of chronic obstructive pulmonary diseases (COPD). We posited that an incidental balanced reciprocal chromosomal translocation was linked to a patient's risk of severe COPD. We determined that 46,XX,t(1;4)(p13.1;q34.3) caused a breakpoint in the immunoglobulin superfamily member 3 (IGSF3) gene, with markedly decreased expression. Examination of COPDGene cohort identified 14 IGSF3 SNPs, of which rs1414272 and rs12066192 were directly and rs6703791 inversely associated with COPD severity, including COPD exacerbations. We confirmed that IGSF3 is a tetraspanin-interacting protein that colocalized with CD9 and integrin B1 in tetraspanin-enriched domains. IGSF3-deficient patient-derived lymphoblastoids exhibited multiple alterations in gene expression, especially in the unfolded protein response and ceramide pathways. IGSF3-deficient lymphoblastoids had high ceramide and sphingosine-1 phosphate but low glycosphingolipids and ganglioside levels, and they were less apoptotic and more adherent, with marked changes in multiple TNFRSF molecules. Similarly, IGSF3 knockdown increased ceramide in lung structural cells, rendering them more adherent, with impaired wound repair and weakened barrier function. These findings suggest that, by maintaining sphingolipid and membrane receptor homeostasis, IGSF3 is required for cell mobility-mediated lung injury repair. IGSF3 deficiency may increase susceptibility to CS-induced lung injury in COPD.


Subject(s)
Cigarette Smoking/genetics , Genetic Predisposition to Disease , Immunoglobulins/genetics , Membrane Proteins/genetics , Pulmonary Disease, Chronic Obstructive/genetics , Translocation, Genetic/genetics , Apoptosis/genetics , Cell Adhesion/genetics , Cell Movement/genetics , Chromosomes, Human, Pair 1/genetics , Chromosomes, Human, Pair 4/genetics , Cigarette Smoking/adverse effects , Female , Gene Expression Regulation/genetics , Humans , Integrin beta1/genetics , Male , Middle Aged , Mutation/genetics , Polymorphism, Single Nucleotide/genetics , Pulmonary Disease, Chronic Obstructive/chemically induced , Pulmonary Disease, Chronic Obstructive/pathology , Severity of Illness Index , Tetraspanin 29/genetics
7.
Proteomics ; 20(12): e1900278, 2020 06.
Article in English | MEDLINE | ID: mdl-32386347

ABSTRACT

Novel proteomics platforms, such as the aptamer-based SOMAscan platform, can quantify large numbers of proteins efficiently and cost-effectively and are rapidly growing in popularity. However, comparisons to conventional immunoassays remain underexplored, leaving investigators unsure when cross-assay comparisons are appropriate. The correlation of results from immunoassays with relative protein quantification is explored by SOMAscan. For 63 proteins assessed in two chronic obstructive pulmonary disease (COPD) cohorts, subpopulations and intermediate outcome measures in COPD Study (SPIROMICS), and COPDGene, using myriad rules based medicine multiplex immunoassays and SOMAscan, Spearman correlation coefficients range from -0.13 to 0.97, with a median correlation coefficient of ≈0.5 and consistent results across cohorts. A similar range is observed for immunoassays in the population-based Multi-Ethnic Study of Atherosclerosis and for other assays in COPDGene and SPIROMICS. Comparisons of relative quantification from the antibody-based Olink platform and SOMAscan in a small cohort of myocardial infarction patients also show a wide correlation range. Finally, cis pQTL data, mass spectrometry aptamer confirmation, and other publicly available data are integrated to assess relationships with observed correlations. Correlation between proteomics assays shows a wide range and should be carefully considered when comparing and meta-analyzing proteomics data across assays and studies.


Subject(s)
Myocardial Infarction/metabolism , Proteome/metabolism , Proteomics/methods , Pulmonary Disease, Chronic Obstructive/metabolism , Smokers/statistics & numerical data , Adult , Aged , Aged, 80 and over , Cohort Studies , Female , Humans , Immunoassay/methods , Male , Middle Aged , Myocardial Infarction/blood , Pulmonary Disease, Chronic Obstructive/blood
8.
Netw Syst Med ; 3(1): 159-181, 2020 Dec 01.
Article in English | MEDLINE | ID: mdl-33987620

ABSTRACT

Background: Small studies have recently suggested that there are specific plasma metabolic signatures in chronic obstructive pulmonary disease (COPD), but there have been no large comprehensive study of metabolomic signatures in COPD that also integrate genetic variants. Materials and Methods: Fresh frozen plasma from 957 non-Hispanic white subjects in COPDGene was used to quantify 995 metabolites with Metabolon's global metabolomics platform. Metabolite associations with five COPD phenotypes (chronic bronchitis, exacerbation frequency, percent emphysema, post-bronchodilator forced expiratory volume at one second [FEV1]/forced vital capacity [FVC], and FEV1 percent predicted) were assessed. A metabolome-wide association study was performed to find genetic associations with metabolite levels. Significantly associated single-nucleotide polymorphisms were tested for replication with independent metabolomic platforms and independent cohorts. COPD phenotype-driven modules were identified in network analysis integrated with genetic associations to assess gene-metabolite-phenotype interactions. Results: Of metabolites tested, 147 (14.8%) were significantly associated with at least 1 COPD phenotype. Associations with airflow obstruction were enriched for diacylglycerols and branched chain amino acids. Genetic associations were observed with 109 (11%) metabolites, 72 (66%) of which replicated in an independent cohort. For 20 metabolites, more than 20% of variance was explained by genetics. A sparse network of COPD phenotype-driven modules was identified, often containing metabolites missed in previous testing. Of the 26 COPD phenotype-driven modules, 6 contained metabolites with significant met-QTLs, although little module variance was explained by genetics. Conclusion: A dysregulation of systemic metabolism was predominantly found in COPD phenotypes characterized by airflow obstruction, where we identified robust heritable effects on individual metabolite abundances. However, network analysis, which increased the statistical power to detect associations missed previously in classic regression analyses, revealed that the genetic influence on COPD phenotype-driven metabolomic modules was modest when compared with clinical and environmental factors.

SELECTION OF CITATIONS
SEARCH DETAIL
...