Discovering and Summarizing Relationships Between Chemicals, Genes, Proteins, and Diseases in PubChem.
Front Res Metr Anal
; 6: 689059, 2021.
Article
in En
| MEDLINE
| ID: mdl-34322655
ABSTRACT
The literature knowledge panels developed and implemented in PubChem are described. These help to uncover and summarize important relationships between chemicals, genes, proteins, and diseases by analyzing co-occurrences of terms in biomedical literature abstracts. Named entities in PubMed records are matched with chemical names in PubChem, disease names in Medical Subject Headings (MeSH), and gene/protein names in popular gene/protein information resources, and the most closely related entities are identified using statistical analysis and relevance-based sampling. Knowledge panels for the co-occurrence of chemical, disease, and gene/protein entities are included in PubChem Compound, Protein, and Gene pages, summarizing these in a compact form. Statistical methods for removing redundancy and estimating relevance scores are discussed, along with benefits and pitfalls of relying on automated (i.e., not human-curated) methods operating on data from multiple heterogeneous sources.
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Type of study:
Prognostic_studies
Language:
En
Journal:
Front Res Metr Anal
Year:
2021
Type:
Article
Affiliation country:
United States