Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Front Artif Intell ; 7: 1366273, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38525301

RESUMO

High-throughput sequencing has created an exponential increase in the amount of gene expression data, much of which is freely, publicly available in repositories such as NCBI's Gene Expression Omnibus (GEO). Querying this data for patterns such as similarity and distance, however, becomes increasingly challenging as the total amount of data increases. Furthermore, vectorization of the data is commonly required in Artificial Intelligence and Machine Learning (AI/ML) approaches. We present BioVDB, a vector database for storage and analysis of gene expression data, which enhances the potential for integrating biological studies with AI/ML tools. We used a previously developed approach called Automatic Label Extraction (ALE) to extract sample labels from metadata, including age, sex, and tissue/cell-line. BioVDB stores 438,562 samples from eight microarray GEO platforms. We show that it allows for efficient querying of data using similarity search, which can also be useful for identifying and inferring missing labels of samples, and for rapid similarity analysis.

2.
Biophys Chem ; 303: 107107, 2023 12.
Artigo em Inglês | MEDLINE | ID: mdl-37862761

RESUMO

The self-assembly of proteins is encoded in the underlying potential energy surface (PES), from which we can predict structure, dynamics, and thermodynamic properties. However, the corresponding analysis becomes increasingly challenging with larger protein sizes, due to the computational time required, which grows significantly with the number of atoms. Coarse-grained models offer an attractive approach to reduce the computational cost. In this Feature Article, we describe our implementation of the UNited RESidue (UNRES) coarse-grained potential in the Cambridge energy landscapes software. We have applied this framework to explore the energy landscapes of four proteins that exhibit native states involving different secondary structures. Here we have tested the ability of the UNRES potential to represent the global energy landscape of proteins containing up to 100 amino acid residues. The resulting potential energy landscapes exhibit good agreement with experiment, with low-lying minima close to the PDB geometries and to results obtained using the all-atom AMBER force field. The new program interfaces will allow us to investigate larger biomolecules in future work, using the UNRES potential in combination with all the methodology available in the computational energy landscapes framework.


Assuntos
Proteínas , Software , Conformação Proteica , Proteínas/química , Estrutura Secundária de Proteína , Termodinâmica , Simulação de Dinâmica Molecular
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA