Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
RNA Biol ; 21(1): 1-12, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38528797

RESUMO

The accurate classification of non-coding RNA (ncRNA) sequences is pivotal for advanced non-coding genome annotation and analysis, a fundamental aspect of genomics that facilitates understanding of ncRNA functions and regulatory mechanisms in various biological processes. While traditional machine learning approaches have been employed for distinguishing ncRNA, these often necessitate extensive feature engineering. Recently, deep learning algorithms have provided advancements in ncRNA classification. This study presents BioDeepFuse, a hybrid deep learning framework integrating convolutional neural networks (CNN) or bidirectional long short-term memory (BiLSTM) networks with handcrafted features for enhanced accuracy. This framework employs a combination of k-mer one-hot, k-mer dictionary, and feature extraction techniques for input representation. Extracted features, when embedded into the deep network, enable optimal utilization of spatial and sequential nuances of ncRNA sequences. Using benchmark datasets and real-world RNA samples from bacterial organisms, we evaluated the performance of BioDeepFuse. Results exhibited high accuracy in ncRNA classification, underscoring the robustness of our tool in addressing complex ncRNA sequence data challenges. The effective melding of CNN or BiLSTM with external features heralds promising directions for future research, particularly in refining ncRNA classifiers and deepening insights into ncRNAs in cellular processes and disease manifestations. In addition to its original application in the context of bacterial organisms, the methodologies and techniques integrated into our framework can potentially render BioDeepFuse effective in various and broader domains.


Assuntos
Aprendizado Profundo , RNA não Traduzido/genética , Algoritmos , RNA , Redes Neurais de Computação
2.
Brief Bioinform ; 23(4)2022 07 18.
Artigo em Inglês | MEDLINE | ID: mdl-35753697

RESUMO

Recent technological advances have led to an exponential expansion of biological sequence data and extraction of meaningful information through Machine Learning (ML) algorithms. This knowledge has improved the understanding of mechanisms related to several fatal diseases, e.g. Cancer and coronavirus disease 2019, helping to develop innovative solutions, such as CRISPR-based gene editing, coronavirus vaccine and precision medicine. These advances benefit our society and economy, directly impacting people's lives in various areas, such as health care, drug discovery, forensic analysis and food processing. Nevertheless, ML-based approaches to biological data require representative, quantitative and informative features. Many ML algorithms can handle only numerical data, and therefore sequences need to be translated into a numerical feature vector. This process, known as feature extraction, is a fundamental step for developing high-quality ML-based models in bioinformatics, by allowing the feature engineering stage, with design and selection of suitable features. Feature engineering, ML algorithm selection and hyperparameter tuning are often manual and time-consuming processes, requiring extensive domain knowledge. To deal with this problem, we present a new package: BioAutoML. BioAutoML automatically runs an end-to-end ML pipeline, extracting numerical and informative features from biological sequence databases, using the MathFeature package, and automating the feature selection, ML algorithm(s) recommendation and tuning of the selected algorithm(s) hyperparameters, using Automated ML (AutoML). BioAutoML has two components, divided into four modules: (1) automated feature engineering (feature extraction and selection modules) and (2) Metalearning (algorithm recommendation and hyper-parameter tuning modules). We experimentally evaluate BioAutoML in two different scenarios: (i) prediction of the three main classes of noncoding RNAs (ncRNAs) and (ii) prediction of the eight categories of ncRNAs in bacteria, including housekeeping and regulatory types. To assess BioAutoML predictive performance, it is experimentally compared with two other AutoML tools (RECIPE and TPOT). According to the experimental results, BioAutoML can accelerate new studies, reducing the cost of feature engineering processing and either keeping or improving predictive performance. BioAutoML is freely available at https://github.com/Bonidia/BioAutoML.


Assuntos
Vacinas contra COVID-19 , COVID-19 , Algoritmos , Bactérias/genética , Humanos , Aprendizado de Máquina
3.
Entropy (Basel) ; 24(10)2022 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-37420418

RESUMO

In recent years, there has been an exponential growth in sequencing projects due to accelerated technological advances, leading to a significant increase in the amount of data and resulting in new challenges for biological sequence analysis. Consequently, the use of techniques capable of analyzing large amounts of data has been explored, such as machine learning (ML) algorithms. ML algorithms are being used to analyze and classify biological sequences, despite the intrinsic difficulty in extracting and finding representative biological sequence methods suitable for them. Thereby, extracting numerical features to represent sequences makes it statistically feasible to use universal concepts from Information Theory, such as Tsallis and Shannon entropy. In this study, we propose a novel Tsallis entropy-based feature extractor to provide useful information to classify biological sequences. To assess its relevance, we prepared five case studies: (1) an analysis of the entropic index q; (2) performance testing of the best entropic indices on new datasets; (3) a comparison made with Shannon entropy and (4) generalized entropies; (5) an investigation of the Tsallis entropy in the context of dimensionality reduction. As a result, our proposal proved to be effective, being superior to Shannon entropy and robust in terms of generalization, and also potentially representative for collecting information in fewer dimensions compared with methods such as Singular Value Decomposition and Uniform Manifold Approximation and Projection.

4.
J Infect Dis ; 222(4): 670-680, 2020 07 23.
Artigo em Inglês | MEDLINE | ID: mdl-32311029

RESUMO

BACKGROUND: Zika virus (ZIKV) is an emergent flavivirus initially considered a benign and self-limited exanthematic illness. In 2015, a new epidemic emerged in northeastern of Brazil with increased incidence of a previously rare clinical outcome, microcephaly, in newborns from mothers who were infected during pregnancy. Little is known about the immunopathogenesis of ZIKV-associated microcephaly. Understanding the inflammatory profile and degree of inflammation of persons affected with such condition is an important step towards development of innovative therapeutic strategies. METHODS: A case-control study compared plasma levels of several inflammatory biomarkers from newborns with ZIKV microcephaly, asymptomatic ZKV infection, or uninfected controls. Plasma biomarkers were assessed using Luminex. A series of multidimensional analysis was performed to characterize the systemic immune activation profile of the clinical groups. RESULTS: We identified an inflammatory signature associated with ZIKV microcephaly that suggested an increased inflammation. Network analysis suggested that ZIKV microcephaly is associated with imbalanced immune activation and inflammation. The cephalic perimeter was inversely proportional with the degree of inflammatory perturbation. Furthermore, a combination of plasma inflammatory biomarkers could discriminate ZIKV with microcephaly from those with ZIKV without microcephaly or uninfected neonates. CONCLUSIONS: An intense inflammatory imbalance that is proportional to the disease severity hallmarks ZIKV microcephaly.


Assuntos
Biomarcadores/sangue , Inflamação/complicações , Microcefalia/etiologia , Infecção por Zika virus/complicações , Brasil , Estudos de Casos e Controles , Feminino , Humanos , Recém-Nascido , Masculino , Microcefalia/diagnóstico , Zika virus/patogenicidade , Infecção por Zika virus/sangue , Infecção por Zika virus/virologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...