Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Brief Bioinform ; 23(4)2022 07 18.
Artículo en Inglés | MEDLINE | ID: mdl-35753697

RESUMEN

Recent technological advances have led to an exponential expansion of biological sequence data and extraction of meaningful information through Machine Learning (ML) algorithms. This knowledge has improved the understanding of mechanisms related to several fatal diseases, e.g. Cancer and coronavirus disease 2019, helping to develop innovative solutions, such as CRISPR-based gene editing, coronavirus vaccine and precision medicine. These advances benefit our society and economy, directly impacting people's lives in various areas, such as health care, drug discovery, forensic analysis and food processing. Nevertheless, ML-based approaches to biological data require representative, quantitative and informative features. Many ML algorithms can handle only numerical data, and therefore sequences need to be translated into a numerical feature vector. This process, known as feature extraction, is a fundamental step for developing high-quality ML-based models in bioinformatics, by allowing the feature engineering stage, with design and selection of suitable features. Feature engineering, ML algorithm selection and hyperparameter tuning are often manual and time-consuming processes, requiring extensive domain knowledge. To deal with this problem, we present a new package: BioAutoML. BioAutoML automatically runs an end-to-end ML pipeline, extracting numerical and informative features from biological sequence databases, using the MathFeature package, and automating the feature selection, ML algorithm(s) recommendation and tuning of the selected algorithm(s) hyperparameters, using Automated ML (AutoML). BioAutoML has two components, divided into four modules: (1) automated feature engineering (feature extraction and selection modules) and (2) Metalearning (algorithm recommendation and hyper-parameter tuning modules). We experimentally evaluate BioAutoML in two different scenarios: (i) prediction of the three main classes of noncoding RNAs (ncRNAs) and (ii) prediction of the eight categories of ncRNAs in bacteria, including housekeeping and regulatory types. To assess BioAutoML predictive performance, it is experimentally compared with two other AutoML tools (RECIPE and TPOT). According to the experimental results, BioAutoML can accelerate new studies, reducing the cost of feature engineering processing and either keeping or improving predictive performance. BioAutoML is freely available at https://github.com/Bonidia/BioAutoML.


Asunto(s)
Vacunas contra la COVID-19 , COVID-19 , Algoritmos , Bacterias/genética , Humanos , Aprendizaje Automático
2.
RNA Biol ; 21(1): 1-12, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38528797

RESUMEN

The accurate classification of non-coding RNA (ncRNA) sequences is pivotal for advanced non-coding genome annotation and analysis, a fundamental aspect of genomics that facilitates understanding of ncRNA functions and regulatory mechanisms in various biological processes. While traditional machine learning approaches have been employed for distinguishing ncRNA, these often necessitate extensive feature engineering. Recently, deep learning algorithms have provided advancements in ncRNA classification. This study presents BioDeepFuse, a hybrid deep learning framework integrating convolutional neural networks (CNN) or bidirectional long short-term memory (BiLSTM) networks with handcrafted features for enhanced accuracy. This framework employs a combination of k-mer one-hot, k-mer dictionary, and feature extraction techniques for input representation. Extracted features, when embedded into the deep network, enable optimal utilization of spatial and sequential nuances of ncRNA sequences. Using benchmark datasets and real-world RNA samples from bacterial organisms, we evaluated the performance of BioDeepFuse. Results exhibited high accuracy in ncRNA classification, underscoring the robustness of our tool in addressing complex ncRNA sequence data challenges. The effective melding of CNN or BiLSTM with external features heralds promising directions for future research, particularly in refining ncRNA classifiers and deepening insights into ncRNAs in cellular processes and disease manifestations. In addition to its original application in the context of bacterial organisms, the methodologies and techniques integrated into our framework can potentially render BioDeepFuse effective in various and broader domains.


Asunto(s)
Aprendizaje Profundo , ARN no Traducido/genética , Algoritmos , ARN , Redes Neurales de la Computación
3.
Entropy (Basel) ; 24(10)2022 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-37420418

RESUMEN

In recent years, there has been an exponential growth in sequencing projects due to accelerated technological advances, leading to a significant increase in the amount of data and resulting in new challenges for biological sequence analysis. Consequently, the use of techniques capable of analyzing large amounts of data has been explored, such as machine learning (ML) algorithms. ML algorithms are being used to analyze and classify biological sequences, despite the intrinsic difficulty in extracting and finding representative biological sequence methods suitable for them. Thereby, extracting numerical features to represent sequences makes it statistically feasible to use universal concepts from Information Theory, such as Tsallis and Shannon entropy. In this study, we propose a novel Tsallis entropy-based feature extractor to provide useful information to classify biological sequences. To assess its relevance, we prepared five case studies: (1) an analysis of the entropic index q; (2) performance testing of the best entropic indices on new datasets; (3) a comparison made with Shannon entropy and (4) generalized entropies; (5) an investigation of the Tsallis entropy in the context of dimensionality reduction. As a result, our proposal proved to be effective, being superior to Shannon entropy and robust in terms of generalization, and also potentially representative for collecting information in fewer dimensions compared with methods such as Singular Value Decomposition and Uniform Manifold Approximation and Projection.

4.
J Infect Dis ; 222(4): 670-680, 2020 07 23.
Artículo en Inglés | MEDLINE | ID: mdl-32311029

RESUMEN

BACKGROUND: Zika virus (ZIKV) is an emergent flavivirus initially considered a benign and self-limited exanthematic illness. In 2015, a new epidemic emerged in northeastern of Brazil with increased incidence of a previously rare clinical outcome, microcephaly, in newborns from mothers who were infected during pregnancy. Little is known about the immunopathogenesis of ZIKV-associated microcephaly. Understanding the inflammatory profile and degree of inflammation of persons affected with such condition is an important step towards development of innovative therapeutic strategies. METHODS: A case-control study compared plasma levels of several inflammatory biomarkers from newborns with ZIKV microcephaly, asymptomatic ZKV infection, or uninfected controls. Plasma biomarkers were assessed using Luminex. A series of multidimensional analysis was performed to characterize the systemic immune activation profile of the clinical groups. RESULTS: We identified an inflammatory signature associated with ZIKV microcephaly that suggested an increased inflammation. Network analysis suggested that ZIKV microcephaly is associated with imbalanced immune activation and inflammation. The cephalic perimeter was inversely proportional with the degree of inflammatory perturbation. Furthermore, a combination of plasma inflammatory biomarkers could discriminate ZIKV with microcephaly from those with ZIKV without microcephaly or uninfected neonates. CONCLUSIONS: An intense inflammatory imbalance that is proportional to the disease severity hallmarks ZIKV microcephaly.


Asunto(s)
Biomarcadores/sangre , Inflamación/complicaciones , Microcefalia/etiología , Infección por el Virus Zika/complicaciones , Brasil , Estudios de Casos y Controles , Femenino , Humanos , Recién Nacido , Masculino , Microcefalia/diagnóstico , Virus Zika/patogenicidad , Infección por el Virus Zika/sangre , Infección por el Virus Zika/virología
5.
Vet Parasitol Reg Stud Reports ; 53: 101060, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-39025553

RESUMEN

Snakes of the genus Bothrops inhabit tropical forests in Central and South America and are important for the biomedical and pharmaceutical industries because of the chemical properties of their venom. They serve as either definitive or intermediate hosts for many parasitic helminths. The Marajó Island (Brazil) is the natural habitat of venomous snakes, Bothrops atrox and Bothrops marajoensis, which are often found around rural and peri-urban areas and are known to bite humans. Samples of helminths parasitizing the oral cavity, subcutaneous tissues, coelomic cavity, and intestine of four B. atrox from Marajó Island (Pará-Brazil) were collected. The specimens studied were taxonomically classified as trematodes of the species Stycholecitha serpentis, nematodes of the genera Eustrongylides and Camallanus and cystacanths of an acanthocephalan of the genus Centrorhynchus. The aims of the present study were: to record helminths found in B. atrox from the Marajó Island; to discuss their role as definitive, intermediate, or paratenic hosts; and to compile a list of helminths that have been recorded in snakes of the genus Bothrops of the Neotropical region.


Asunto(s)
Bothrops , Helmintiasis Animal , Animales , Bothrops/parasitología , Brasil/epidemiología , Helmintiasis Animal/parasitología , Helmintiasis Animal/epidemiología , Masculino , Helmintos/clasificación , Helmintos/aislamiento & purificación , Femenino , Bothrops atrox
SELECCIÓN DE REFERENCIAS
Detalles de la búsqueda