Search | BVS Bolivia

multiMotif: a generalized tool for scanning and visualization of diverse and distant multiple motifs.

Luo, Sainan; Xiao, Binghan; Geng, Jianing; Hu, Songnian.

J Genet Genomics ; 2024 Jul 09.

Article in English | MEDLINE | ID: mdl-38992773

Dynamic analysis of SARS-CoV-2 evolution based on different countries.

Xiao, Binghan; Wu, Linhuan; Sun, Qinglan; Shu, Chang; Hu, Songnian.

Gene ; 916: 148426, 2024 Jul 20.

Article in English | MEDLINE | ID: mdl-38575101

ABSTRACT

Since late 2019, COVID-19 has significantly impacted the world. Understanding the evolution of SARS-CoV-2 is crucial for protecting against future infectious pathogens. In this study, we conducted a comprehensive chronological analysis of SARS-CoV-2 evolution by examining mutation prevalence in the source countries of the variants of concern (VOCs), namely the United Kingdom, India, Brazil, and South Africa, plus two additional countries, the United States and Russia, using genomic sequences from GISAID. Our methodological approach involved large-scale genomic sequence alignment using MAFFT, Python-based data processing on a high-performance computing platform, and advanced statistical methods, namely the Maximal Information Coefficient (MIC) and Long Short-Term Memory (LSTM) models, for correlation analysis. Our findings elucidate the dynamics of SARS-CoV-2 evolution, highlighting the virus's changing behaviour over various pandemic stages. Key results include the discovery of three temporal mutation patterns (lineage-distinct, long-span, and competitive mutations) with varying levels of impact on the virus. Notably, we observed a convergence of advantageous mutations in the spike protein, especially in the later stages of the pandemic, indicating substantial evolutionary pressure on the virus. One of the most significant findings is the predominant role of natural immunity over vaccination-induced immunity in driving these evolutionary changes. This emphasizes the critical need for regular vaccine updates to maintain efficacy against evolving strains. In conclusion, our study not only sheds light on the evolutionary trajectory of SARS-CoV-2 but also underscores the urgency of robust, continuous global data collection and sharing. It highlights the necessity of rapid adaptations in medical countermeasures, including vaccine development, to stay ahead of pathogen evolution. This research provides valuable insights for future pandemic preparedness and response strategies.
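The abstract does not describe how mutation prevalence was computed, so the following is only a minimal sketch of one plausible reading: for each country and calendar month, the fraction of sequences carrying a given mutation. The record layout, the function name `monthly_prevalence`, and the toy records are all assumptions made for illustration; they are not the GISAID schema or the authors' pipeline.

```python
from collections import defaultdict


def monthly_prevalence(records, mutation):
    """Fraction of sequences per (country, month) that carry `mutation`.

    `records` is an iterable of (country, "YYYY-MM-DD", set_of_mutations)
    tuples. This layout is a hypothetical stand-in, not the GISAID format.
    """
    totals = defaultdict(int)  # sequences seen per (country, month)
    hits = defaultdict(int)    # sequences carrying the mutation
    for country, date, mutations in records:
        key = (country, date[:7])  # truncate the date to YYYY-MM
        totals[key] += 1
        if mutation in mutations:
            hits[key] += 1
    return {key: hits[key] / totals[key] for key in totals}


# Made-up toy records, for illustration only.
records = [
    ("India", "2021-04-02", {"S:L452R", "S:D614G"}),
    ("India", "2021-04-15", {"S:D614G"}),
    ("India", "2021-05-01", {"S:L452R", "S:D614G"}),
    ("Brazil", "2021-04-20", {"S:D614G"}),
]
prev = monthly_prevalence(records, "S:L452R")
for key in sorted(prev):
    print(key, prev[key])
```

A time series of this kind, one per mutation and country, is the sort of input on which a correlation measure such as MIC could then be evaluated.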

Subject(s)

COVID-19 , Molecular Evolution , Mutation , SARS-CoV-2 , SARS-CoV-2/genetics , SARS-CoV-2/immunology , Humans , COVID-19/epidemiology , COVID-19/virology , South Africa/epidemiology , India/epidemiology , Coronavirus Spike Glycoprotein/genetics , Coronavirus Spike Glycoprotein/immunology , Brazil/epidemiology , United Kingdom/epidemiology , Russia/epidemiology , Viral Genome , Phylogeny , United States/epidemiology

Plasmer: an Accurate and Sensitive Bacterial Plasmid Prediction Tool Based on Machine Learning of Shared k-mers and Genomic Features.

Zhu, Qianhui; Gao, Shenghan; Xiao, Binghan; He, Zilong; Hu, Songnian.

Microbiol Spectr ; 11(3): e0464522, 2023 06 15.

Article in English | MEDLINE | ID: mdl-37191574

ABSTRACT

Identification of plasmids in bacterial genomes is important in many contexts, including horizontal gene transfer, antibiotic resistance genes, host-microbe interactions, cloning vectors, and industrial production. There are several in silico methods to predict plasmid sequences in assembled genomes. However, existing methods have evident shortcomings, such as an imbalance between sensitivity and specificity, dependency on species-specific models, and reduced performance on sequences shorter than 10 kb, which have limited their scope of applicability. In this work, we proposed Plasmer, a novel plasmid predictor based on machine learning of shared k-mers and genomic features. Unlike existing k-mer or genomic-feature based methods, Plasmer employs the random forest algorithm to make predictions using the percent of shared k-mers with plasmid and chromosome databases combined with other genomic features, including alignment E value and replicon distribution scores (RDS). Plasmer can predict on multiple species and has achieved an average area under the curve (AUC) of 0.996 with an accuracy of 98.4%. Compared to existing methods, tests on sliding sequences, simulated assemblies, and de novo assemblies have consistently shown that Plasmer has superior accuracy and stable performance across long and short contigs above 500 bp, demonstrating its applicability for fragmented assemblies. Plasmer also has excellent and balanced performance on both sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which has eliminated the bias toward sensitivity or specificity that was common in existing methods. Plasmer also provides taxonomy classification to help identify the origin of plasmids.

IMPORTANCE In this study, we proposed a novel plasmid prediction tool named Plasmer. Technically, unlike existing k-mer or genomic features-based methods, Plasmer is the first tool to combine the advantages of the percent of shared k-mers and the alignment score of genomic features. This has given Plasmer (i) evident improvement in performance compared to other methods, with the best F1-score and accuracy on sliding sequences, simulated contigs, and de novo assemblies; (ii) applicability to contigs above 500 bp with the highest accuracy, enabling plasmid prediction in fragmented short-read assemblies; (iii) excellent and balanced performance between sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which eliminated the bias toward sensitivity or specificity that commonly existed in other methods; and (iv) no dependency on species-specific training models. We believe that Plasmer provides a more reliable alternative for plasmid prediction in bacterial genome assemblies.
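To make the "percent of shared k-mers" feature concrete, here is a minimal sketch of how such a value might be computed for a contig against a plasmid and a chromosome k-mer set. The abstract does not state Plasmer's k-mer size, database construction, or counting scheme, so the choice of distinct k-mers, `k = 4`, the function names, and the toy sequences are all assumptions for illustration, not the tool's implementation.

```python
def kmers(seq, k):
    """Return the set of distinct k-mers in `seq`."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def percent_shared(contig, reference_kmers, k):
    """Percent of the contig's distinct k-mers found in `reference_kmers`.

    Counting distinct k-mers is an assumption of this sketch; the abstract
    does not say how Plasmer counts shared k-mers.
    """
    contig_kmers = kmers(contig, k)
    if not contig_kmers:  # contig shorter than k
        return 0.0
    return 100.0 * len(contig_kmers & reference_kmers) / len(contig_kmers)


K = 4  # hypothetical k-mer size, chosen only to keep the example small
plasmid_db = kmers("ATGCGTAC", K)     # toy stand-in for a plasmid database
chromosome_db = kmers("TTTTAAAA", K)  # toy stand-in for a chromosome database

contig = "ATGCGT"
features = (
    percent_shared(contig, plasmid_db, K),
    percent_shared(contig, chromosome_db, K),
)
print(features)
```

In a pipeline of the kind the abstract describes, a feature pair like this would be combined with the other genomic features (alignment E value, RDS) and passed to a random forest classifier.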

Subject(s)

Bacterial Genome , Genomics , Genomics/methods , Plasmids/genetics , Machine Learning
