RESUMO
Accurate prediction of transcription factor binding sites (TFBSs) is essential for understanding gene regulation mechanisms and the etiology of diseases. Despite numerous advances in deep learning for predicting TFBSs, their performance can still be enhanced. In this study, we propose MLSNet, a novel deep learning architecture designed specifically to predict TFBSs. MLSNet innovatively integrates multisize convolutional fusion with long short-term memory (LSTM) networks to effectively capture DNA-sparse higher-order sequence features. Further, MLSNet incorporates super token attention and Bi-LSTM to systematically extract and integrate higher-order DNA shape features. Experimental results on 165 ChIP-seq (chromatin immunoprecipitation followed by sequencing) datasets indicate that MLSNet consistently outperforms several state-of-the-art algorithms in the prediction of TFBSs. Specifically, MLSNet reports average metrics: 0.8306 for ACC, 0.8992 for AUROC, and 0.9035 for AUPRC, surpassing the second-best methods by 1.82%, 1.68%, and 1.54%, respectively. This research delineates the effectiveness of combining multi-size convolutional layers with LSTM and DNA shape-based features in enhancing predictive accuracy. Moreover, this study comprehensively assesses the variability in model performance across different cell lines and transcription factors. The source code of MLSNet is available at https://github.com/minghaidea/MLSNet.
Assuntos
Aprendizado Profundo , Fatores de Transcrição , Fatores de Transcrição/metabolismo , Sítios de Ligação , Algoritmos , Biologia Computacional/métodos , Humanos , Sequenciamento de Cromatina por Imunoprecipitação/métodos , DNA/metabolismo , DNA/químicaRESUMO
BACKGROUND: Drug-drug interactions (DDIs) can result in unexpected pharmacological outcomes, including adverse drug events, which are crucial for drug discovery. Graph neural networks have substantially advanced our ability to model molecular representations; however, the precise identification of key local structures and the capture of long-distance structural correlations for better DDI prediction and interpretation remain significant challenges. RESULTS: Here, we present DrugDAGT, a dual-attention graph transformer framework with contrastive learning for predicting multiple DDI types. The dual-attention graph transformer incorporates attention mechanisms at both the bond and atomic levels, thereby enabling the integration of short and long-range dependencies within drug molecules to pinpoint key local structures essential for DDI discovery. Moreover, DrugDAGT further implements graph contrastive learning to maximize the similarity of representations across different views for better discrimination of molecular structures. Experiments in both warm-start and cold-start scenarios demonstrate that DrugDAGT outperforms state-of-the-art baseline models, achieving superior overall performance. Furthermore, visualization of the learned representations of drug pairs and the attention map provides interpretable insights instead of black-box results. CONCLUSIONS: DrugDAGT provides an effective tool for accurately predicting multiple DDI types by identifying key local chemical structures, offering valuable insights for prescribing medications, and guiding drug development. All data and code of our DrugDAGT can be found at https://github.com/codejiajia/DrugDAGT .
Assuntos
Interações Medicamentosas , Aprendizado de Máquina , Redes Neurais de Computação , Descoberta de Drogas/métodosRESUMO
Wildfire-specific particulate matter with diameters ≤ 2.5 µm (PM2.5) is the key component of wildfire smoke, with potentially higher toxicity than PM2.5 from other sources. In this nationwide population-based cohort study, we included 22,163,195 births from Brazil during 20102019. Daily wildfire-specific PM2.5 was estimated through the chemical transport model. Time-varying Cox proportional hazards models were used to characterize the exposure-time-response (E-T-R) relationship between weekly wildfire-specific PM2.5 exposure and preterm birth (PTB) risks, followed by subgroup analyses. A 10 µg/m3 increment in wildfire-specific PM2.5 was associated with a hazard ratio of 1.047 (95 % confidence interval [CI]: 1.032-1.063) for PTB. Stronger associations between wildfire-specific PM2.5 and PTB were observed during earlier pregnancy, among female infants, and pregnant women < 18 years old, in ethnic minorities, with a length of education ≥ 11 years, from low-income or high-temperature municipalities, and residing in North/Northeast regions. An estimated 1.47 % (95 % CI: 1.01 %1.94 %) of PTBs were attributable to wildfire-specific PM2.5 in Brazil, increasing from 2010 to 2019. The PTBs attributable to wildfire-specific PM2.5 surpassed those attributed to non-wildfire PM2.5 (0.31 %, 95% CI: 0.09 %0.57 %). Wildfire emerged as a critical source contributing to the PM2.5-linked PTBs. Prioritized fire management and emission control strategies are warranted for PTB prevention.
RESUMO
Pathogenic and antimicrobial-resistant (AMR) microorganisms are continually transmitted between human, animal, and environmental reservoirs, contributing to the high burden of infectious disease and driving the growing global AMR crisis. The sheer diversity of pathogens, AMR mechanisms, and transmission pathways connecting these reservoirs create the need for comprehensive cross-sectoral surveillance to effectively monitor risks. Current approaches are often siloed by discipline and sector, focusing independently on parts of the whole. Here we advocate that integrated surveillance approaches, developed through transdisciplinary cross-sector collaboration, are key to addressing the dual crises of infectious diseases and AMR. We first review the areas of need, challenges, and benefits of cross-sectoral surveillance, then summarise and evaluate the major detection methods already available to achieve this (culture, quantitative PCR, and metagenomic sequencing). Finally, we outline how cross-sectoral surveillance initiatives can be fostered at multiple scales of action, and present key considerations for implementation and the development of effective systems to manage and integrate this information for the benefit of multiple sectors. While methods and technologies are increasingly available and affordable for comprehensive pathogen and AMR surveillance across different reservoirs, it is imperative that systems are strengthened to effectively manage and integrate this information.
Assuntos
Farmacorresistência Bacteriana , Humanos , Animais , Resistência Microbiana a Medicamentos , Doenças Transmissíveis/tratamento farmacológico , Doenças Transmissíveis/microbiologia , Doenças Transmissíveis/transmissão , Antibacterianos/farmacologiaRESUMO
BACKGROUND: Antimicrobial resistance is a serious threat to global health. Due to the stagnant antibiotic discovery pipeline, bacteriophages (phages) have been proposed as an alternative therapy for the treatment of infections caused by multidrug-resistant pathogens. Genomic features play an important role in phage pharmacology. However, our knowledge of phage genomics is sparse, and the use of existing bioinformatic pipelines and tools requires considerable bioinformatic expertise. These challenges have substantially limited the clinical translation of phage therapy. FINDINGS: We have developed PhageGE (Phage Genome Explorer), a user-friendly graphical interface application for the interactive analysis of phage genomes. PhageGE enables users to perform key analyses, including phylogenetic analysis, visualization of phylogenetic trees, prediction of phage life cycle, and comparative analysis of phage genome annotations. The new R Shiny web server, PhageGE, integrates existing R packages and combines them with several newly developed functions to facilitate these analyses. Additionally, the web server provides interactive visualization capabilities and allows users to directly export publication-quality images. CONCLUSIONS: PhageGE is a valuable tool that simplifies the analysis of phage genome data and may expedite the development and clinical translation of phage therapy. PhageGE is publicly available at https://jason-zhao.shinyapps.io/PhageGE_Update/.
Assuntos
Bacteriófagos , Genoma Viral , Software , Bacteriófagos/genética , Genômica/métodos , Biologia Computacional/métodos , Internet , FilogeniaRESUMO
Recent advancements in high-throughput sequencing technologies have significantly enhanced our ability to unravel the intricacies of gene regulatory processes. A critical challenge in this endeavor is the identification of variant effects, a key factor in comprehending the mechanisms underlying gene regulation. Non-coding variants, constituting over 90% of all variants, have garnered increasing attention in recent years. The exploration of gene variant impacts and regulatory mechanisms has spurred the development of various deep learning approaches, providing new insights into the global regulatory landscape through the analysis of extensive genetic data. Here, we provide a comprehensive overview of the development of the non-coding variants models based on bulk and single-cell sequencing data and their model-based interpretation and downstream tasks. This review delineates the popular sequencing technologies for epigenetic profiling and deep learning approaches for discerning the effects of non-coding variants. Additionally, we summarize the limitations of current approaches in variant effect prediction research and outline opportunities for improvement. We anticipate that our study will offer a practical and useful guide for the bioinformatic community to further advance the unraveling of genetic variant effects.
Assuntos
Aprendizado Profundo , Variação Genética , Humanos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biologia Computacional/métodos , Epigênese GenéticaRESUMO
In the current battle against antibiotic resistance, the resilience of Gram-negative bacteria against traditional antibiotics is due not only to their protective outer membranes but also to mechanisms like efflux pumps and enzymatic degradation of drugs, underscores the urgent need for innovative antimicrobial tactics. Herein, this study presents an innovative method involving the synthesis of three furoxan derivatives engineered to self-assemble into nitric oxide (NO) donor nanoparticles (FuNPs). These FuNPs, notably supplied together with polymyxin B (PMB), achieve markedly enhanced bactericidal efficacy against a wide spectrum of bacterial phenotypes at considerably lower NO concentrations (0.1-2.8 µg mL-1), which is at least ten times lower than the reported data for NO donors (≥200 µg mL-1). The bactericidal mechanism is elucidated using confocal, scanning, and transmission electron microscopy techniques. Neutron reflectometry confirms that FuNPs initiate membrane disruption by specifically engaging with the polysaccharides on bacterial surfaces, causing structural perturbations. Subsequently, PMB binds to lipid A on the outer membrane, enhancing permeability and resulting in a synergistic bactericidal action with FuNPs. This pioneering strategy underscores the utility of self-assembly in NO delivery as a groundbreaking paradigm to circumvent traditional antibiotic resistance barriers, marking a significant leap forward in the development of next-generation antimicrobial agents.
RESUMO
Wildfires have devastating effects on society and public health. However, little evidence from population-based cohort has been performed to analyze the relationship of wildfire-related PM2.5, an important component of wildfire smoke, with cancer-specific mortality. We aimed to explore this relationship and identify vulnerable populations in UK with lower levels of wildfire-related PM2.5 exposure. The study consisted of 492394 participants (age: 38-73 years) recruited by UK Biobank during 2004-2010. The cumulative wildfire-related PM2.5 within 10 kilometers of residence over three years was used as exposure, which was assessed by chemical transport and machine learning models. A time-varying Cox regression was utilized to explore the relationship of exposure with diverse cancer-specific mortality outcomes. Subgroup analyses of a range of potential modifiers were performed. Each 10⯵g/m3 increment of 3-year cumulative exposure was related to a 0.4â¯% greater risk of total cancer (95â¯%CI: 1.001-1.007), a 1.1â¯% greater risk of lung cancer (95â¯%CI: 1.004-1.018), and a 2.7â¯% greater risk of lip, oral cavity and pharynx (LOP) cancer (95â¯%CI: 1.005-1.049). Higher vulnerability in the wildfire-related PM2.5-lung cancer relationship was found among participants being retired than those with other employment status. Even lower levels of exposure to PM2.5 from wildfires were related to elevated mortality risks for cancer from total, lung, LOP, highlighting the importance of wildfire prevention and control. Further investigations are warranted to enrich and extend existing knowledge in this field.
Assuntos
Poluentes Atmosféricos , Neoplasias , Material Particulado , Incêndios Florestais , Humanos , Pessoa de Meia-Idade , Material Particulado/análise , Neoplasias/mortalidade , Neoplasias/induzido quimicamente , Idoso , Masculino , Feminino , Adulto , Poluentes Atmosféricos/análise , Poluentes Atmosféricos/toxicidade , Poluentes Atmosféricos/efeitos adversos , Reino Unido/epidemiologia , Exposição Ambiental/estatística & dados numéricos , Exposição Ambiental/efeitos adversos , Fumaça/efeitos adversos , Estudos de CoortesRESUMO
MicroRNAs (miRNAs) are short non-coding RNAs involved in various cellular processes, playing a crucial role in gene regulation. Identifying miRNA targets remains a central challenge and is pivotal for elucidating the complex gene regulatory networks. Traditional computational approaches have predominantly focused on identifying miRNA targets through perfect Watson-Crick base pairings within the seed region, referred to as canonical sites. However, emerging evidence suggests that perfect seed matches are not a prerequisite for miRNA-mediated regulation, underscoring the importance of also recognizing imperfect, or non-canonical, sites. To address this challenge, we propose Mimosa, a new computational approach that employs the Transformer framework to enhance the prediction of miRNA targets. Mimosa distinguishes itself by integrating contextual, positional and base-pairing information to capture in-depth attributes, thereby improving its predictive capabilities. Its unique ability to identify non-canonical base-pairing patterns makes Mimosa a standout model, reducing the reliance on pre-selecting candidate targets. Mimosa achieves superior performance in gene-level predictions and also shows impressive performance in site-level predictions across various non-human species through extensive benchmarking tests. To facilitate research efforts in miRNA targeting, we have developed an easy-to-use web server for comprehensive end-to-end predictions, which is publicly available at http://monash.bioweb.cloud.edu.au/Mimosa.
Assuntos
Pareamento de Bases , MicroRNAs , Software , MicroRNAs/genética , MicroRNAs/metabolismo , Humanos , Algoritmos , Biologia Computacional/métodos , AnimaisRESUMO
Matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS) is widely used in clinical microbiology laboratories for bacterial identification but its use for detection of antimicrobial resistance (AMR) remains limited. Here, we used MALDI-TOF MS with artificial intelligence (AI) approaches to successfully predict AMR in Pseudomonas aeruginosa, a priority pathogen with complex AMR mechanisms. The highest performance was achieved for modern ß-lactam/ß-lactamase inhibitor drugs, namely, ceftazidime/avibactam and ceftolozane/tazobactam. For these drugs, the model demonstrated area under the receiver operating characteristic curve (AUROC) of 0.869 and 0.856, specificity of 0.925 and 0.897, and sensitivity of 0.731 and 0.714, respectively. As part of this work, we developed dynamic binning, a feature engineering technique that effectively reduces the high-dimensional feature set and has wide-ranging applicability to MALDI-TOF MS data. Compared to conventional feature engineering approaches, the dynamic binning method yielded highest performance in 7 of 10 antimicrobials. Moreover, we showcased the efficacy of transfer learning in enhancing the AUROC performance for 8 of 11 antimicrobials. By assessing the contribution of features to the model's prediction, we identified proteins that may contribute to AMR mechanisms. Our findings demonstrate the potential of combining AI with MALDI-TOF MS as a rapid AMR diagnostic tool for Pseudomonas aeruginosa.IMPORTANCEPseudomonas aeruginosa is a key bacterial pathogen that causes significant global morbidity and mortality. Antimicrobial resistance (AMR) emerges rapidly in P. aeruginosa and is driven by complex mechanisms. Drug-resistant P. aeruginosa is a major challenge in clinical settings due to limited treatment options. Early detection of AMR can guide antibiotic choices, improve patient outcomes, and avoid unnecessary antibiotic use. Matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS) is widely used for rapid species identification in clinical microbiology. In this study, we repurposed mass spectra generated by MALDI-TOF and used them as inputs for artificial intelligence approaches to successfully predict AMR in P. aeruginosa for multiple key antibiotic classes. This work represents an important advance toward using MALDI-TOF as a rapid AMR diagnostic for P. aeruginosa in clinical settings.
Assuntos
Antibacterianos , Inteligência Artificial , Pseudomonas aeruginosa , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz , Tazobactam , Pseudomonas aeruginosa/efeitos dos fármacos , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz/métodos , Antibacterianos/farmacologia , Humanos , Tazobactam/farmacologia , Tazobactam/uso terapêutico , Infecções por Pseudomonas/tratamento farmacológico , Infecções por Pseudomonas/microbiologia , Testes de Sensibilidade Microbiana/métodos , Farmacorresistência Bacteriana , Combinação de Medicamentos , Ceftazidima/farmacologia , Compostos Azabicíclicos/farmacologia , CefalosporinasRESUMO
Haemonchus contortus (the barber's pole worm)-a highly pathogenic gastric nematode of ruminants-causes significant economic losses in the livestock industry worldwide. H. contortus has become a valuable model organism for both fundamental and applied research (e.g., drug and vaccine discovery) because of the availability of well-defined laboratory strains (e.g., MHco3(ISE).N1 in the UK and Haecon-5 in Australia) and genomic, transcriptomic and proteomic data sets. Many recent investigations have relied heavily on the use of the chromosome-contiguous genome of MHco3(ISE).N1 in the absence of a genome for Haecon-5. However, there has been no genetic comparison of these and other strains to date. Here, we assembled and characterised the mitochondrial genome (14.1 kb) of Haecon-5 and compared it with that of MHco3(ISE).N1 and two other strains (i.e., McMaster and NZ_Hco_NP) from Australasia. We detected 276 synonymous and 25 non-synonymous single nucleotide polymorphisms (SNPs) within Haecon-5. Between the Haecon-5 and MHco3(ISE).N1 strains, we recorded 345 SNPs, 31 of which were non-synonymous and linked to fixed amino acid differences in seven protein-coding genes (nad5, nad6, nad1, atp6, nad2, cytb and nad4) between these strains. Pronounced variation (344 and 435 SNPs) was seen between Haecon-5 and each of the other two strains from Australasia. The question remains as to what impact these mitogenomic mutations might have on the biology and physiology of H. contortus, which warrants exploration. The high degree of mitogenomic variability recorded here among these strains suggests that further work should be undertaken to assess the nature and extent of the nuclear genomic variation within H. contortus.
Assuntos
Genoma Mitocondrial , Haemonchus , Polimorfismo de Nucleotídeo Único , Animais , Haemonchus/genética , Filogenia , Variação Genética , AustráliaRESUMO
Millions of livestock animals worldwide are infected with the haematophagous barber's pole worm, Haemonchus contortus, the aetiological agent of haemonchosis. Despite the major significance of this parasite worldwide and its widespread resistance to current treatments, the lack of a high-quality genome for the well-defined strain of this parasite from Australia, called Haecon-5, has constrained research in a number of areas including host-parasite interactions, drug discovery and population genetics. To enable research in these areas, we report here a chromosome-contiguous genome (â¼280 Mb) for Haecon-5 with high-quality models for 19,234 protein-coding genes. Comparative genomic analyses show significant genomic similarity (synteny) with a UK strain of H. contortus, called MHco3(ISE).N1 (abbreviated as "ISE"), but we also discover marked differences in genomic structure/gene arrangements, distribution of nucleotide variability (single nucleotide polymorphisms (SNPs) and indels) and orthology between Haecon-5 and ISE. We used the genome and extensive transcriptomic resources for Haecon-5 to predict a subset of essential single-copy genes employing a "cross-species" machine learning (ML) approach using a range of features from nucleotide/protein sequences, protein orthology, subcellular localisation, single-cell RNA-seq and/or histone methylation data available for the model organisms Caenorhabditis elegans and Drosophila melanogaster. From a set of 1,464 conserved single copy genes, transcribed in key life-cycle stages of H. contortus, we identified 232 genes whose homologs have critical functions in C. elegans and/or D. melanogaster, and prioritised 10 of them for further characterisation; nine of the 10 genes likely play roles in neurophysiological processes, germline, hypodermis and/or respiration, and one is an unknown (orphan) gene for which no detailed functional information exists. Future studies of these genes/gene products are warranted to elucidate their roles in parasite biology, host-parasite interplay and/or disease. Clearly, the present Haecon-5 reference genome and associated resources now underpin a broad range of fundamental investigations of H. contortus and could assist in accelerating the discovery of novel intervention targets and drug candidates to combat haemonchosis.
RESUMO
The critical importance of accurately predicting mutations in protein metal-binding sites for advancing drug discovery and enhancing disease diagnostic processes cannot be overstated. In response to this imperative, MetalTrans emerges as an accurate predictor for disease-associated mutations in protein metal-binding sites. The core innovation of MetalTrans lies in its seamless integration of multifeature splicing with the Transformer framework, a strategy that ensures exhaustive feature extraction. Central to MetalTrans's effectiveness is its deep feature combination strategy, which merges evolutionary-scale modeling amino acid embeddings with ProtTrans embeddings, thus shedding light on the biochemical properties of proteins. Employing the Transformer component, MetalTrans leverages the self-attention mechanism to delve into higher-level representations. Utilizing mutation site information for feature fusion not only enriches the feature set but also sidesteps the common pitfall of overestimation linked to protein sequence-based predictions. This nuanced approach to feature fusion is a key differentiator, enabling MetalTrans to outperform existing methods significantly, as evidenced by comparative analyses. Our evaluations across varied metal binding site data sets (specifically Zn, Ca, Mg, and Mix) underscore MetalTrans's superior performance, which achieved the average AUC values of 0.971, 0.965, 0.980, and 0.945 on multiple 5-fold cross-validation, respectively. Remarkably, against the multichannel convolutional neural network method on a benchmark independent test set, MetalTrans demonstrated unparalleled robustness and superiority, boasting the AUC score of 0.998 on multiple 5-fold cross-validation. Our comprehensive examination of the predicted outcomes further confirms the effectiveness of the model. The source codes, data sets, and prediction results for MetalTrans can be accessed for academic usage at https://github.com/EduardWang/MetalTrans.
Assuntos
Metais , Mutação , Sítios de Ligação , Metais/química , Metais/metabolismo , Humanos , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Modelos Moleculares , Biologia Computacional/métodos , Bases de Dados de ProteínasRESUMO
Histopathology image-based survival prediction aims to provide a precise assessment of cancer prognosis and can inform personalized treatment decision-making in order to improve patient outcomes. However, existing methods cannot automatically model the complex correlations between numerous morphologically diverse patches in each whole slide image (WSI), thereby preventing them from achieving a more profound understanding and inference of the patient status. To address this, here we propose a novel deep learning framework, termed dual-stream multi-dependency graph neural network (DM-GNN), to enable precise cancer patient survival analysis. Specifically, DM-GNN is structured with the feature updating and global analysis branches to better model each WSI as two graphs based on morphological affinity and global co-activating dependencies. As these two dependencies depict each WSI from distinct but complementary perspectives, the two designed branches of DM-GNN can jointly achieve the multi-view modeling of complex correlations between the patches. Moreover, DM-GNN is also capable of boosting the utilization of dependency information during graph construction by introducing the affinity-guided attention recalibration module as the readout function. This novel module offers increased robustness against feature perturbation, thereby ensuring more reliable and stable predictions. Extensive benchmarking experiments on five TCGA datasets demonstrate that DM-GNN outperforms other state-of-the-art methods and offers interpretable prediction insights based on the morphological depiction of high-attention patches. Overall, DM-GNN represents a powerful and auxiliary tool for personalized cancer prognosis from histopathology images and has great potential to assist clinicians in making personalized treatment decisions and improving patient outcomes.
Assuntos
Redes Neurais de Computação , Humanos , Análise de Sobrevida , Aprendizado Profundo , Neoplasias/diagnóstico por imagem , Neoplasias/mortalidade , Interpretação de Imagem Assistida por Computador/métodos , PrognósticoRESUMO
Current studies in early cancer detection based on liquid biopsy data often rely on off-the-shelf models and face challenges with heterogeneous data, as well as manually designed data preprocessing pipelines with different parameter settings. To address those challenges, we present AutoCancer, an automated, multimodal, and interpretable transformer-based framework. This framework integrates feature selection, neural architecture search, and hyperparameter optimization into a unified optimization problem with Bayesian optimization. Comprehensive experiments demonstrate that AutoCancer achieves accurate performance in specific cancer types and pan-cancer analysis, outperforming existing methods across three cohorts. We further demonstrated the interpretability of AutoCancer by identifying key gene mutations associated with non-small cell lung cancer to pinpoint crucial factors at different stages and subtypes. The robustness of AutoCancer, coupled with its strong interpretability, underscores its potential for clinical applications in early cancer detection.
RESUMO
Over the years, comprehensive explorations of the model organisms Caenorhabditis elegans (elegant worm) and Drosophila melanogaster (vinegar fly) have contributed substantially to our understanding of complex biological processes and pathways in multicellular organisms generally. Extensive functional genomic-phenomic, genomic, transcriptomic, and proteomic data sets have enabled the discovery and characterisation of genes that are crucial for life, called 'essential genes'. Recently, we investigated the feasibility of inferring essential genes from such data sets using advanced bioinformatics and showed that a machine learning (ML)-based workflow could be used to extract or engineer features from DNA, RNA, protein, and/or cellular data/information to underpin the reliable prediction of essential genes both within and between C. elegans and D. melanogaster. As these are two distantly related species within the Ecdysozoa, we proposed that this ML approach would be particularly well suited for species that are within the same phylum or evolutionary clade. In the present study, we cross-predicted essential genes within the phylum Nematoda (evolutionary clade V)-between C. elegans and the pathogenic parasitic nematode H. contortus-and then ranked and prioritised H. contortus proteins encoded by these genes as intervention (e.g., drug) target candidates. Using strong, validated predictors, we inferred essential genes of H. contortus that are involved predominantly in crucial biological processes/pathways including ribosome biogenesis, translation, RNA binding/processing, and signalling and which are highly transcribed in the germline, somatic gonad precursors, sex myoblasts, vulva cell precursors, various nerve cells, glia, or hypodermis. The findings indicate that this in silico workflow provides a promising avenue to identify and prioritise panels/groups of drug target candidates in parasitic nematodes for experimental validation in vitro and/or in vivo.
Assuntos
Caenorhabditis elegans , Genes Essenciais , Haemonchus , Aprendizado de Máquina , Animais , Haemonchus/genética , Caenorhabditis elegans/genética , Proteínas de Helminto/genética , Proteínas de Helminto/metabolismo , Biologia Computacional/métodos , Drosophila melanogaster/genéticaRESUMO
OBJECTIVE: To evaluate associations of wildfire fine particulate matter ≤2.5 mm in diameter (PM2.5) with diabetes across multiple countries and territories. RESEARCH DESIGN AND METHODS: We collected data on 3,612,135 diabetes hospitalizations from 1,008 locations in Australia, Brazil, Canada, Chile, New Zealand, Thailand, and Taiwan during 2000-2019. Daily wildfire-specific PM2.5 levels were estimated through chemical transport models and machine-learning calibration. Quasi-Poisson regression with distributed lag nonlinear models and random-effects meta-analysis were applied to estimate associations between wildfire-specific PM2.5 and diabetes hospitalization. Subgroup analyses were by age, sex, location income level, and country or territory. Diabetes hospitalizations attributable to wildfire-specific PM2.5 and nonwildfire PM2.5 were compared. RESULTS: Each 10 µg/m3 increase in wildfire-specific PM2.5 levels over the current day and previous 3 days was associated with relative risks (95% CI) of 1.017 (1.011-1.022), 1.023 (1.011-1.035), 1.023 (1.015-1.032), 0.962 (0.823-1.032), 1.033 (1.001-1.066), and 1.013 (1.004-1.022) for all-cause, type 1, type 2, malnutrition-related, other specified, and unspecified diabetes hospitalization, respectively. Stronger associations were observed for all-cause, type 1, and type 2 diabetes in Thailand, Australia, and Brazil; unspecified diabetes in New Zealand; and type 2 diabetes in high-income locations. An estimate of 0.67% (0.16-1.18%) and 1.02% (0.20-1.81%) for all-cause and type 2 diabetes hospitalizations were attributable to wildfire-specific PM2.5. Compared with nonwildfire PM2.5, wildfire-specific PM2.5 posed greater risks of all-cause, type 1, and type 2 diabetes and were responsible for 38.7% of PM2.5-related diabetes hospitalizations. CONCLUSIONS: We show the relatively underappreciated links between diabetes and wildfire air pollution, which can lead to a nonnegligible proportion of PM2.5-related diabetes hospitalizations. Precision prevention and mitigation should be developed for those in advantaged communities and in Thailand, Australia, and Brazil.
Assuntos
Diabetes Mellitus , Hospitalização , Material Particulado , Incêndios Florestais , Humanos , Hospitalização/estatística & dados numéricos , Material Particulado/análise , Material Particulado/efeitos adversos , Masculino , Austrália/epidemiologia , Pessoa de Meia-Idade , Feminino , Diabetes Mellitus/epidemiologia , Idoso , Tailândia/epidemiologia , Nova Zelândia/epidemiologia , Brasil/epidemiologia , Canadá/epidemiologia , Taiwan/epidemiologia , Adulto , Exposição Ambiental/efeitos adversos , Exposição Ambiental/estatística & dados numéricosRESUMO
RNA N6-methyladenosine is a prevalent and abundant type of RNA modification that exerts significant influence on diverse biological processes. To date, numerous computational approaches have been developed for predicting methylation, with most of them ignoring the correlations of different encoding strategies and failing to explore the adaptability of various attention mechanisms for methylation identification. To solve the above issues, we proposed an innovative framework for predicting RNA m6A modification site, termed BLAM6A-Merge. Specifically, it utilized a multimodal feature fusion strategy to combine the classification results of four features and Blastn tool. Apart from this, different attention mechanisms were employed for extracting higher-level features on specific features after the screening process. Extensive experiments on 12 benchmarking datasets demonstrated that BLAM6A-Merge achieved superior performance (average AUC: 0.849 for the full transcript mode and 0.784 for the mature mRNA mode). Notably, the Blastn tool was employed for the first time in the identification of methylation sites. The data and code can be accessed at https://github.com/DoraemonXia/BLAM6A-Merge.
RESUMO
Microbial hydrogen (H2) cycling underpins the diversity and functionality of diverse anoxic ecosystems. Among the three evolutionarily distinct hydrogenase superfamilies responsible, [FeFe] hydrogenases were thought to be restricted to bacteria and eukaryotes. Here, we show that anaerobic archaea encode diverse, active, and ancient lineages of [FeFe] hydrogenases through combining analysis of existing and new genomes with extensive biochemical experiments. [FeFe] hydrogenases are encoded by genomes of nine archaeal phyla and expressed by H2-producing Asgard archaeon cultures. We report an ultraminimal hydrogenase in DPANN archaea that binds the catalytic H-cluster and produces H2. Moreover, we identify and characterize remarkable hybrid complexes formed through the fusion of [FeFe] and [NiFe] hydrogenases in ten other archaeal orders. Phylogenetic analysis and structural modeling suggest a deep evolutionary history of hybrid hydrogenases. These findings reveal new metabolic adaptations of archaea, streamlined H2 catalysts for biotechnological development, and a surprisingly intertwined evolutionary history between the two major H2-metabolizing enzymes.
Assuntos
Archaea , Hidrogênio , Hidrogenase , Filogenia , Archaea/genética , Archaea/enzimologia , Proteínas Arqueais/metabolismo , Proteínas Arqueais/química , Proteínas Arqueais/genética , Genoma Arqueal , Hidrogênio/metabolismo , Hidrogenase/metabolismo , Hidrogenase/genética , Hidrogenase/química , Proteínas Ferro-Enxofre/metabolismo , Proteínas Ferro-Enxofre/genética , Proteínas Ferro-Enxofre/química , Modelos Moleculares , Estrutura Terciária de ProteínaRESUMO
BACKGROUND: Floods are the most frequent weather-related disaster, causing significant health impacts worldwide. Limited studies have examined the long-term consequences of flooding exposure. METHODS: Flood data were retrieved from the Dartmouth Flood Observatory and linked with health data from 499,487 UK Biobank participants. To calculate the annual cumulative flooding exposure, we multiplied the duration and severity of each flood event and then summed these values for each year. We conducted a nested case-control analysis to evaluate the long-term effect of flooding exposure on all-cause and cause-specific mortality. Each case was matched with eight controls. Flooding exposure was modelled using a distributed lag non-linear model to capture its nonlinear and lagged effects. RESULTS: The risk of all-cause mortality increased by 6.7% (odds ratio (OR): 1.067, 95% confidence interval (CI): 1.063-1.071) for every unit increase in flood index after confounders had been controlled for. The mortality risk from neurological and mental diseases was negligible in the current year, but strongest in the lag years 3 and 4. By contrast, the risk of mortality from suicide was the strongest in the current year (OR: 1.018, 95% CI: 1.008-1.028), and attenuated to lag year 5. Participants with higher levels of education and household income had a higher estimated risk of death from most causes whereas the risk of suicide-related mortality was higher among participants who were obese, had lower household income, engaged in less physical activity, were non-moderate alcohol consumers, and those living in more deprived areas. CONCLUSIONS: Long-term exposure to floods is associated with an increased risk of mortality. The health consequences of flooding exposure would vary across different periods after the event, with different profiles of vulnerable populations identified for different causes of death. These findings contribute to a better understanding of the long-term impacts of flooding exposure.