Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Resultados 1 - 20 de 91
Filtrar
Más filtros

Banco de datos
Tipo del documento
Publication year range
1.
iScience ; 27(5): 109794, 2024 May 17.
Artículo en Inglés | MEDLINE | ID: mdl-38711455

RESUMEN

Autopsy rates are declining globally, impacting cause-of-death (CoD) diagnoses and quality control. Postmortem metabolomics was evaluated for CoD screening using 4,282 human cases, encompassing CoD groups: acidosis, drug intoxication, hanging, ischemic heart disease (IHD), and pneumonia. Cases were split 3:1 into training and test sets. High-resolution mass spectrometry data from femoral blood were analyzed via orthogonal-partial least squares discriminant analysis (OPLS-DA) to discriminate CoD groups. OPLS-DA achieved an R2 = 0.52 and Q2 = 0.30, with true-positive prediction rates of 68% and 65% for training and test sets, respectively, across all groups. Specificity-optimized thresholds predicted 56% of test cases with a unique CoD, average 45% sensitivity, and average 96% specificity. Prediction accuracies varied: 98.7% for acidosis, 80.5% for drug intoxication, 81.6% for hanging, 73.1% for IHD, and 93.6% for pneumonia. This study demonstrates the potential of large-scale postmortem metabolomics for CoD screening, offering high specificity and enhancing throughput and decision-making in human death investigations.

2.
iScience ; 27(1): 108385, 2024 Jan 19.
Artículo en Inglés | MEDLINE | ID: mdl-38205255

RESUMEN

We introduce an all-optical technique that enables volumetric imaging of brain-wide calcium activity and targeted optogenetic stimulation of specific brain regions in unrestrained larval zebrafish. The system consists of three main components: a 3D tracking module, a dual-color fluorescence imaging module, and a real-time activity manipulation module. Our approach uses a sensitive genetically encoded calcium indicator in combination with a long Stokes shift red fluorescence protein as a reference channel, allowing the extraction of Ca2+ activity from signals contaminated by motion artifacts. The method also incorporates rapid 3D image reconstruction and registration, facilitating real-time selective optogenetic stimulation of different regions of the brain. By demonstrating that selective light activation of the midbrain regions in larval zebrafish could reliably trigger biased turning behavior and changes of brain-wide neural activity, we present a valuable tool for investigating the causal relationship between distributed neural circuit dynamics and naturalistic behavior.

3.
iScience ; 27(4): 109503, 2024 Apr 19.
Artículo en Inglés | MEDLINE | ID: mdl-38591007

RESUMEN

Microinjecting yeast cells has been challenging for decades with no significant breakthrough due to the ultra-tough cell wall and low stiffness of the traditional injector tip at the micro-scale. Penetrating this protection wall is the key step for artificially bringing foreign substance into the yeast. In this paper, a yeast cell model was built by using finite element analysis (FEA) method to analyze the penetrating process. The key parameters of the yeast cell wall in the model (the Young's modulus, the shear modulus, and the Lame constant) were calibrated according to a general nanoindentation experiment. Then by employing the calibrated model, the injection parameters were optimized to minimize the cell damage (the maximum cell deformation at the critical stress of the cell wall). Key guidelines were suggested for penetrating the cell wall during microinjection.

4.
iScience ; 27(4): 109387, 2024 Apr 19.
Artículo en Inglés | MEDLINE | ID: mdl-38510118

RESUMEN

Identifying cancer genes is vital for cancer diagnosis and treatment. However, because of the complexity of cancer occurrence and limited cancer genes knowledge, it is hard to identify cancer genes accurately using only a few omics data, and the overall performance of existing methods is being called for further improvement. Here, we introduce a two-stage gradual-learning strategy GLIMS to predict cancer genes using integrative features from multi-omics data. Firstly, it uses a semi-supervised hierarchical graph neural network to predict the initial candidate cancer genes by integrating multi-omics data and protein-protein interaction (PPI) network. Then, it uses an unsupervised approach to further optimize the initial prediction by integrating the co-splicing network in post-transcriptional regulation, which plays an important role in cancer development. Systematic experiments on multi-omics cancer data demonstrated that GLIMS outperforms the state-of-the-art methods for the identification of cancer genes and it could be a useful tool to help advance cancer analysis.

5.
iScience ; 27(1): 108756, 2024 Jan 19.
Artículo en Inglés | MEDLINE | ID: mdl-38230261

RESUMEN

Compound-protein interaction (CPI) affinity prediction plays an important role in reducing the cost and time of drug discovery. However, the interpretability of how fragments function in CPI is impacted by the fact that current methods ignore the affinity relationships between fragments of compounds and fragments of proteins in CPI modeling. This article introduces an improved Transformer called FOTF-CPI (a Fusion of Optimal Transport Fragments compound-protein interaction prediction model). We use an optimal transport-based fragmentation approach to improve the model's understanding of compound and protein sequences. Additionally, a fused attention mechanism is employed, which combines the features of fragments to capture full affinity information. This fused attention redistributes higher attention scores to fragments with higher affinity. Experimental results show FOTF-CPI achieves an average 2% higher performance than other models on all three datasets. Furthermore, the visualization confirms the potential of FOTF-CPI for drug discovery applications.

6.
iScience ; 27(3): 109300, 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38469560

RESUMEN

microRNAs (miRNAs) are small regulatory RNAs that repress target mRNA transcripts through base pairing. Although the mechanisms of miRNA production and function are clearly established, new insights into miRNA regulation or miRNA-mediated gene silencing are still emerging. In order to facilitate the discovery of miRNA regulators or effectors, we have developed sRNA-Effector, a machine learning algorithm trained on enhanced crosslinking and immunoprecipitation sequencing and RNA sequencing data following knockdown of specific genes. sRNA-Effector can accurately identify known miRNA biogenesis and effector proteins and identifies 9 putative regulators of miRNA function, including serine/threonine kinase STK33, splicing factor SFPQ, and proto-oncogene BMI1. We validated the role of STK33, SFPQ, and BMI1 in miRNA regulation, showing that sRNA-Effector is useful for identifying new players in small RNA biology. sRNA-Effector will be a web tool available for all researchers to identify potential miRNA regulators in any cell line of interest.

7.
iScience ; 27(3): 109054, 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38361606

RESUMEN

Genome assembly databases are growing rapidly. The redundancy of sequence content between a new assembly and previous ones is neither conceptually nor algorithmically easy to measure. We introduce pertinent methods and DandD, a tool addressing how much new sequence is gained when a sequence collection grows. DandD can describe how much structural variation is discovered in each new human genome assembly and when discoveries will level off in the future. DandD uses a measure called δ ("delta"), developed initially for data compression and chiefly dependent on k-mer counts. DandD rapidly estimates δ using genomic sketches. We propose δ as an alternative to k-mer-specific cardinalities when computing the Jaccard coefficient, thereby avoiding the pitfalls of a poor choice of k. We demonstrate the utility of DandD's functions for estimating δ, characterizing the rate of pangenome growth, and computing all-pairs similarities using k-independent Jaccard.

8.
iScience ; 27(6): 109908, 2024 Jun 21.
Artículo en Inglés | MEDLINE | ID: mdl-38827397

RESUMEN

Accurate detection of pathogens, particularly distinguishing between Gram-positive and Gram-negative bacteria, could improve disease treatment. Host gene expression can capture the immune system's response to infections caused by various pathogens. Here, we present a deep neural network model, bvnGPS2, which incorporates the attention mechanism based on a large-scale integrated host transcriptome dataset to precisely identify Gram-positive and Gram-negative bacterial infections as well as viral infections. We performed analysis of 4,949 blood samples across 40 cohorts from 10 countries using our previously designed omics data integration method, iPAGE, to select discriminant gene pairs and train the bvnGPS2. The performance of the model was evaluated on six independent cohorts comprising 374 samples. Overall, our deep neural network model shows robust capability to accurately identify specific infections, paving the way for precise medicine strategies in infection treatment and potentially also for identifying subtypes of other diseases.

9.
iScience ; 27(6): 110032, 2024 Jun 21.
Artículo en Inglés | MEDLINE | ID: mdl-38868195

RESUMEN

Evaluation of the binding affinities of drugs to proteins is a crucial process for identifying drug pharmacological actions, but it requires three dimensional structures of proteins. Herein, we propose novel computational methods to predict the therapeutic indications and side effects of drug candidate compounds from the binding affinities to human protein structures on a proteome-wide scale. Large-scale docking simulations were performed for 7,582 drugs with 19,135 protein structures revealed by AlphaFold (including experimentally unresolved proteins), and machine learning models on the proteome-wide binding affinity score (PBAS) profiles were constructed. We demonstrated the usefulness of the method for predicting the therapeutic indications for 559 diseases and side effects for 285 toxicities. The method enabled to predict drug indications for which the related protein structures had not been experimentally determined and to successfully extract proteins eliciting the side effects. The proposed method will be useful in various applications in drug discovery.

10.
iScience ; 27(6): 109928, 2024 Jun 21.
Artículo en Inglés | MEDLINE | ID: mdl-38812546

RESUMEN

Interactions within the tumor microenvironment (TME) significantly influence tumor progression and treatment responses. While single-cell RNA sequencing (scRNA-seq) and spatial genomics facilitate TME exploration, many clinical cohorts are assessed at the bulk tissue level. Integrating scRNA-seq and bulk tissue RNA-seq data through computational deconvolution is essential for obtaining clinically relevant insights. Our method, ProM, enables the examination of major and minor cell types. Through evaluation against existing methods using paired single-cell and bulk RNA sequencing of human urothelial cancer (UC) samples, ProM demonstrates superiority. Application to UC cohorts treated with immune checkpoint inhibitors reveals pre-treatment cellular features associated with poor outcomes, such as elevated SPP1 expression in macrophage/monocytes (MM). Our deconvolution method and paired single-cell and bulk tissue RNA-seq dataset contribute novel insights into TME heterogeneity and resistance to immune checkpoint blockade.

11.
iScience ; 27(6): 109926, 2024 Jun 21.
Artículo en Inglés | MEDLINE | ID: mdl-38832027

RESUMEN

Cytotoxic T lymphocyte (CTL) and terminal exhausted T lymphocyte (ETL) activities crucially influence immune checkpoint inhibitor (ICI) response. Despite this, the efficacy of ETL and CTL transcriptomic signatures for response prediction remains limited. Investigating this across the TCGA and publicly available single-cell cohorts, we find a strong positive correlation between ETL and CTL expression signatures in most cancers. We hence posited that their limited predictability arises due to their mutually canceling effects on ICI response. Thus, we developed DETACH, a computational method to identify a gene set whose expression pinpoints to a subset of melanoma patients where the CTL and ETL correlation is low. DETACH enhances CTL's prediction accuracy, outperforming existing signatures. DETACH signature genes activity also demonstrates a positive correlation with lymphocyte infiltration and the prevalence of reactive T cells in the tumor microenvironment (TME), advancing our understanding of the CTL cell state within the TME.

12.
iScience ; 27(3): 109212, 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38433927

RESUMEN

Traditional loss functions such as cross-entropy loss often quantify the penalty for each mis-classified training sample without adequately considering its distance from the ground truth class distribution in the feature space. Intuitively, the larger this distance is, the higher the penalty should be. With this observation, we propose a penalty called distance-weighted Sinkhorn (DWS) loss. For each mis-classified training sample (with predicted label A and true label B), its contribution to the DWS loss positively correlates to the distance the training sample needs to travel to reach the ground truth distribution of all the A samples. We apply the DWS framework with a neural network to classify different stages of Alzheimer's disease. Our empirical results demonstrate that the DWS framework outperforms the traditional neural network loss functions and is comparable or better to traditional machine learning methods, highlighting its potential in biomedical informatics and data science.

13.
iScience ; 27(3): 109124, 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38455978

RESUMEN

Dysregulation of normal transcription factor activity is a common driver of disease. Therefore, the detection of aberrant transcription factor activity is important to understand disease pathogenesis. We have developed Priori, a method to predict transcription factor activity from RNA sequencing data. Priori has two key advantages over existing methods. First, Priori utilizes literature-supported regulatory information to identify transcription factor-target gene relationships. It then applies linear models to determine the impact of transcription factor regulation on the expression of its target genes. Second, results from a third-party benchmarking pipeline reveals that Priori detects aberrant activity from 124 single-gene perturbation experiments with higher sensitivity and specificity than 11 other methods. We applied Priori and other top-performing methods to predict transcription factor activity from two large primary patient datasets. Our work demonstrates that Priori uniquely discovered significant determinants of survival in breast cancer and identified mediators of drug response in leukemia.

14.
iScience ; 27(3): 109209, 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38439972

RESUMEN

GWAS focuses on significance loosing false positives; machine learning probes sub-significant features relying on predictivity. Yet, these are far from orthogonal. We sought to explore how these inform each other in sub-genome-wide significant situations to define relevance for predictive features. We introduce the SVM-based RubricOE that selects heavily cross-validated feature sets, and LDpred2 PRS as a strong contrast to SVM, to explore significance and predictivity. Our Alzheimer's test case notoriously lacks strong genetic signals except for few very strong phenotype-SNP associations, which suits the problem we are exploring. We found that the most significant SNPs among ML and PRS-selected SNPs captured most of the predictivity, while weaker associations tend also to contribute weakly to predictivity. SNPs with weak associations tend not to contribute to predictivity, but deletion of these features does not injure it. Significance provides a ranking that helps identify weakly predictive features.

15.
iScience ; 27(7): 110147, 2024 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-38989463

RESUMEN

Amyotrophic lateral sclerosis (ALS) is a universally fatal neurodegenerative disease with no cure. Human endogenous retroviruses (HERVs) have been implicated in its pathogenesis but their relevance to ALS is not fully understood. We examined bulk RNA-seq data from almost 2,000 ALS and unaffected control samples derived from the cortex and spinal cord. Using different methods of feature selection, including differential expression analysis and machine learning, we discovered that transcription of HERV-K loci 1q22 and 8p23.1 were significantly upregulated in the spinal cord of individuals with ALS. Additionally, we identified a subset of ALS patients with upregulated HERV-K expression in the cortex and spinal cord. We also found the expression of HERV-K loci 19q11 and 8p23.1 was correlated with protein coding genes previously implicated in ALS and dysregulated in ALS patients in this study. These results clarify the association of HERV-K and ALS and highlight specific genes in the pathobiology of late-stage ALS.

16.
iScience ; 27(7): 110371, 2024 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-39055916

RESUMEN

Ab initio computational reconstructions of protein-protein interaction (PPI) networks will provide invaluable insights into cellular systems, enabling the discovery of novel molecular interactions and elucidating biological mechanisms within and between organisms. Leveraging the latest generation protein language models and recurrent neural networks, we present SENSE-PPI, a sequence-based deep learning model that efficiently reconstructs ab initio PPIs, distinguishing partners among tens of thousands of proteins and identifying specific interactions within functionally similar proteins. SENSE-PPI demonstrates high accuracy, limited training requirements, and versatility in cross-species predictions, even with non-model organisms and human-virus interactions. Its performance decreases for phylogenetically more distant model and non-model organisms, but signal alteration is very slow. In this regard, it demonstrates the important role of parameters in protein language models. SENSE-PPI is very fast and can test 10,000 proteins against themselves in a matter of hours, enabling the reconstruction of genome-wide proteomes.

17.
iScience ; 27(7): 110104, 2024 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-38989470

RESUMEN

Coronary artery disease (CAD) remains a leading cause of disease burden globally, and there is a persistent need for new therapeutic targets. Instrumental variable (IV) and genetic colocalization analyses can help identify novel therapeutic targets for human disease by nominating causal genes in genome-wide association study (GWAS) loci. We conducted cis-IV analyses for 20,125 genes and 1,746 plasma proteins with CAD using molecular trait quantitative trait loci variant (QTLs) data from three different studies. 19 proteins and 119 genes were significantly associated with CAD risk by IV analyses and demonstrated evidence of genetic colocalization. Notably, our analyses validated well-established targets such as PCSK9 and ANGPTL4 while also identifying HTRA1 and endotrophin (a cleavage product of COL6A3) as proteins whose levels are causally associated with CAD risk. Further experimental studies are needed to confirm the causal role of the genes and proteins identified through our multiomic cis-IV analyses on human disease.

18.
iScience ; 27(2): 108782, 2024 Feb 16.
Artículo en Inglés | MEDLINE | ID: mdl-38318372

RESUMEN

As the influence of transformer-based approaches in general and generative artificial intelligence (AI) in particular continues to expand across various domains, concerns regarding authenticity and explainability are on the rise. Here, we share our perspective on the necessity of implementing effective detection, verification, and explainability mechanisms to counteract the potential harms arising from the proliferation of AI-generated inauthentic content and science. We recognize the transformative potential of generative AI, exemplified by ChatGPT, in the scientific landscape. However, we also emphasize the urgency of addressing associated challenges, particularly in light of the risks posed by disinformation, misinformation, and unreproducible science. This perspective serves as a response to the call for concerted efforts to safeguard the authenticity of information in the age of AI. By prioritizing detection, fact-checking, and explainability policies, we aim to foster a climate of trust, uphold ethical standards, and harness the full potential of AI for the betterment of science and society.

19.
iScience ; 27(7): 110368, 2024 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-39071890

RESUMEN

Deconvolution algorithms mostly rely on single-cell RNA-sequencing (scRNA-seq) data applied onto bulk RNA-sequencing (bulk RNA-seq) to estimate tissues' cell-type composition, with performance accuracy validated on deposited databases. Adipose tissues' cellular composition is highly variable, and adipocytes can only be captured by single-nucleus RNA-sequencing (snRNA-seq). Here we report the development of sNucConv, a Scaden deep-learning-based deconvolution tool, trained using 5 hSAT and 7 hVAT snRNA-seq-based data corrected by (i) snRNA-seq/bulk RNA-seq highly correlated genes and (ii) individual cell-type regression models. Applying sNucConv on our bulk RNA-seq data resulted in cell-type proportion estimation of 15 and 13 cell types, with accuracy of R = 0.93 (range: 0.76-0.97) and R = 0.95 (range: 0.92-0.98) for hVAT and hSAT, respectively. This performance level was further validated on an independent set of samples (5 hSAT; 5 hVAT). The resulting model was depot specific, reflecting depot differences in gene expression patterns. Jointly, sNucConv provides proof-of-concept for producing validated deconvolution models for tissues un-amenable to scRNA-seq.

20.
iScience ; 26(8): 107454, 2023 Aug 18.
Artículo en Inglés | MEDLINE | ID: mdl-37599835

RESUMEN

The hippocampus plays a vital role in navigation, learning, and memory, and is affected in Alzheimer's disease (AD). This study investigated the classification of AD-transgenic rats versus wild-type littermates using electrophysiological activity recorded from the hippocampus at an early, presymptomatic stage of the disease (6 months old) in the TgF344-AD rat model. The recorded signals were filtered into low frequency (LFP) and high frequency (spiking activity) signals, and machine learning classifiers were employed to identify the rat genotype (TG vs. WT). By analyzing specific frequency bands in the low frequency signals and calculating distance metrics between spike trains in the high frequency signals, accurate classification was achieved. Gamma band power emerged as a valuable signal for classification, and combining information from both low and high frequency signals improved the accuracy further. These findings provide valuable insights into the early stage effects of AD on different regions of the hippocampus.

SELECCIÓN DE REFERENCIAS
Detalles de la búsqueda