RESUMO
Single-cell RNA sequencing (scRNA-seq) enables the exploration of biological heterogeneity among different cell types within tissues at a resolution. Inferring cell types within tissues is foundational for downstream research. Most existing methods for cell type inference based on scRNA-seq data primarily utilize highly variable genes (HVGs) with higher expression levels as clustering features, overlooking the contribution of HVGs with lower expression levels. To address this, we have designed a novel cell type inference method for scRNA-seq data, termed scLEGA. scLEGA employs a novel zero-inflated negative binomial (ZINB) loss function that fully considers the contribution of genes with lower expression levels and combines two distinct scRNA-seq clustering strategies through a multi-head attention mechanism. It utilizes a low-expression optimized denoising autoencoder, based on the novel ZINB model, to extract low-dimensional features and handle dropout events, and a GCN-based graph autoencoder (GAE) that leverages neighbor information to guide dimensionality reduction. The iterative fusion of denoising and topological embedding in scLEGA facilitates the acquisition of cluster-friendly cell representations in the hidden embedding, where similar cells are brought closer together. Compared to 12 state-of-the-art cell type inference methods on 15 scRNA-seq datasets, scLEGA demonstrates superior performance in clustering accuracy, scalability, and stability. Our scLEGA model codes are freely available at https://github.com/Masonze/scLEGA-main.
Assuntos
RNA-Seq , Análise da Expressão Gênica de Célula Única , Humanos , Algoritmos , Análise por Conglomerados , Biologia Computacional/métodos , RNA-Seq/métodos , SoftwareRESUMO
In the growth and development of multicellular organisms, the immune processes of the immune system and the maintenance of the organism's internal environment, cell communication plays a crucial role. It exerts a significant influence on regulating internal cellular states such as gene expression and cell functionality. Currently, the mainstream methods for studying intercellular communication are focused on exploring the ligand-receptor-transcription factor and ligand-receptor-subunit scales. However, there is relatively limited research on the association between intercellular communication and highly variable genes (HVGs). As some HVGs are closely related to cell communication, accurately identifying these HVGs can enhance the accuracy of constructing cell communication networks. The rapid development of single-cell sequencing (scRNA-seq) and spatial transcriptomics technologies provides a data foundation for exploring the relationship between intercellular communication and HVGs. Therefore, we propose CPPLS-MLP, which can identify HVGs closely related to intercellular communication and further analyze the impact of Multiple Input Multiple Output cellular communication on the differential expression of these HVGs. By comparing with the commonly used method CCPLS for constructing intercellular communication networks, we validated the superior performance of our method in identifying cell-type-specific HVGs and effectively analyzing the influence of neighboring cell types on HVG expression regulation. Source codes for the CPPLS_MLP R, python packages and the related scripts are available at 'CPPLS_MLP Github [https://github.com/wuzhenao/CPPLS-MLP]'.
Assuntos
Comunicação Celular , Análise de Célula Única , Análise de Célula Única/métodos , Transcriptoma , Perfilação da Expressão Gênica/métodos , Humanos , Biologia Computacional/métodos , Redes Reguladoras de Genes , Animais , Software , AlgoritmosRESUMO
The field of computational drug repurposing aims to uncover novel therapeutic applications for existing drugs through high-throughput data analysis. However, there is a scarcity of drug repurposing methods leveraging the cellular-level information provided by single-cell RNA sequencing data. To address this need, we propose DrugReSC, an innovative approach to drug repurposing utilizing single-cell RNA sequencing data, intending to target specific cell subpopulations critical to disease pathology. DrugReSC constructs a drug-by-cell matrix representing the transcriptional relationships between individual cells and drugs and utilizes permutation-based methods to assess drug contributions to cellular phenotypic changes. We demonstrate DrugReSC's superior performance compared to existing drug repurposing methods based on bulk or single-cell RNA sequencing data across multiple cancer case studies. In summary, DrugReSC offers a novel perspective on the utilization of single-cell sequencing data in drug repurposing methods, contributing to the advancement of precision medicine for cancer.
Assuntos
Reposicionamento de Medicamentos , Neoplasias , Análise de Célula Única , Transcriptoma , Reposicionamento de Medicamentos/métodos , Humanos , Neoplasias/tratamento farmacológico , Neoplasias/genética , Neoplasias/patologia , Neoplasias/metabolismo , Análise de Célula Única/métodos , Biologia Computacional/métodos , Análise de Sequência de RNA/métodos , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêuticoRESUMO
BACKGROUND: Atherosclerosis is a globally prevalent chronic inflammatory disease with high morbidity and mortality. The development of atherosclerotic lesions is determined by macrophages. This study aimed to investigate the specific role of myeloid-derived CD147 (cluster of differentiation 147) in atherosclerosis and its translational significance. METHODS AND RESULTS: We generated mice with a myeloid-specific knockout of CD147 and mice with restricted CD147 overexpression, both in an apoE-deficient (ApoE-/-) background. Here, the myeloid-specific deletion of CD147 ameliorated atherosclerosis and inflammation. Consistent with our in vivo data, macrophages isolated from myeloid-specific CD147 knockout mice exhibited a phenotype shift from proinflammatory to anti-inflammatory macrophage polarization in response to lipopolysaccharide/IFN (interferon)-γ. These macrophages demonstrated a weakened proinflammatory macrophage phenotype, characterized by reduced production of NO and reactive nitrogen species derived from iNOS (inducible NO synthase). Mechanistically, the TRAF6 (tumor necrosis factor receptor-associated factor 6)-IKK (inhibitor of κB kinase)-IRF5 (IFN regulatory factor 5) signaling pathway was essential for the effect of CD147 on proinflammatory responses. Consistent with the reduced size of the necrotic core, myeloid-specific CD147 deficiency diminished the susceptibility of iNOS-mediated late apoptosis, accompanied by enhanced efferocytotic capacity mediated by increased secretion of GAS6 (growth arrest-specific 6) in proinflammatory macrophages. These findings were consistent in a mouse model with myeloid-restricted overexpression of CD147. Furthermore, we developed a new atherosclerosis model in ApoE-/- mice with humanized CD147 transgenic expression and demonstrated that the administration of an anti-human CD147 antibody effectively suppressed atherosclerosis by targeting inflammation and efferocytosis. CONCLUSIONS: Myeloid CD147 plays a crucial role in the growth of plaques by promoting inflammation in a TRAF6-IKK-IRF5-dependent manner and inhibiting efferocytosis by suppressing GAS6 during proinflammatory conditions. Consequently, the use of anti-human CD147 antibodies presents a complementary therapeutic approach to the existing lipid-lowering strategies for treating atherosclerotic diseases.
Assuntos
Aterosclerose , Placa Aterosclerótica , Camundongos , Animais , Eferocitose , Fator 6 Associado a Receptor de TNF/metabolismo , Aterosclerose/metabolismo , Inflamação/genética , Camundongos Knockout , Fenótipo , Apolipoproteínas E , Fatores Reguladores de Interferon/genética , Camundongos Endogâmicos C57BLRESUMO
Silencers are noncoding DNA sequence fragments located on the genome that suppress gene expression. The variation of silencers in specific cells is closely related to gene expression and cancer development. Computational approaches that exclusively rely on DNA sequence information for silencer identification fail to account for the cell specificity of silencers, resulting in diminished accuracy. Despite the discovery of several transcription factors and epigenetic modifications associated with silencers on the genome, there is still no definitive biological signal or combination thereof to fully characterize silencers, posing challenges in selecting suitable biological signals for their identification. Therefore, we propose a sophisticated deep learning framework called DeepICSH, which is based on multiple biological data sources. Specifically, DeepICSH leverages a deep convolutional neural network to automatically capture biologically relevant signal combinations strongly associated with silencers, originating from a diverse array of biological signals. Furthermore, the utilization of attention mechanisms facilitates the scoring and visualization of these signal combinations, whereas the employment of skip connections facilitates the fusion of multilevel sequence features and signal combinations, thereby empowering the accurate identification of silencers within specific cells. Extensive experiments on HepG2 and K562 cell line data sets demonstrate that DeepICSH outperforms state-of-the-art methods in silencer identification. Notably, we introduce for the first time a deep learning framework based on multi-omics data for classifying strong and weak silencers, achieving favorable performance. In conclusion, DeepICSH shows great promise for advancing the study and analysis of silencers in complex diseases. The source code is available at https://github.com/lyli1013/DeepICSH.
Assuntos
Aprendizado Profundo , Genoma Humano , Humanos , Linhagem Celular , Epigênese Genética , MultiômicaRESUMO
With the emergence of spatial transcriptome sequencing (ST-seq), research now heavily relies on the joint analysis of ST-seq and single-cell RNA sequencing (scRNA-seq) data to precisely identify cell spatial composition in tissues. However, common methods for combining these datasets often merge data from multiple cells to generate pseudo-ST data, overlooking topological relationships and failing to represent spatial arrangements accurately. We introduce GTAD, a method utilizing the Graph Attention Network for deconvolution of integrated scRNA-seq and ST-seq data. GTAD effectively captures cell spatial relationships and topological structures within tissues using a graph-based approach, enhancing cell-type identification and our understanding of complex tissue cellular landscapes. By integrating scRNA-seq and ST data into a unified graph structure, GTAD outperforms traditional 'pseudo-ST' methods, providing robust and information-rich results. GTAD performs exceptionally well with synthesized spatial data and accurately identifies cell spatial composition in tissues like the mouse cerebral cortex, cerebellum, developing human heart and pancreatic ductal carcinoma. GTAD holds the potential to enhance our understanding of tissue microenvironments and cellular diversity in complex bio-logical systems. The source code is available at https://github.com/zzhjs/GTAD.
Assuntos
Análise da Expressão Gênica de Célula Única , Software , Humanos , Animais , CamundongosRESUMO
MOTIVATION: Cell clustering is foundational for analyzing the heterogeneity of biological tissues using single-cell sequencing data. With the maturation of single-cell multi-omics sequencing technologies, we can integrate multiple omics data to perform cell clustering, thereby overcoming the limitations of insufficient information from single omics data. Existing methods for cell clustering often only consider the differences in data patterns during the analysis of multi-omics data, but the dependencies between omics features of different cell types also significantly influence cell clustering. Moreover, the high dropout rates in scRNA-seq and scATAC-seq data can impact the performance of cell clustering. RESULTS: We propose a cell clustering model based on a masked autoencoder, scDRMAE. Utilizing a masking mechanism, scDRMAE effectively learns the relationships between different features and imputes false zeros caused by dropout events. To differentiate the importance of various omics data in cell clustering, we dynamically adjust the weights of different omics data through an attention mechanism. Finally, we use the K-means algorithm for cluster analysis of the fused multi-omics data. On commonly used sets of 15 multi-omics datasets, our method demonstrates superior cell clustering performance on multiple metrics compared to other computational methods. In addition, when datasets exhibit varying degrees of dropout noise, our method shows better performance and stronger stability on multiple metrics compared to other methods. Moreover, by analyzing the cell clusters classified by scDRMAE, we identified several biologically significant biomarkers that have been validated, further confirming the effectiveness of scDRMAE in cell clustering from a biological perspective.
Assuntos
Algoritmos , Análise de Célula Única , Análise por Conglomerados , Humanos , Análise de Célula Única/métodos , Biologia Computacional/métodosRESUMO
BACKGROUND AND AIMS: Acute liver failure (ALF) is a rare but life-threatening condition, and DILI, particularly acetaminophen toxicity, is the leading cause of ALF. Innate immune mechanisms further perpetuate liver injury, while the role of the adaptive immune system in DILI-related ALF is unclear. APPROACH AND RESULTS: We analyzed liver tissue from 2 independent patient cohorts with ALF and identified hepatic T cell infiltration as a prominent feature in human ALF. CD8 + T cells were characterized by zonation toward necrotic regions and an activated gene expression signature. In murine acetaminophen-induced liver injury, intravital microscopy revealed zonation of CD8 + but not CD4 + T cells at necrotic areas. Gene expression analysis exposed upregulated C-C chemokine receptor 7 (CCR7) and its ligand CCL21 in the liver as well as a broadly activated phenotype of hepatic CD8 + T cells. In 2 mouse models of ALF, Ccr7-/- mice had significantly aggravated early-phase liver damage. Functionally, CCR7 was not involved in the recruitment of CD8 + T cells, but regulated their activation profile potentially through egress to lymphatics. Ccr7-/- CD8 + T cells were characterized by elevated expression of activation, effector, and exhaustion profiles. Adoptive transfer revealed preferential homing of CCR7-deficient CD8 + T cells to the liver, and depletion of CD8 + T cells attenuated liver damage in mice. CONCLUSIONS: Our study demonstrates the involvement of the adaptive immune system in ALF in humans and mice. We identify the CCR7-CCL21 axis as an important regulatory pathway, providing downstream protection against T cell-mediated liver injury.
Assuntos
Linfócitos T CD8-Positivos , Homeostase , Falência Hepática Aguda , Receptores CCR7 , Animais , Receptores CCR7/metabolismo , Receptores CCR7/genética , Linfócitos T CD8-Positivos/imunologia , Linfócitos T CD8-Positivos/metabolismo , Camundongos , Humanos , Falência Hepática Aguda/imunologia , Falência Hepática Aguda/induzido quimicamente , Falência Hepática Aguda/metabolismo , Falência Hepática Aguda/patologia , Masculino , Fígado/patologia , Fígado/metabolismo , Fígado/imunologia , Acetaminofen/toxicidade , Acetaminofen/efeitos adversos , Quimiocina CCL21/metabolismo , Quimiocina CCL21/genética , Doença Hepática Induzida por Substâncias e Drogas/imunologia , Doença Hepática Induzida por Substâncias e Drogas/metabolismo , Doença Hepática Induzida por Substâncias e Drogas/patologia , Modelos Animais de Doenças , Camundongos Endogâmicos C57BL , Feminino , Camundongos KnockoutRESUMO
Lewy body diseases (LBD) comprise a group of complex neurodegenerative conditions originating from accumulation of misfolded alpha-synuclein (α-syn) in the form of Lewy bodies. LBD pathologies are characterized by α-syn deposition in association with other proteins such as Amyloid ß (Aß), Tau, and TAR-DNA-binding protein. To investigate the complex interactions of these proteins, we constructed 2 novel transgenic overexpressing (OE) C. elegans strains (α-synA53T;Taupro-agg (OE) and α-synA53T;Aß1-42;Taupro-agg (OE)) and compared them with previously established Parkinson's, Alzheimer's, and Lewy Body Dementia disease models. The LBD models presented here demonstrate impairments including uncoordinated movement, egg-laying deficits, altered serotonergic and cholinergic signaling, memory and posture deficits, as well as dopaminergic neuron damage and loss. Expression levels of total and prone to aggregation α-syn protein were increased in α-synA53T;Aß1-42 but decreased in α-synA53T;Taupro-agg animals when compared to α-synA53T animals suggesting protein interactions. These alterations were also observed at the mRNA level suggesting a pre-transcriptional mechanism. miRNA-seq revealed that cel-miR-1018 was upregulated in LBD models α-synA53T, α-synA53T;Aß1-42, and α-synA53T;Taupro-agg compared with WT. cel-miR-58c was upregulated in α-synA53T;Taupro-agg but downregulated in α-synA53T and α-synA53T;Aß1-42 compared with WT. cel-miR-41-3p and cel-miR-355-5p were significantly downregulated in 3 LBD models. Our results obtained in a model organism provide evidence of interactions between different pathological proteins and alterations in specific miRNAs that may further exacerbate or ameliorate LBD pathology.
Assuntos
Peptídeos beta-Amiloides , Animais Geneticamente Modificados , Caenorhabditis elegans , Modelos Animais de Doenças , Doença por Corpos de Lewy , MicroRNAs , alfa-Sinucleína , Animais , Caenorhabditis elegans/metabolismo , Caenorhabditis elegans/genética , MicroRNAs/genética , MicroRNAs/metabolismo , Doença por Corpos de Lewy/metabolismo , Doença por Corpos de Lewy/patologia , Doença por Corpos de Lewy/genética , alfa-Sinucleína/metabolismo , alfa-Sinucleína/genética , Peptídeos beta-Amiloides/metabolismo , Peptídeos beta-Amiloides/genética , Proteínas de Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/genética , Humanos , Proteínas tau/metabolismo , Proteínas tau/genética , Neurônios Dopaminérgicos/metabolismo , Neurônios Dopaminérgicos/patologiaRESUMO
PIWI-interacting RNAs (piRNAs) are short 21-35 nucleotide molecules that comprise the largest class of non-coding RNAs and found in a large diversity of species including yeast, worms, flies, plants and mammals including humans. The most well-understood function of piRNAs is to monitor and protect the genome from transposons particularly in germline cells. Recent data suggest that piRNAs may have additional functions in somatic cells although they are expressed there in far lower abundance. Compared with microRNAs (miRNAs), piRNAs have more limited bioinformatics resources available. This review collates 39 piRNA specific and non-specific databases and bioinformatics resources, describes and compares their utility and attributes and provides an overview of their place in the field. In addition, we review 33 computational models based upon function: piRNA prediction, transposon element and mRNA-related piRNA prediction, cluster prediction, signature detection, target prediction and disease association. Based on the collection of databases and computational models, we identify trends and potential gaps in tool development. We further analyze the breadth and depth of piRNA data available in public sources, their contribution to specific human diseases, particularly in cancer and neurodegenerative conditions, and highlight a few specific piRNAs that appear to be associated with these diseases. This briefing presents the most recent and comprehensive mapping of piRNA bioinformatics resources including databases, models and tools for disease associations to date. Such a mapping should facilitate and stimulate further research on piRNAs.
Assuntos
Proteínas Argonautas , Células Germinativas , Animais , Proteínas Argonautas/genética , Proteínas Argonautas/metabolismo , Simulação por Computador , Elementos de DNA Transponíveis , Células Germinativas/metabolismo , Humanos , Mamíferos/genética , Mamíferos/metabolismo , RNA Interferente Pequeno/genéticaRESUMO
CRISPR-Cas system is an adaptive immune system widely found in most bacteria and archaea to defend against exogenous gene invasion. One of the most critical steps in the study of exploring and classifying novel CRISPR-Cas systems and their functional diversity is the identification of Cas proteins in CRISPR-Cas systems. The discovery of novel Cas proteins has also laid the foundation for technologies such as CRISPR-Cas-based gene editing and gene therapy. Currently, accurate and efficient screening of Cas proteins from metagenomic sequences and proteomic sequences remains a challenge. For Cas proteins with low sequence conservation, existing tools for Cas protein identification based on homology cannot guarantee identification accuracy and efficiency. In this paper, we have developed a novel stacking-based ensemble learning framework for Cas protein identification, called CRISPRCasStack. In particular, we applied the SHAP (SHapley Additive exPlanations) method to analyze the features used in CRISPRCasStack. Sufficient experimental validation and independent testing have demonstrated that CRISPRCasStack can address the accuracy deficiencies and inefficiencies of the existing state-of-the-art tools. We also provide a toolkit to accurately identify and analyze potential Cas proteins, Cas operons, CRISPR arrays and CRISPR-Cas locus in prokaryotic sequences. The CRISPRCasStack toolkit is available at https://github.com/yrjia1015/CRISPRCasStack.
Assuntos
Archaea , Proteômica , Archaea/genética , Sistemas CRISPR-Cas , Edição de Genes/métodos , Aprendizado de MáquinaRESUMO
High-quality genome chromosome-scale sequences provide an important basis for genomics downstream analysis, especially the construction of haplotype-resolved and complete genomes, which plays a key role in genome annotation, mutation detection, evolutionary analysis, gene function research, comparative genomics and other aspects. However, genome-wide short-read sequencing is difficult to produce a complete genome in the face of a complex genome with high duplication and multiple heterozygosity. The emergence of long-read sequencing technology has greatly improved the integrity of complex genome assembly. We review a variety of computational methods for complex genome assembly and describe in detail the theories, innovations and shortcomings of collapsed, semi-collapsed and uncollapsed assemblers based on long reads. Among the three methods, uncollapsed assembly is the most correct and complete way to represent genomes. In addition, genome assembly is closely related to haplotype reconstruction, that is uncollapsed assembly realizes haplotype reconstruction, and haplotype reconstruction promotes uncollapsed assembly. We hope that gapless, telomere-to-telomere and accurate assembly of complex genomes can be truly routinely achieved using only a simple process or a single tool in the future.
Assuntos
Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Mapeamento Cromossômico , Genoma , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodosRESUMO
MOTIVATION: Enhancers are vital cis-regulatory elements that regulate gene expression. Enhancer RNAs (eRNAs), a type of long noncoding RNAs, are transcribed from enhancer regions in the genome. The tissue-specific expression of eRNAs is crucial in the regulation of gene expression and cancer development. The methods that identify eRNAs based solely on genomic sequence data have high error rates because they do not account for tissue specificity. Specific histone modifications associated with eRNAs offer valuable information for their identification. However, identification of eRNAs using histone modification data requires the use of both RNA-seq and histone modification data. Unfortunately, many public datasets contain only one of these components, which impedes the accurate identification of eRNAs. RESULTS: We introduce DeepITEH, a deep learning framework that leverages RNA-seq data and histone modification data from multiple samples of the same tissue to enhance the accuracy of identifying eRNAs. Specifically, deepITEH initially categorizes eRNAs into two classes, namely, regularly expressed eRNAs and accidental eRNAs, using histone modification data from multiple samples of the same tissue. Thereafter, it integrates both sequence and histone modification features to identify eRNAs in specific tissues. To evaluate the performance of DeepITEH, we compared it with four existing state-of-the-art enhancer prediction methods, SeqPose, iEnhancer-RD, LSTMAtt, and FRL, on four normal tissues and four cancer tissues. Remarkably, seven of these tissues demonstrated a substantially improved specific eRNA prediction performance with DeepITEH, when compared with other methods. Our findings suggest that DeepITEH can effectively predict potential eRNAs on the human genome, providing insights for studying the eRNA function in cancer. AVAILABILITY AND IMPLEMENTATION: The source code and dataset of DeepITEH have been uploaded to https://github.com/lyli1013/DeepITEH.
Assuntos
Aprendizado Profundo , RNA Longo não Codificante , Humanos , Transcrição Gênica , Genoma Humano , Histonas/genética , Elementos Facilitadores Genéticos , RNA Longo não Codificante/genéticaRESUMO
BACKGROUND: In previous publications, the Task Force on Reference Measurement System Implementation proposed a procedural approach combining a critical review of entries available in the Joint Committee on Traceability in Laboratory Medicine (JCTLM) database with a comparison of this information against analytical performance specifications for measurement uncertainty (MU) and applied it to a group of 13 measurands. CONTENT: Here we applied this approach to 17 additional measurands, of which measurements are frequently requested. The aims of the study were (a) to describe the main characteristics for implementing traceability and the potential to fulfill the maximum allowable MU (MAU) at the clinical sample level of certified reference materials and reference measurement procedures listed in the JCTLM database; (b) to discuss limitations and obstacles, if any, to the achievement of the required quality of laboratory measurements; and (c) to provide a gap analysis by highlighting what is still missing in the database. Results were integrated with those obtained in the previous study, therefore offering an overview of where we are and what is still missing in the practical application of the metrological traceability concept to 30 common biochemical tests employed in laboratory medicine. SUMMARY: Our analysis shows that for 28 out of 30 measurands, conditions exist to correctly implement metrological traceability to the International System of units and fulfill at least the MAU of the minimum quality level derived according to internationally recommended models. For 2 measurands (serum albumin and chloride), further improvements in MU of higher-order references would be necessary.
RESUMO
Compared to animal cells, phenotypic characterization of single plant cells on microfluidic platforms is still rare. In this work, we collated population statistics on the morphological, biochemical, physical and electrical properties of Arabidopsis protoplasts under different external and internal conditions, using progressively improved microfluidic platforms. First, we analyzed the different effects of three phytohormones (auxin, cytokinin and gibberellin) on the primary cell wall (PCW) regeneration process using a microfluidic flow cytometry platform equipped with a single-channel fluorescence sensor. Second, we correlated the intracellular reactive oxygen species (ROS) level induced by heavy metal stress with the concurrent PCW regeneration process by using a dual-channel fluorescence sensor. Third, by integrating contraction channels, we were able to effectively discriminate variations in cell size while monitoring the intensity of intracellular ROS signaling. Fourth, by combining an electrical impedance electrode with the contraction channel, we analyzed the differences in electrical and mechanical properties of wild-type and mutant plant cells before and after primary cell wall regeneration. Overall, our work demonstrates the feasibility and sensitivity of microfluidic flow cytometry in high-throughput phenotyping of plant cells and provides a reference for assessing metabolic and physiological indicators of individual plant cells in multiple dimensions.
Assuntos
Arabidopsis , Citometria de Fluxo , Fenótipo , Espécies Reativas de Oxigênio , Arabidopsis/citologia , Arabidopsis/fisiologia , Citometria de Fluxo/métodos , Espécies Reativas de Oxigênio/metabolismo , Análise de Célula Única/métodos , Análise de Célula Única/instrumentação , Protoplastos/efeitos dos fármacos , Protoplastos/citologia , Técnicas Analíticas Microfluídicas/métodos , Técnicas Analíticas Microfluídicas/instrumentação , Reguladores de Crescimento de Plantas/farmacologia , Parede Celular/química , Parede Celular/efeitos dos fármacos , Dispositivos Lab-On-A-ChipRESUMO
OBJECTIVES: This study aims to evaluate the commutability of external quality assessment (EQA) materials and candidate reference materials (RMs) for plasma renin activity (PRA) assay. METHODS: Commutabilities of 16 candidate RMs were measured along with 40 clinical samples by the four different routine PRA assays, including three LCâMS/MS assays and one chemiluminescence immunoassay. Sixteen candidate RMs included native/spiked human plasma pools (small-scale pools with <50 individuals) and current EQA materials (large-scale pools with >1,000 individuals). Difference in bias approach and linear regression with prediction interval approach were adopted to determine the commutability. Two-way variance analysis was used to estimate the effects of spiked and pool size on the commutability. Stability and homogeneity studies were performed. RESULTS: Precision and correlation performance of all assays was acceptable. In the difference in bias approach, the commutability results were not satisfactory (noncommutability: 14/16) and significant sample-specific effects were detected in assay pairs using different incubation buffers. For the prediction interval approach, no commutability was observed in the spiked small-scale pools; EQA materials (4/9) had more satisfactory commutability among all assays than the small-scale pools (2/7); RMs of large-scale pools tend to have better commutability no matter spiked or not. CONCLUSIONS: Commutable RMs were obtainable but challenging. Current EQA materials with relatively good commutability, stability, and homogeneity were appropriate RMs. Large-scale pools are tending to be commutable. Spiking in small-scale pools was not suggested to prepare RMs. MPs adopting a uniform incubation buffer would be preferable for further commutability research.
Assuntos
Renina , Espectrometria de Massas em Tandem , Humanos , Padrões de Referência , Cromatografia Líquida , ViésRESUMO
OBJECTIVES: The standardization of cystatin C (CysC) measurement has received increasing attention in recent years due to its importance in estimating glomerular filtration rate (GFR). Mass spectrometry-based assays have the potential to provide an accuracy base for CysC measurement. However, a precise, accurate and sustainable LC-MS/MS method for CysC is still lacking. METHODS: The developed LC-MS/MS method quantified CysC by detecting signature peptide (T3) obtained from tryptic digestion. Stable isotope labeled T3 peptide (SIL-T3) was spiked to control matrix effects and errors caused by liquid handling. The protein denaturation, reduction and alkylation procedures were combined into a single step with incubation time of 1â¯h, and the digestion lasted for 3.5â¯h. In the method validation, digestion time-course, imprecision, accuracy, matrix effect, interference, limit of quantification (LOQ), carryover, linearity, and the comparability to two routine immunoassays were evaluated. RESULTS: No significant matrix effect or interference was observed with the CysC measurement. The LOQ was 0.21â¯mg/L; the within-run and total imprecision were 1.33-2.05â¯% and 2.18-3.90â¯% for three serum pools (1.18-5.34â¯mg/L). The LC-MS/MS method was calibrated by ERM-DA471/IFCC and showed good correlation with two immunoassays traceable to ERM-DA471/IFCC. However, significant bias was observed for immunoassays against the LC-MS/MS method. CONCLUSIONS: The developed LC-MS/MS method is robust and simpler and holds the promise to provide an accuracy base for routine immunoassays, which will promote the standardization of CysC measurement.
Assuntos
Cistatina C , Espectrometria de Massa com Cromatografia Líquida , Humanos , Cistatina C/sangue , Imunoensaio/métodos , Limite de Detecção , Espectrometria de Massa com Cromatografia Líquida/métodos , Espectrometria de Massas em Tandem/métodosRESUMO
BACKGROUND: This study aimed to investigate the utilization rate and equity of health examination service among the middle-aged and elderly population in China from 2011 to 2018. The contribution of various determinants to the inequity in health examination service utilization was also examined. METHODS: Data from the China Health and Retirement Longitudinal Survey (CHARLS) were analyzed to assess the health examination service utilization rate among the middle-aged and elderly population. A concentration curve and concentration index were employed to measure the equity of health examination service utilization and decomposed into its determining factors. Horizontal inequity index was applied to evaluate the trends in equity of health examination service. RESULTS: The health examination service utilization rates among the middle-aged and elderly population were 29.45%, 20.69%, 25.40%, and 32.05% in 2011, 2013, 2015, and 2018, respectively. The concentration indexes for health examination service utilization were 0.0080 (95% CI: - 0.0084, 0.0244), 0.0155 (95% CI: - 0.0054, 0.0363), 0.0095 (95% CI: - 0.0088, 0.0277), and - 0.0100 (95% CI: - 0.0254, 0.0054) from 2011 to 2018, respectively. The horizontal inequity index was positive from 2011 to 2018, evidencing a pro-rich inequity trend. Age, residence, education, region, and economic status were the major identified contributors influencing the equity of health examination service utilization. CONCLUSIONS: A pro-rich inequity existed in health examination service utilization among the middle-aged and elderly population in China. Reducing the wealth and regional gap, providing equal educational opportunities, and strengthening the capacity for chronic disease prevention and control are crucial for reducing the inequity in health examination service utilization.
Assuntos
Disparidades em Assistência à Saúde , Aposentadoria , Pessoa de Meia-Idade , Humanos , Idoso , Fatores Socioeconômicos , China , Estudos LongitudinaisRESUMO
With the widespread adoption of next-generation sequencing technologies, the speed and convenience of genome sequencing have significantly improved, and many biological genomes have been sequenced. However, during the assembly of small genomes, we still face a series of challenges, including repetitive fragments, inverted repeats, low sequencing coverage, and the limitations of sequencing technologies. These challenges lead to unknown gaps in small genomes, hindering complete genome assembly. Although there are many existing assembly software options, they do not fully utilize the potential of artificial intelligence technologies, resulting in limited improvement in gap filling. Here, we propose a novel method, DLGapCloser, based on deep learning, aimed at assisting traditional tools in further filling gaps in small genomes. Firstly, we created four datasets based on the original genomes of Saccharomyces cerevisiae, Schizosaccharomyces pombe, Neurospora crassa, and Micromonas pusilla. To further extract effective information from the gene sequences, we also added homologous genomes to enrich the datasets. Secondly, we proposed the DGCNet model, which effectively extracts features and learns context from sequences flanking gaps. Addressing issues with early pruning and high memory usage in the Beam Search algorithm, we developed a new prediction algorithm, Wave-Beam Search. This algorithm alternates between expansion and contraction phases, enhancing efficiency and accuracy. Experimental results showed that the Wave-Beam Search algorithm improved the gap-filling performance of assembly tools by 7.35%, 28.57%, 42.85%, and 8.33% on the original results. Finally, we established new gap-filling standards and created and implemented a novel evaluation method. Validation on the genomes of Saccharomyces cerevisiae, Schizosaccharomyces pombe, Neurospora crassa, and Micromonas pusilla showed that DLGapCloser increased the number of filled gaps by 8.05%, 15.3%, 1.4%, and 7% compared to traditional assembly tools.
Assuntos
Redes Neurais de Computação , Algoritmos , Aprendizado Profundo , Genoma Fúngico , Saccharomyces cerevisiae/genética , Schizosaccharomyces/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neurospora crassa/genética , Software , Genômica/métodos , Análise de Sequência de DNA/métodosRESUMO
Single-cell RNA sequencing (scRNA-seq) is widely used to interpret cellular states, detect cell subpopulations, and study disease mechanisms. In scRNA-seq data analysis, cell clustering is a key step that can identify cell types. However, scRNA-seq data are characterized by high dimensionality and significant sparsity, presenting considerable challenges for clustering. In the high-dimensional gene expression space, cells may form complex topological structures. Many conventional scRNA-seq data analysis methods focus on identifying cell subgroups rather than exploring these potential high-dimensional structures in detail. Although some methods have begun to consider the topological structures within the data, many still overlook the continuity and complex topology present in single-cell data. We propose a deep learning framework that begins by employing a zero-inflated negative binomial (ZINB) model to denoise the highly sparse and over-dispersed scRNA-seq data. Next, scZAG uses an adaptive graph contrastive representation learning approach that combines approximate personalized propagation of neural predictions graph convolution (APPNPGCN) with graph contrastive learning methods. By using APPNPGCN as the encoder for graph contrastive learning, we ensure that each cell's representation reflects not only its own features but also its position in the graph and its relationships with other cells. Graph contrastive learning exploits the relationships between nodes to capture the similarity among cells, better representing the data's underlying continuity and complex topology. Finally, the learned low-dimensional latent representations are clustered using Kullback-Leibler divergence. We validated the superior clustering performance of scZAG on 10 common scRNA-seq datasets in comparison to existing state-of-the-art clustering methods.