RESUMO
The advancement of spatial transcriptomics (ST) technology contributes to a more profound comprehension of the spatial properties of gene expression within tissues. However, due to challenges of high dimensionality, pronounced noise and dynamic limitations in ST data, the integration of gene expression and spatial information to accurately identify spatial domains remains challenging. This paper proposes a SpaNCMG algorithm for the purpose of achieving precise spatial domain description and localization based on a neighborhood-complementary mixed-view graph convolutional network. The algorithm enables better adaptation to ST data at different resolutions by integrating the local information from KNN and the global structure from r-radius into a complementary neighborhood graph. It also introduces an attention mechanism to achieve adaptive fusion of different reconstructed expressions, and utilizes KPCA method for dimensionality reduction. The application of SpaNCMG on five datasets from four sequencing platforms demonstrates superior performance to eight existing advanced methods. Specifically, the algorithm achieved highest ARI accuracies of 0.63 and 0.52 on the datasets of the human dorsolateral prefrontal cortex and mouse somatosensory cortex, respectively. It accurately identified the spatial locations of marker genes in the mouse olfactory bulb tissue and inferred the biological functions of different regions. When handling larger datasets such as mouse embryos, the SpaNCMG not only identified the main tissue structures but also explored unlabeled domains. Overall, the good generalization ability and scalability of SpaNCMG make it an outstanding tool for understanding tissue structure and disease mechanisms. Our codes are available at https://github.com/ZhihaoSi/SpaNCMG.
Assuntos
Algoritmos , Transcriptoma , Humanos , Animais , Camundongos , Perfilação da Expressão Gênica/métodos , Redes Neurais de Computação , Biologia Computacional/métodos , Córtex Pré-Frontal/metabolismoRESUMO
The emerging importance of embryonic development research rapidly increases the volume for a professional resource related to multi-omics data. However, the lack of global embryogenesis repository and systematic analysis tools limits the preceding in stem cell research, human congenital diseases and assisted reproduction. Here, we developed the EmAtlas, which collects the most comprehensive multi-omics data and provides multi-scale tools to explore spatiotemporal activation during mammalian embryogenesis. EmAtlas contains data on multiple types of gene expression, chromatin accessibility, DNA methylation, nucleosome occupancy, histone modifications, and transcription factors, which displays the complete spatiotemporal landscape in mouse and human across several time points, involving gametogenesis, preimplantation, even fetus and neonate, and each tissue involves various cell types. To characterize signatures involved in the tissue, cell, genome, gene and protein levels during mammalian embryogenesis, analysis tools on these five scales were developed. Additionally, we proposed EmRanger to deliver extensive development-related biological background annotations. Users can utilize these tools to analyze, browse, visualize, and download data owing to the user-friendly interface. EmAtlas is freely accessible at http://bioinfor.imu.edu.cn/ematlas.
Assuntos
Embrião de Mamíferos , Desenvolvimento Embrionário , Animais , Humanos , Recém-Nascido , Camundongos , Cromatina/genética , Metilação de DNA , Desenvolvimento Embrionário/genética , Genoma , Mamíferos/genética , Nucleossomos , Atlas como AssuntoRESUMO
The imaging spectrometer's high performance in practical applications may be compromised by environmental factors, particularly temperature variations, posing a challenge to its stability. Temperature fluctuations can induce spectral shift, directly impacting the accuracy of spectral measurements, subsequently influencing the precision of radiometric measurements. To address this issue, this study investigates a dual-channel UV imaging spectrometer. This instrument boasts a wavelength calibration accuracy of 0.01â nm. This paper conducts an in-depth analysis of the various mechanisms through which temperature changes influence the spectral line offset in the imaging spectrometer, integrating actual orbital temperature data to discuss the instrument's temperature load settings. The impact of temperature on spectral shift is examined using finite element analysis and optical design software. Estimations of spectral shift were made based on temperature variations. Simulation results indicated that the maximum deviation of spectral shift is estimated at 0.018â nm under a temperature condition of 16 ± 1°C. Under a more controlled orbital temperature condition (16 ± 0.3°C), the maximum deviation of spectral shift decreased to 0.01â nm. Experimental data revealed that at 16 ± 1°C, the maximum deviation of spectral shift did not exceed 0.01â nm. This effectively corroborates our theoretical analysis. The relationship between temperature and spectral shift offers a crucial theoretical foundation for calibrating spectral measurements and managing the thermal conditions of the instrument.
RESUMO
Long non-coding RNAs (lncRNAs), as promising novel biomarkers for cancer treatment and prognosis, can function as tumor suppressors and oncogenes in the occurrence and development of many types of cancer, including gastric cancer (GC). However, little is known about the complex regulatory system of lncRNAs in GC. In this study, we systematically analyzed lncRNA and miRNA transcriptomic profiles of GC based on bioinformatics methods and experimental validation. An lncRNA-miRNA interaction network related to GC was constructed, and the nine crucial lncRNAs were identified. These 9 lncRNAs were found to be associated with the prognosis of GC patients by Cox proportional hazards regression analysis. Among them, the expression of lncRNA SNHG14 can affect the survival of GC patients as a potential prognostic marker. Moreover, it was shown that SNHG14 was involved in immune-related pathways and significantly correlated with immune cell infiltration in GC. Meanwhile, we found that SNHG14 affected immune function in many cancers, such as breast cancer and esophageal carcinoma. Such information revealed that SNHG14 may serve as a potential target for cancer immunotherapy. As well, our study could provide practical and theoretical guiding significance for clinical application of non-coding RNAs.
RESUMO
Hyperspectral detection of the change rate of organic matter content in agricultural remote sensing requires a high signal-to-noise ratio (SNR). However, due to the large number and efficiency limitation of the components, it is difficult to improve the SNR. This study uses high-efficiency convex grating with a diffraction efficiency exceeding 50% across the 360-850 nm range, a back-illuminated Complementary Metal Oxide Semiconductor (CMOS) detector with a 95% efficiency in peak wavelength, and silver-coated mirrors to develop an imaging spectrometer for detecting soil organic matter (SOM). The designed system meets the spectral resolution of 10 nm in the 360-850 nm range and achieves a swath of 100 km and a spatial resolution of 100 m at an orbital height of 648.2 km. This study also uses the basic structure of Offner with fewer components in the design and sets the mirrors of the Offner structure to have the same sphere, which can achieve the rapid adjustment of the co-standard. This study performs a theoretical analysis of the developed Offner imaging spectrometer based on the classical Rowland circular structure, with a 21.8 mm slit length; simulates its capacity for suppressing the +2nd-order diffraction stray light with the filter; and analyzes the imaging quality after meeting the tolerance requirements, which is combined with the surface shape characteristics of the high-efficiency grating. After this test, the grating has a diffraction efficiency above 50%, and the silver-coated mirrors have a reflection value above 95% on average. Finally, the laboratory tests show that the SNR over the waveband exceeds 300 and reaches 800 at 550 nm, which is higher than some current instruments in orbit for soil observation. The proposed imaging spectrometer has a spectral resolution of 10 nm, and its modulation transfer function (MTF) is greater than 0.23 at the Nyquist frequency, making it suitable for remote sensing observation of SOM change rate. The manufacture of such a high-efficiency broadband grating and the development of the proposed instrument with high energy transmission efficiency can provide a feasible technical solution for observing faint targets with a high SNR.
RESUMO
Developmental pluripotency-associated 2 (Dppa2) and developmental pluripotency-associated 4 (Dppa4) as positive drivers were helpful for transcriptional regulation of zygotic genome activation (ZGA). Here, we systematically assessed the cooperative interplay of Dppa2 and Dppa4 in regulating cell pluripotency and found that simultaneous overexpression of Dppa2/4 can make induced pluripotent stem cells closer to embryonic stem cells (ESCs). Compared with other pluripotency transcription factors, Dppa2/4 can regulate majorities of signaling pathways by binding on CG-rich region of proximal promoter (0-500 bp), of which 85% and 77% signaling pathways were significantly activated by Dppa2 and Dppa4, respectively. Notably, Dppa2/4 also can dramatically trigger the decisive signaling pathways for facilitating ZGA, including Hippo, MAPK and TGF-beta signaling pathways and so on. At last, we found alkaline phosphatase, placental-like 2 (Alppl2) was completely silenced when Dppa2 and 4 single- or double-knockout in ESC, which is consistent with Dux. Moreover, Alppl2 was significantly activated in mouse 2-cell embryos and 4-8 cells stage of human embryos, further predicted that Alppl2 was directly regulated by Dppa2/4 as a ZGA candidate driver to facilitate pre-embryonic development.
Assuntos
Ilhas de CpG , Genoma Humano , Proteínas Nucleares/metabolismo , Transdução de Sinais , Fatores de Transcrição/metabolismo , Zigoto/metabolismo , Animais , Blastocisto/metabolismo , Linhagem Celular , Humanos , Camundongos , Proteínas Nucleares/genética , Fatores de Transcrição/genéticaRESUMO
Immersion gratings have high dispersion efficiency and have important application value in miniaturized imaging spectrometers, but its serious dispersion nonlinearity causes difficulties in calibration and image processing, which limits its application range. To solve this, this paper presents a design method for a two-material linear dispersion immersion grating device design method, and a compact small F-number immersion grating spectrometer based on it. First the vector form dispersion equation of the two-material immersion grating is derived and the linear spectral dispersion immersion grating design process is given, then a compact small F-number uniform dispersion imaging spectrometer is given as a design example using the proposed method. The results show that when the operating band of the system is 1590-1675â nm, the spectral resolution is better than 0.25â nm, and F-number can achieve better than 2. Compared with traditional single-material immersion grating imaging spectrometer, the designed imaging spectrometer dispersion linearity is significantly improved. Finally, the influence of prism materials, structure parameters and grating parameters on dispersion nonlinearity is analyzed. Design and analysis results show that the proposed two-material immersion grating device has much better spectral dispersion nonlinearity correction ability, and its design method can provide reference to the compact spectrometer design based on immersion gratings.
RESUMO
ABCB1 is an important gene that closely related to analgesic tolerance to opioids, and plays an important role in their postoperative treatment. Recent studies have demonstrated that ABCB1 genotype is significantly associated with the chemico-resistance and chemical sensitivity in breast cancer patients. So, it is become very important to investigate the important role of ABCB1 for predicting drug response in breast cancer patients. In this study, by conducting the Cox proportional hazards regression analysis in breast cancer patients, significant differences were found in prognosis between the ABCB1 high- and low-expression subtypes. Meanwhile, by using immune infiltration profiles as well as transcriptomics datasets, the ABCB1 high subtype was found to be significantly enriched in many immune-related KEGG pathways and biological processes, and was characterized by the high infiltration levels of immune cell types. Furthermore, bioinformatics inference revealed that the ABCB1 subtypes were associated with the therapeutic effect of immunotherapy, which would be important for patient prognosis. In conclusion, these findings may provide useful help for recognizing the diversity between ABCB1 subtypes in tumor immune microenvironment, and may unravel prognosis outcomes and immunotherapy utility for ABCB1 in breast cancer.
Assuntos
Fenômenos Biológicos , Neoplasias da Mama , Subfamília B de Transportador de Cassetes de Ligação de ATP/genética , Subfamília B de Transportador de Cassetes de Ligação de ATP/uso terapêutico , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Feminino , Humanos , Prognóstico , Microambiente Tumoral/genéticaRESUMO
As spectroscopic detection technology rapidly advances, back-illuminated InGaAs detectors with a wider spectral range have emerged. Compared to traditional detectors such as HgCdTe, CCD, and CMOS, InGaAs detectors offer a working range of 400-1800 nm and exhibit a quantum efficiency of over 60% in both the visible and near-infrared bands. This is leading to the demand for innovative designs of imaging spectrometers with wider spectral ranges. However, the widening of the spectral range has led to the presence of significant axial chromatic aberration and secondary spectrum in imaging spectrometers. Additionally, there is difficulty in aligning the system optical axis perpendicular to the detector image plane, resulting in increased challenges during post-installation adjustment. Based on chromatic aberration correction theory, this paper presents the design of a wide spectral range transmission prism-grating imaging spectrometer with a working range of 400-1750 nm using Code V. The spectral range of this spectrometer covers both the visible and near-infrared regions, which is beyond the capability of traditional PG spectrometers. In the past, the working spectral range of transmission-type PG imaging spectrometers has been limited to 400-1000 nm. This study's proposed chromatic aberration correction process involves selecting optical glass materials that match the design requirements and correcting the axial chromatic aberration and secondary spectrum, ensuring that the system axis is perpendicular to the detector plane and easy to adjust during installation. The results show that the spectrometer has a spectral resolution of 5 nm, a root-mean-square spot diagram less than 8 µm over the full field of view, and an optical transfer function MTF greater than 0.6 at a Nyquist frequency of 30 lp/mm. The system size is less than 90 mm. Spherical lenses are employed in the system design to reduce manufacturing costs and complexity while meeting the requirements of wide spectral range, miniaturization, and easy installation.
Assuntos
Fabaceae , Lentes , Diagnóstico por Imagem , Comércio , VidroRESUMO
Many progresses have recently been achieved in animal somatic cell nuclear transfer (SCNT). However, embryos derived from SCNT rarely result in live births. Single-cell RNA sequencing (scRNA-seq) can be used to investigate the development details of SCNT embryos. Here, bovine fibroblasts and three factors bovine iPSCs (3F biPSCs) were used as donors for bovine nuclear transfer, and the single blastomere transcriptome was analysed by scRNA-seq. Compared to in vitro fertilization (IVF) embryos, SCNT embryos exhibited many defects. Abnormally expressed genes were found at each stage of embryos, which enriched in metabolism, and epigenetic modification. The DEGs of the adjacent stage in SCNT embryos did not follow the temporal expression pattern similar to that of IVF embryos. Particularly, SCNT 8-cell stage embryos showed failures in some gene activation, including ZSCAN4, and defects in protein association networks which cored as POLR2K, GRO1, and ANKRD1. Some important signalling pathways also showed incomplete activation at SCNT zygote to morula stage. Interestingly, 3F biPSCNT embryos exhibited more dysregulated genes than SCNT embryos at zygote and 2-cell stage, including genes in KDM family. Pseudotime analysis of 3F biPSCNT embryos showed the different developmental fate from SCNT and IVF embryos. These findings suggested partial reprogrammed 3F biPS cells as donors for bovine nuclear transfer hindered the reprogramming of nuclear transfer embryos. Our studies revealed the abnormal gene expression and pathway activation of SCNT embryos, which could increase our understanding of the development of SCNT embryos and give hints to improve the efficiency of nuclear transfer.
Assuntos
Clonagem de Organismos , Técnicas de Transferência Nuclear , Animais , Bovinos , Reprogramação Celular/genética , Embrião de Mamíferos/metabolismo , Desenvolvimento Embrionário/genética , Fertilização in vitro , Análise de Sequência de RNA , TranscriptomaRESUMO
MOTIVATION: Hematopoietic stem cells (HSCs) give rise to all blood cells and play a vital role throughout the whole lifespan through their pluripotency and self-renewal properties. Accurately identifying the stages of early HSCs is extremely important, as it may open up new prospects for extracorporeal blood research. Existing experimental techniques for identifying the early stages of HSCs development are time-consuming and expensive. Machine learning has shown its excellence in massive single-cell data processing and it is desirable to develop related computational models as good complements to experimental techniques. RESULTS: In this study, we presented a novel predictor called eHSCPr specifically for predicting the early stages of HSCs development. To reveal the distinct genes at each developmental stage of HSCs, we compared F-score with three state-of-art differential gene selection methods (limma, DESeq2, edgeR) and evaluated their performance. F-score captured the more critical surface markers of endothelial cells and hematopoietic cells, and the area under receiver operating characteristic curve (ROC) value was 0.987. Based on SVM, the 10-fold cross-validation accuracy of eHSCpr in the independent dataset and the training dataset reached 94.84% and 94.19%, respectively. Importantly, we performed transcription analysis on the F-score gene set, which indeed further enriched the signal markers of HSCs development stages. eHSCPr can be a powerful tool for predicting early stages of HSCs development, facilitating hypothesis-driven experimental design and providing crucial clues for the in vitro blood regeneration studies. AVAILABILITY AND IMPLEMENTATION: http://bioinfor.imu.edu.cn/ehscpr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
RESUMO
A UV hyperspectral instrument was designed with a polarization measurement channel for real-time in-orbit polarization correction to reduce the influence of polarization on the detection accuracy of atmospheric radiation. One of the prerequisites for in-orbit polarization calibration is accurately calibrating the instrument's polarization properties in the laboratory. This study first introduces the calibration method and measuring device of the polarization characteristics of the ultraviolet (UV) hyperspectral detector and conducts a polarization calibration test of the instrument. The two main error sources introduced by the calibration device were emphatically analyzed, and the correction method of the error sources was deduced theoretically. Finally, the polarization calibration results of the UV hyperspectral detector were corrected, and the uncertainty analysis of the corrected calibration results was about 1.4%, which provides effective ground polarization calibration data for the on-orbit polarization correction of the instrument.
RESUMO
R-loop, a three-stranded RNA/DNA structure, plays important roles in modulating genome stability and gene expression, but the molecular mechanism of R-loops in cell reprogramming remains elusive. Here, we comprehensively profiled the genome-wide landscape of R-loops during cell reprogramming. The results showed that the R-loop formation on most different types of repetitive elements is stage-specific in cell reprogramming. We unveiled that the cumulative deposition of an R-loop subset is positively correlated with gene expression during reprogramming. More importantly, the dynamic turnover of this R-loop subset is accompanied by the activation of the pluripotent transcriptional regulatory network (TRN). Moreover, the large accumulation of the active histone marker H3K4me3 and the reduction in H3K27me3 were also observed in these R-loop regions. Finally, we characterized the dynamic network of R-loops that facilitates cell fate transitions in reprogramming. Together, our study provides a new clue for deciphering the interplay mechanism between R-loops and HMs to control cell reprogramming.
Assuntos
Reprogramação Celular , Código das Histonas , Estruturas R-Loop , Animais , Reprogramação Celular/genética , Regulação da Expressão Gênica no Desenvolvimento , Redes Reguladoras de Genes , Genoma , Código das Histonas/genética , Camundongos , Células-Tronco Pluripotentes/metabolismo , Estruturas R-Loop/genéticaRESUMO
Somatic cell nuclear transfer (SCNT) technology can reprogram terminally differentiated cell nuclei into a totipotent state. However, the underlying molecular barriers of SCNT embryo development remain incompletely elucidated. Here, we observed that transcription-related pathways were incompletely activated in nuclear transfer arrest (NTA) embryos compared to normal SCNT embryos and in vivo fertilized (WT) embryos, which hinders the development of SCNT embryos. We further revealed the transcription pathway associated gene regulatory networks (GRNs) and found the aberrant transcription pathways can lead to the massive dysregulation of genes in NTA embryos. The predicted target genes of transcription pathways contain a series of crucial factors in WT embryos, which play an important role in catabolic process, pluripotency regulation, epigenetic modification and signal transduction. In NTA embryos, however, these genes were varying degrees of inhibition and show a defect in synergy. Overall, our research found that the incomplete activation of transcription pathways is another potential molecular barrier for SCNT embryos besides the incomplete reprogramming of epigenetic modifications, broadening the understanding of molecular mechanism of SCNT embryonic development.
Assuntos
Regulação da Expressão Gênica no Desenvolvimento , Redes Reguladoras de Genes , Técnicas de Transferência Nuclear/efeitos adversos , Transcriptoma , Animais , Blastocisto/metabolismo , Camundongos , RNA-Seq , Análise de Célula Única , Transcrição GênicaRESUMO
Mammalian embryonic development is a complex process, characterized by intricate spatiotemporal dynamics and distinct chromatin preferences. However, the quick diversification in early embryogenesis leads to significant cellular diversity and the sparsity of scRNA-seq data, posing challenges in accurately determining cell fate decisions. In this study, we introduce a chromatin region binning method using scChrBin, designed to identify chromatin regions that elucidate the dynamics of embryonic development and lineage differentiation. This method transforms scRNA-seq data into a chromatin-based matrix, leveraging genomic annotations. Our results showed that the scChrBin method achieves high accuracy, with 98.0% and 89.2% on two single-cell embryonic datasets, demonstrating its effectiveness in analyzing complex developmental processes. We also systematically and comprehensively analysis of these key chromatin binning regions and their associated genes, focusing on their roles in lineage and stage development. The perspective of chromatin region binning method enables a comprehensive analysis of transcriptome data at the chromatin level, allowing us to unveil the dynamic expression of chromatin regions across temporal and spatial development. The tool is available as an application at https://github.com/liameihao/scChrBin.
Assuntos
Cromatina , Desenvolvimento Embrionário , Animais , Feminino , Gravidez , Cromatina/genética , Desenvolvimento Embrionário/genética , Diferenciação Celular/genética , Transcriptoma , Genoma , Perfilação da Expressão Gênica , Análise de Sequência de RNA , Mamíferos/genéticaRESUMO
The increasing emergence of the time-series single-cell RNA sequencing (scRNA-seq) data, inferring developmental trajectory by connecting transcriptome similar cell states (i.e., cell types or clusters) has become a major challenge. Most existing computational methods are designed for individual cells and do not take into account the available time series information. We present IDTI based on the Increment of Diversity for Trajectory Inference, which combines time series information and the minimum increment of diversity method to infer cell state trajectory of time-series scRNA-seq data. We apply IDTI to simulated and three real diverse tissue development datasets, and compare it with six other commonly used trajectory inference methods in terms of topology similarity and branching accuracy. The results have shown that the IDTI method accurately constructs the cell state trajectory without the requirement of starting cells. In the performance test, we further demonstrate that IDTI has the advantages of high accuracy and strong robustness.
RESUMO
The precise characterization of cellular differentiation potency remains an open question, which is fundamentally important for deciphering the dynamics mechanism related to cell fate transition. We quantitatively evaluated the differentiation potency of different stem cells based on the Hopfield neural network (HNN). The results emphasized that cellular differentiation potency can be approximated by Hopfield energy values. We then profiled the Waddington energy landscape of embryogenesis and cell reprogramming processes. The energy landscape at single-cell resolution further confirmed that cell fate decision is progressively specified in a continuous process. Moreover, the transition of cells from one steady state to another in embryogenesis and cell reprogramming processes was dynamically simulated on the energy ladder. These two processes can be metaphorized as the motion of descending and ascending ladders, respectively. We further deciphered the dynamics of the gene regulatory network (GRN) for driving cell fate transition. Our study proposes a new energy indicator to quantitatively characterize cellular differentiation potency without prior knowledge, facilitating the further exploration of the potential mechanism of cellular plasticity.
RESUMO
Single-cell studies have demonstrated that somatic cell reprogramming is a continuous process of cell fates transition. Only partial reprogramming intermediates can overcome the molecular bottlenecks to acquire pluripotency. To decipher the underlying decisive factors driving cell fate, we identified induced pluripotent stem cells or stromal-like cells (iPSCs/SLCs) and iPSCs or trophoblast-like cells (iPSCs/TLCs) fate bifurcations by reconstructing cellular trajectory. The mesenchymal-epithelial transition and the activation of pluripotency networks are the main molecular series in successful reprogramming. Correspondingly, intermediates diverge into SLCs accompanied by the inhibition of cell cycle genes and the activation of extracellular matrix genes, whereas the TLCs fate is characterized by the up-regulation of placenta development genes. Combining putative gene regulatory networks, seven (Taf7, Ezh2, Klf2, etc.) and three key factors (Cdc5l, Klf4, and Nanog) were individually identified as drivers of the successful reprogramming by triggering downstream pluripotent networks during iPSCs/SLCs and iPSCs/TLCs fate bifurcation. Conversely, 11 factors (Cebpb, Sox4, Junb, etc.) and four factors (Gata2, Jund, Ctnnb1, etc.) drive SLCs fate and TLCs fate, respectively. Our study sheds new light on the understanding of decisive factors driving cell fate, which is helpful for improving reprogramming efficiency through manipulating cell fates to avoid alternative fates.
RESUMO
BACKGROUND: The placenta, as a unique exchange organ between mother and fetus, is essential for successful human pregnancy and fetal health. Preeclampsia (PE) caused by placental dysfunction contributes to both maternal and infant morbidity and mortality. Accurate identification of PE patients plays a vital role in the formulation of treatment plans. However, the traditional clinical methods of PE have a high misdiagnosis rate. RESULTS: Here, we first designed a computational biology method that used single-cell transcriptome (scRNA-seq) of healthy pregnancy (38 wk) and early-onset PE (28-32 wk) to identify pathological cell subpopulations and predict PE risk. Based on machine learning methods and feature selection techniques, we observed that the Tuning ReliefF (TURF) score hybrid with XGBoost (TURF_XGB) achieved optimal performance, with 92.61% accuracy and 92.46% recall for classifying nine cell subpopulations of healthy placentas. Biological landscapes of placenta heterogeneity could be mapped by the 110 marker genes screened by TURF_XGB, which revealed the superiority of the TURF feature mining. Moreover, we processed the PE dataset with LASSO to obtain 497 biomarkers. Integration analysis of the above two gene sets revealed that dendritic cells were closely associated with early-onset PE, and C1QB and C1QC might drive preeclampsia by mediating inflammation. In addition, an ensemble model-based risk stratification card was developed to classify preeclampsia patients, and its area under the receiver operating characteristic curve (AUC) could reach 0.99. For broader accessibility, we designed an accessible online web server ( http://bioinfor.imu.edu.cn/placenta ). CONCLUSION: Single-cell transcriptome-based preeclampsia risk assessment using an ensemble machine learning framework is a valuable asset for clinical decision-making. C1QB and C1QC may be involved in the development and progression of early-onset PE by affecting the complement and coagulation cascades pathway that mediate inflammation, which has important implications for better understanding the pathogenesis of PE.
RESUMO
DNMT3A/B and TET1 play indispensable roles in regulating DNA methylation that undergoes extensive reprogramming during mammalian embryogenesis. Yet the competitive and cooperative relationships between TET1 and DNMT3A/B remain largely unknown in the human embryonic stem cells. Here, we revealed that the main DNA-binding domain of TET1 contains more positive charges by using charge reduction of amino acid alphabet, followed by DNMT3A and DNMT3B. The genome-wide binding profiles showed that TET1 prefers binding to the proximal promoters and CpG islands compared with DNMT3A/B. Moreover, the binding regions of these three transcription factors can be divided into specific and co-binding regions. And a stronger inhibitory effect of DNMT3A on TET1 demethylation was observed in co-binding regions. Furthermore, we integrated TET1 knockout data to further discuss the competitive binding patterns of TET1 and DNMT3A/B. The lack of TET1 increased the occupation of DNMT3A/B at the specific binding regions of TET1 causing focal hypermethylation. The knockout of TET1 was also accompanied by a reduction of DNMT3A/B binding in the co-binding regions, further confirming the cooperative binding function between TET1 and DNMT3A/B. In conclusion, our studies found that the competitive binding of TET1 and DNMT3A/B cooperatively shapes the global DNA methylation pattern in human embryonic stem cells.