RESUMEN
A powerful feature of single-cell genomics is the possibility of identifying cell types from their molecular profiles. In particular, identifying novel rare cell types and their marker genes is a key potential of single-cell RNA sequencing. Standard clustering approaches perform well in identifying relatively abundant cell types, but tend to miss rarer cell types. Here, we have developed CIARA (Cluster Independent Algorithm for the identification of markers of RAre cell types), a cluster-independent computational tool designed to select genes that are likely to be markers of rare cell types. Genes selected by CIARA are subsequently integrated with common clustering algorithms to single out groups of rare cell types. CIARA outperforms existing methods for rare cell type detection, and we use it to find previously uncharacterized rare populations of cells in a human gastrula and among mouse embryonic stem cells treated with retinoic acid. Moreover, CIARA can be applied more generally to any type of single-cell omic data, thus allowing the identification of rare cells across multiple data modalities. We provide implementations of CIARA in user-friendly packages available in R and Python.
Asunto(s)
Algoritmos , Análisis de la Célula Individual , Animales , Humanos , Ratones , Análisis de Secuencia de ARN/métodos , Análisis por Conglomerados , Análisis de la Célula Individual/métodos , Perfilación de la Expresión Génica/métodosRESUMEN
Advances in single-cell RNA sequencing provide an unprecedented window into cellular identity. The abundance of data requires new theoretical and computational frameworks to analyze the dynamics of differentiation and integrate knowledge from cell atlases. We present 'single-cell Type Order Parameters' (scTOP): a statistical, physics-inspired approach for quantifying cell identity given a reference basis of cell types. scTOP can accurately classify cells, visualize developmental trajectories and assess the fidelity of engineered cells. Importantly, scTOP does this without feature selection, statistical fitting or dimensional reduction (e.g. uniform manifold approximation and projection, principle components analysis, etc.). We illustrate the power of scTOP using human and mouse datasets. By reanalyzing mouse lung data, we characterize a transient hybrid alveolar type 1/alveolar type 2 cell population. Visualizations of lineage tracing hematopoiesis data using scTOP confirm that a single clone can give rise to multiple mature cell types. We assess the transcriptional similarity between endogenous and donor-derived cells in the context of murine pulmonary cell transplantation. Our results suggest that physics-inspired order parameters can be an important tool for understanding differentiation and characterizing engineered cells. scTOP is available as an easy-to-use Python package.
Asunto(s)
Pulmón , Análisis de la Célula Individual , Animales , Humanos , Ratones , Diferenciación Celular/genética , Análisis de la Célula Individual/métodos , Análisis de Secuencia de ARN/métodosRESUMEN
Microbial biochemistry is central to the pathophysiology of inflammatory bowel diseases (IBD). Improved knowledge of microbial metabolites and their immunomodulatory roles is thus necessary for diagnosis and management. Here, we systematically analyzed the chemical, ecological, and epidemiological properties of ~82k metabolic features in 546 Integrative Human Microbiome Project (iHMP/HMP2) metabolomes, using a newly developed methodology for bioactive compound prioritization from microbial communities. This suggested >1000 metabolic features as potentially bioactive in IBD and associated ~43% of prevalent, unannotated features with at least one well-characterized metabolite, thereby providing initial information for further characterization of a significant portion of the fecal metabolome. Prioritized features included known IBD-linked chemical families such as bile acids and short-chain fatty acids, and less-explored bilirubin, polyamine, and vitamin derivatives, and other microbial products. One of these, nicotinamide riboside, reduced colitis scores in DSS-treated mice. The method, MACARRoN, is generalizable with the potential to improve microbial community characterization and provide therapeutic candidates.
Asunto(s)
Colitis , Enfermedades Inflamatorias del Intestino , Humanos , Animales , Ratones , Enfermedades Inflamatorias del Intestino/tratamiento farmacológico , Enfermedades Inflamatorias del Intestino/metabolismo , Metaboloma , Ácidos y Sales BiliaresRESUMEN
Chromatin loop is of crucial importance for the regulation of gene transcription. Cohesin is a type of chromatin-associated protein that mediates the interaction of chromatin through the loop extrusion. Cohesin-mediated chromatin interactions have strong cell-type specificity, posing a challenge for predicting chromatin loops. Existing computational methods perform poorly in predicting cell-type-specific chromatin loops. To address this issue, we propose a random forest model to predict cell-type-specific cohesin-mediated chromatin loops based on chromatin states identified by ChromHMM and the occupancy of related factors. Our results show that chromatin state is responsible for cell-type-specificity of loops. Using only chromatin states as features, the model achieved high accuracy in predicting cell-type-specific loops between two cell types and can be applied to different cell types. Furthermore, when chromatin states are combined with the occurrence frequency of CTCF, RAD21, YY1, and H3K27ac ChIP-seq peaks, more accurate prediction can be achieved. Our feature extraction method provides novel insights into predicting cell-type-specific chromatin loops and reveals the relationship between chromatin state and chromatin loop formation.
Asunto(s)
Factor de Unión a CCCTC , Proteínas de Ciclo Celular , Cromatina , Proteínas Cromosómicas no Histona , Cohesinas , Proteínas Cromosómicas no Histona/metabolismo , Proteínas Cromosómicas no Histona/genética , Proteínas de Ciclo Celular/metabolismo , Proteínas de Ciclo Celular/genética , Cromatina/metabolismo , Cromatina/genética , Humanos , Factor de Unión a CCCTC/metabolismo , Factor de Unión a CCCTC/genética , Factor de Transcripción YY1/metabolismo , Factor de Transcripción YY1/genética , Proteínas Nucleares/metabolismo , Proteínas Nucleares/genética , Biología Computacional/métodos , Proteínas de Unión al ADN/metabolismo , Proteínas de Unión al ADN/genética , Histonas/metabolismo , Histonas/genética , Fosfoproteínas/metabolismo , Fosfoproteínas/genética , Secuenciación de Inmunoprecipitación de Cromatina/métodosRESUMEN
Pedigree inference from genotype data is a challenging problem, particularly when pedigrees are sparsely sampled and individuals may be distantly related to their closest genotyped relatives. We present a method that infers small pedigrees of close relatives and then assembles them into larger pedigrees. To assemble large pedigrees, we introduce several formulas and tools including a likelihood for the degree separating two small pedigrees, a generalization of the fast DRUID point estimate of the degree separating two pedigrees, a method for detecting individuals who share background identity-by-descent (IBD) that does not reflect recent common ancestry, and a method for identifying the ancestral branches through which distant relatives are connected. Our method also takes several approaches that help to improve the accuracy and efficiency of pedigree inference. In particular, we incorporate age information directly into the likelihood rather than using ages only for consistency checks and we employ a heuristic branch-and-bound-like approach to more efficiently explore the space of possible pedigrees. Together, these approaches make it possible to construct large pedigrees that are challenging or intractable for current inference methods.
Asunto(s)
Genotipo , Linaje , Algoritmos , Femenino , Humanos , Funciones de Verosimilitud , Masculino , Modelos GenéticosRESUMEN
Optimal methods could effectively improve the accuracy of predicting and identifying candidate driver genes. Various computational methods based on mutational frequency, network and function approaches have been developed to identify mutation driver genes in cancer genomes. However, a comprehensive evaluation of the performance levels of network-, function- and frequency-based methods is lacking. In the present study, we assessed and compared eight performance criteria for eight network-based, one function-based and three frequency-based algorithms using eight benchmark datasets. Under different conditions, the performance of approaches varied in terms of network, measurement and sample size. The frequency-based driverMAPS and network-based HotNet2 methods showed the best overall performance. Network-based algorithms using protein-protein interaction networks outperformed the function- and the frequency-based approaches. Precision, F1 score and Matthews correlation coefficient were low for most approaches. Thus, most of these algorithms require stringent cutoffs to correctly distinguish driver and non-driver genes. We constructed a website named Cancer Driver Catalog (http://159.226.67.237/sun/cancer_driver/), wherein we integrated the gene scores predicted by the foregoing software programs. This resource provides valuable guidance for cancer researchers and clinical oncologists prioritizing cancer driver gene candidates by using an optimal tool.
Asunto(s)
Neoplasias , Oncogenes , Algoritmos , Biología Computacional/métodos , Redes Reguladoras de Genes , Humanos , Mutación , Neoplasias/genética , Programas InformáticosRESUMEN
A newly invented post-translational modification (PTM), phosphoglycerylation, has shown its essential role in the construction and functional properties of proteins and dangerous human diseases. Hence, it is very urgent to know about the molecular mechanism behind the phosphoglycerylation process to develop the drugs for related diseases. But accurately identifying of phosphoglycerylation site from a protein sequence in a laboratory is a very difficult and challenging task. Hence, the construction of an efficient computation model is greatly sought for this purpose. A little number of computational models are currently available for identifying the phosphoglycerylation sites, which are not able to reach their prediction capability at a satisfactory level. Therefore, an effective predictor named PLP_FS has been designed and constructed to identify phosphoglycerylation sites in this study. For the training purpose, an optimal number of feature sets was obtained by fusion of multiple F_Score feature selection techniques from the features generated by three types of sequence-based feature extraction methods and fitted with the support vector machine classification technique to the prediction model. On the other hand, the k-neighbor near cleaning and SMOTE methods were also implemented to balance the benchmark dataset. The suggested model in 10-fold cross-validation obtained an accuracy of 99.22%, a sensitivity of 98.17% and a specificity of 99.75% according to the experimental findings, which are better than other currently available predictors for accurately identifying the phosphoglycerylation sites.
Asunto(s)
Lisina , Máquina de Vectores de Soporte , Algoritmos , Secuencia de Aminoácidos , Biología Computacional/métodos , Humanos , Lisina/metabolismo , Procesamiento Proteico-Postraduccional , Proteínas/metabolismoRESUMEN
Tumor homing peptides (THPs) have a distinctive capacity to specifically attach to tumor cells, providing a promising approach for targeted cancer treatment and detection. Although THPs have the potential for significant impact, their detection by conventional methods is both time-consuming and expensive. To tackle this issue, we provide LLM4THP, an innovative computational approach that utilizes large language models (LLMs) to quickly and effectively detect THPs. LLM4THP utilizes two protein LLMs, ESM2 and Prot_T5_XL_UniRef50, to encode peptide sequences. This allows for the capture of complex patterns and relationships within the peptide data. In addition, we utilize inherent sequence characteristics such as Amino Acid Composition (AAC), Pseudo Amino Acid Composition (PAAC), Amphiphilic Pseudo Amino Acid Composition (APAAC), and Composition, Transition, and Distribution (CTD) to improve the representation of peptides. The RDKitDescriptors feature representation approach transforms peptide sequences into molecular objects and computes chemical characteristics, resulting in enhanced THP identification. The LLM4THP ensemble strategy incorporates various features into a two-layer learning architecture. The first layer consists of LightGBM, XGBoost, Random Forest, and Extremely Randomized Trees, which generate a set of meta results. The second layer utilizes Logistic Regression to further refine the identification of sequences as either THP or non-THP. LLM4THP exhibits exceptional performance compared to the most advanced methods, showcasing enhancements in accuracy, Matthew's correlation coefficient, F1 score, area under the curve, and average precision. The source code and dataset can be accessed at the following URL: https://github.com/abcair/LLM4THP.
Asunto(s)
Péptidos , Humanos , Péptidos/química , Neoplasias/metabolismo , Secuencia de Aminoácidos , Programas Informáticos , Biología Computacional/métodos , AlgoritmosRESUMEN
This study evaluates the corrosion inhibition capabilities of two novel hydrazone derivatives, (E)-2-(5-methoxy-2-methyl-1H-indol-3-yl)-N'-(4-methylbenzylidene)acetohydrazide (MeHDZ) and (E)-N'-benzylidene-2-(5-methoxy-2-methyl-1H-indol-3-yl)acetohydrazide (HHDZ), on carbon steel in a 15 wt.% HCl solution. A comprehensive suite of analytical techniques, including gravimetric analysis, potentiodynamic polarization (PDP), electrochemical impedance spectroscopy (EIS), and scanning electron microscopy (SEM), demonstrates their significant inhibition efficiency. At an optimal concentration of 5 × 10-3 mol/L, MeHDZ and HHDZ achieve remarkable inhibition efficiencies of 98% and 94%, respectively. EIS measurements reveal a dramatic reduction in effective double-layer capacitance (from 236.2 to 52.8 and 75.3 µF/cm2), strongly suggesting inhibitor adsorption on the steel surface. This effect is further corroborated by an increase in polarization resistance and a significant decrease in corrosion current density at optimal concentrations. Moreover, these inhibitors demonstrate sustained corrosion mitigation over extended exposure durations and maintain effectiveness even under elevated temperatures, highlighting their potential for diverse operational conditions. The adsorption process of these inhibitors aligns well with the Langmuir adsorption isotherm, implying physicochemical interactions at the carbon steel surface. Density functional tight-binding (DFTB) calculations and molecular dynamics simulations provide insights into the inhibitor-surface interaction mechanism, further elucidating the potential of these hydrazone derivatives as highly effective corrosion inhibitors in acidic environments.
RESUMEN
During pre-mRNA maturation 3' end processing can occur at different polyadenylation sites in the 3' untranslated region (3' UTR) to give rise to transcript isoforms that differ in the length of their 3' UTRs. Longer 3' UTRs contain additional cis-regulatory elements that impact the fate of the transcript and/or of the resulting protein. Extensive alternative polyadenylation (APA) has been observed in cancers, but the mechanisms and roles remain elusive. In particular, it is unclear whether the APA occurs in the malignant cells or in other cell types that infiltrate the tumor. To resolve this, we developed a computational method, called SCUREL, that quantifies changes in 3' UTR length between groups of cells, including cells of the same type originating from tumor and control tissue. We used this method to study APA in human lung adenocarcinoma (LUAD). SCUREL relies solely on annotated 3' UTRs and on control systems such as T cell activation, and spermatogenesis gives qualitatively similar results at much greater sensitivity compared to the previously published scAPA method. In the LUAD samples, we find a general trend toward 3' UTR shortening not only in cancer cells compared to the cell type of origin, but also when comparing other cell types from the tumor vs. the control tissue environment. However, we also find high variability in the individual targets between patients. The findings help in understanding the extent and impact of APA in LUAD, which may support improvements in diagnosis and treatment.
Asunto(s)
Regiones no Traducidas 3'/genética , Adenocarcinoma del Pulmón/patología , Regulación Neoplásica de la Expresión Génica , Neoplasias Pulmonares/patología , Poliadenilación , ARN Mensajero/metabolismo , Adenocarcinoma del Pulmón/genética , Estudios de Casos y Controles , Humanos , Neoplasias Pulmonares/genética , Isoformas de Proteínas , ARN Mensajero/genéticaRESUMEN
MOTIVATION: Computational methods accelerate drug discovery and play an important role in biomedicine, such as molecular property prediction and compound-protein interaction (CPI) identification. A key challenge is to learn useful molecular representation. In the early years, molecular properties are mainly calculated by quantum mechanics or predicted by traditional machine learning methods, which requires expert knowledge and is often labor-intensive. Nowadays, graph neural networks have received significant attention because of the powerful ability to learn representation from graph data. Nevertheless, current graph-based methods have some limitations that need to be addressed, such as large-scale parameters and insufficient bond information extraction. RESULTS: In this study, we proposed a graph-based approach and employed a novel triplet message mechanism to learn molecular representation efficiently, named triplet message networks (TrimNet). We show that TrimNet can accurately complete multiple molecular representation learning tasks with significant parameter reduction, including the quantum properties, bioactivity, physiology and CPI prediction. In the experiments, TrimNet outperforms the previous state-of-the-art method by a significant margin on various datasets. Besides the few parameters and high prediction accuracy, TrimNet could focus on the atoms essential to the target properties, providing a clear interpretation of the prediction tasks. These advantages have established TrimNet as a powerful and useful computational tool in solving the challenging problem of molecular representation learning. AVAILABILITY: The quantum and drug datasets are available on the website of MoleculeNet: http://moleculenet.ai. The source code is available in GitHub: https://github.com/yvquanli/trimnet. CONTACT: xjyao@lzu.edu.cn, songsen@tsinghua.edu.cn.
Asunto(s)
Descubrimiento de Drogas , Aprendizaje Automático , Programas InformáticosRESUMEN
BACKGROUND: This research extends prior studies by the Finnish Institute for Health and Welfare on pandemic-related risk perception, concentrating on the role of trust in health authorities and its impact on public health outcomes. OBJECTIVE: The paper aims to investigate variations in trust levels over time and across social media platforms, as well as to further explore 12 subcategories of political mistrust. It seeks to understand the dynamics of political trust, including mistrust accumulation, fluctuations over time, and changes in topic relevance. Additionally, the study aims to compare qualitative research findings with those obtained through computational methods. METHODS: Data were gathered from a large-scale data set consisting of 13,629 Twitter and Facebook posts from 2020 to 2023 related to COVID-19. For analysis, a fine-tuned FinBERT model with an 80% accuracy rate was used for predicting political mistrust. The BERTopic model was also used for superior topic modeling performance. RESULTS: Our preliminary analysis identifies 43 mistrust-related topics categorized into 9 major themes. The most salient topics include COVID-19 mortality, coping strategies, polymerase chain reaction testing, and vaccine efficacy. Discourse related to mistrust in authority is associated with perceptions of disease severity, willingness to adopt health measures, and information-seeking behavior. Our findings highlight that the distinct user engagement mechanisms and platform features of Facebook and Twitter contributed to varying patterns of mistrust and susceptibility to misinformation during the pandemic. CONCLUSIONS: The study highlights the effectiveness of computational methods like natural language processing in managing large-scale engagement and misinformation. It underscores the critical role of trust in health authorities for effective risk communication and public compliance. The findings also emphasize the necessity for transparent communication from authorities, concluding that a holistic approach to public health communication is integral for managing health crises effectively.
Asunto(s)
COVID-19 , Medios de Comunicación Sociales , Humanos , Pandemias , Conducta en la Búsqueda de Información , COVID-19/prevención & control , Análisis de DatosRESUMEN
BACKGROUND: Selection of optimal computational strategies for analyzing metagenomics data is a decisive step in determining the microbial composition of a sample, and this procedure is complex because of the numerous tools currently available. The aim of this research was to summarize the results of crowdsourced sbv IMPROVER Microbiomics Challenge designed to evaluate the performance of off-the-shelf metagenomics software as well as to investigate the robustness of these results by the extended post-challenge analysis. In total 21 off-the-shelf taxonomic metagenome profiling pipelines were benchmarked for their capacity to identify the microbiome composition at various taxon levels across 104 shotgun metagenomics datasets of bacterial genomes (representative of various microbiome samples) from public databases. Performance was determined by comparing predicted taxonomy profiles with the gold standard. RESULTS: Most taxonomic profilers performed homogeneously well at the phylum level but generated intermediate and heterogeneous scores at the genus and species levels, respectively. kmer-based pipelines using Kraken with and without Bracken or using CLARK-S performed best overall, but they exhibited lower precision than the two marker-gene-based methods MetaPhlAn and mOTU. Filtering out the 1% least abundance species-which were not reliably predicted-helped increase the performance of most profilers by increasing precision but at the cost of recall. However, the use of adaptive filtering thresholds determined from the sample's Shannon index increased the performance of most kmer-based profilers while mitigating the tradeoff between precision and recall. CONCLUSIONS: kmer-based metagenomic pipelines using Kraken/Bracken or CLARK-S performed most robustly across a large variety of microbiome datasets. Removing non-reliably predicted low-abundance species by using diversity-dependent adaptive filtering thresholds further enhanced the performance of these tools. This work demonstrates the applicability of computational pipelines for accurately determining taxonomic profiles in clinical and environmental contexts and exemplifies the power of crowdsourcing for unbiased evaluation.
Asunto(s)
Colaboración de las Masas , Metagenoma , Benchmarking , Metagenómica/métodos , Programas InformáticosRESUMEN
5-Methylcytosine (m5C) plays an extremely important role in the basic biochemical process. With the great increase of identified m5C sites in a wide variety of organisms, their epigenetic roles become largely unknown. Hence, accurate identification of m5C site is a key step in understanding its biological functions. Over the past several years, more attentions have been paid on the identification of m5C sites in multiple species. In this work, we firstly summarized the current progresses in computational prediction of m5C sites and then constructed a more powerful and reliable model for identifying m5C sites. To train the model, we collected experimentally confirmed m5C data from Homo sapiens, Mus musculus, Saccharomyces cerevisiae and Arabidopsis thaliana, and compared the performances of different feature extraction methods and classification algorithms for optimizing prediction model. Based on the optimal model, a novel predictor called iRNA-m5C was developed for the recognition of m5C sites. Finally, we critically evaluated the performance of iRNA-m5C and compared it with existing methods. The result showed that iRNA-m5C could produce the best prediction performance. We hope that this paper could provide a guide on the computational identification of m5C site and also anticipate that the proposed iRNA-m5C will become a powerful tool for large scale identification of m5C sites.
Asunto(s)
5-Metilcitosina/metabolismo , Biología Computacional/métodos , Algoritmos , Animales , Arabidopsis/metabolismo , Conjuntos de Datos como Asunto , Humanos , Ratones , Saccharomyces cerevisiae/metabolismoRESUMEN
(1) Background: Cooperative Intelligent Transportation Systems (C-ITS) will soon operate using 5G New-Radio (NR) wireless communication, overcoming the limitations of the current V2X (Vehicle-to-Everything) wireless communication technologies and increasing road-safety and driving efficiency. These innovations will also change the RF exposure levels of pedestrians and road-users in general. These people, in fact, will be exposed to additional RF sources coming from nearby cars and from the infrastructure. Therefore, an exposure assessment of people in the proximity of a connected car is necessary and urgent. (2) Methods: Two array antennas for 5G-V2X communication at 3.5 GHz were modelled and mounted on a realistic 3D car model for evaluating the exposure levels of a human model representing people on the road near the car. Computational simulations were conducted using the FDTD solver implemented in the Sim4Life platform; different positions and orientations between the car and the human model were assessed. The analyzed quantities were the Specific Absorption Rate on the whole body (SARwb), averaged over 10 g (SAR10g) in specific tissues, as indicated in the ICNIRP guidelines. (3) Results: the data showed that the highest exposure levels were obtained mostly in the head area of the human model, with the highest peak obtained in the configuration where the main beam of the 5G-V2X antennas was more direct towards the human model. Moreover, in all configurations, the dose absorbed by a pedestrian was well below the ICNIRP guidelines to avoid harmful effects. (4) Conclusions: This work is the first study on human exposure assessment in a 5G-V2X scenario, and it expands the knowledge about the exposure levels for the forthcoming use of 5G in connected vehicles.
Asunto(s)
Conducción de Automóvil , Peatones , Automóviles , Humanos , Tecnología InalámbricaRESUMEN
Daily work in molecular biology presently depends on a large number of computational tools. An in-depth, large-scale study of that 'ecosystem' of Web tools, its characteristics, interconnectivity, patterns of usage/citation, temporal evolution and rate of decay is crucial for understanding the forces that shape it and for informing initiatives aimed at its funding, long-term maintenance and improvement. In particular, the long-term maintenance of these tools is compromised because of their specific development model. Hundreds of published studies become irreproducible de facto, as the software tools used to conduct them become unavailable. In this study, we present a large-scale survey of >5400 publications describing Web servers within the two main bibliographic resources for disseminating new software developments in molecular biology. For all these servers, we studied their citation patterns, the subjects they address, their citation networks and the temporal evolution of these factors. We also analysed how these factors affect the availability of these servers (whether they are alive). Our results show that this ecosystem of tools is highly interconnected and adapts to the 'trendy' subjects in every moment. The servers present characteristic temporal patterns of citation/usage, and there is a worrying rate of server 'death', which is influenced by factors such as the server popularity and the institutions that hosts it. These results can inform initiatives aimed at the long-term maintenance of these resources.
Asunto(s)
Biología Molecular/estadística & datos numéricos , Programas Informáticos , Biología Computacional/métodos , Biología Computacional/tendencias , Internet , Biología Molecular/tendencias , Publicaciones Periódicas como Asunto/estadística & datos numéricos , Programas Informáticos/tendenciasRESUMEN
PURPOSE: The coronal Cobb angle is commonly used for assessing the adolescent idiopathic scoliosis (AIS); however, it may underestimate the severity of AIS while the plane of maximum curvature (PMC) could be a promising descriptor for three-dimensional assessment of AIS. This study aimed to develop a computational method (CM) for estimating the PMC based on the coronal and sagittal images of the spine, and to verify the results with computed tomography (CT). METHODS: Twenty-eight thoracic and 24 lumbar curves from 30 subjects with AIS were analysed. For the CM, PMC was estimated via identifying the two corner points at the superior endplate of upper-end vertebra and the inferior endplate of lower-end vertebra in the coronal and sagittal CT images separately (eight corner points in total). For the CT, PMC was determined through rotating the spine images axially until the maximum Cobb angle was found. Intraclass correlation coefficient (ICC), Bland-Altman method and linear regression analysis were used for the statistical analyses. RESULTS: The high ICC values (intra- > 0.91; inter- > 0.84) suggested very good intra- and inter-rater reliability of the CM in PMC estimation. The high ICC values (> 0.91) and assessment of Bland-Altman method demonstrated a good agreement between the PMC acquired using the CM and CT. The generated linear regression equations (R2 > 0.69) could allow to estimate the PMC (originally measured through the CT) via the CM. CONCLUSION: The developed computational method could estimate reliable and valid PMC for the patients with AIS, and become feasible for three-dimensional assessment of AIS. LEVEL OF EVIDENCE: III.
Asunto(s)
Cifosis , Escoliosis , Adolescente , Humanos , Reproducibilidad de los Resultados , Escoliosis/diagnóstico por imagen , Columna Vertebral , Tomografía Computarizada por Rayos XRESUMEN
OBJECTIVES: Preeclampsia is a dangerous pregnancy complication. The source of preeclampsia is unknown, though the placenta is believed to have a central role in its pathogenesis. An association between maternal infection and preeclampsia has been demonstrated, yet the involvement of the placental microbiome in the etiology of preeclampsia has not been determined. In this study, we examined whether preeclampsia is associated with an imbalanced microorganism composition in the placenta. METHODS: To this end, we developed a novel method for the identification of bacteria/viruses based on sequencing of small non-coding RNA, which increases the microorganism-to-host ratio, this being a major challenge in microbiome methods. We validated the method on various infected tissues and demonstrated its efficiency in detecting microorganisms in samples with extremely low bacterial/viral biomass. We then applied the method to placenta specimens from preeclamptic and healthy pregnancies. Since the placenta is a remarkably large and heterogeneous organ, we explored the bacterial and viral RNA at each of 15 distinct locations. RESULTS: Bacterial RNA was detected at all locations and was consistent with previous studies of the placental microbiome, though without significant differences between the preeclampsia and control groups. Nevertheless, the bacterial RNA composition differed significantly between various areas of the placenta. Viral RNA was detected in extremely low quantities, below the threshold of significance, thus viral abundance could not be determined. CONCLUSIONS: Our results suggest that the bacterial and viral abundance in the placenta may have only limited involvement in the pathogenesis of preeclampsia. The evidence of a heterogenic bacterial RNA composition in the various placental locations warrants further investigation to capture the true nature of the placental microbiome.
Asunto(s)
Microbiota/genética , Placenta/microbiología , Preeclampsia , ARN Bacteriano , ARN Viral , Análisis de Secuencia de ARN , Adulto , Bacterias/clasificación , Bacterias/aislamiento & purificación , Correlación de Datos , Femenino , Humanos , Evaluación de Resultado en la Atención de Salud , Placenta/patología , Preeclampsia/sangre , Preeclampsia/diagnóstico , Preeclampsia/microbiología , Embarazo , ARN Bacteriano/análisis , ARN Bacteriano/aislamiento & purificación , ARN no Traducido/análisis , ARN no Traducido/aislamiento & purificación , ARN Viral/análisis , ARN Viral/aislamiento & purificación , Reproducibilidad de los Resultados , Análisis de Secuencia de ARN/métodos , Análisis de Secuencia de ARN/estadística & datos numéricos , Manejo de Especímenes/métodosRESUMEN
BACKGROUND: Annotation of cell identity is an essential process in neuroscience that allows comparison of cells, including that of neural activities across different animals. In Caenorhabditis elegans, although unique identities have been assigned to all neurons, the number of annotatable neurons in an intact animal has been limited due to the lack of quantitative information on the location and identity of neurons. RESULTS: Here, we present a dataset that facilitates the annotation of neuronal identities, and demonstrate its application in a comprehensive analysis of whole-brain imaging. We systematically identified neurons in the head region of 311 adult worms using 35 cell-specific promoters and created a dataset of the expression patterns and the positions of the neurons. We found large positional variations that illustrated the difficulty of the annotation task. We investigated multiple combinations of cell-specific promoters driving distinct fluorescence and generated optimal strains for the annotation of most head neurons in an animal. We also developed an automatic annotation method with human interaction functionality that facilitates annotations needed for whole-brain imaging. CONCLUSION: Our neuron ID dataset and optimal fluorescent strains enable the annotation of most neurons in the head region of adult C. elegans, both in full-automated fashion and a semi-automated version that includes human interaction functionalities. Our method can potentially be applied to model species used in research other than C. elegans, where the number of available cell-type-specific promoters and their variety will be an important consideration.
Asunto(s)
Encéfalo/fisiología , Caenorhabditis elegans/fisiología , Neuronas/fisiología , Animales , Conjuntos de Datos como AsuntoRESUMEN
Molecularly imprinted polymer (MIP) computational design is expected to become a routine technique prior to synthesis to produce polymers with high affinity and selectivity towards target molecules. Furthermore, using these simulations reduces the cost of optimizing polymerization composition. There are several computational methods used in MIP fabrication and each requires a comprehensive study in order to select a process with results that are most similar to properties exhibited by polymers synthesized through laboratory experiments. Until now, no review has linked computational strategies with experimental results, which are needed to determine the method that is most appropriate for use in designing MIP with high molecular recognition. This review will present an update of the computational approaches started from 2016 until now on quantum mechanics, molecular mechanics and molecular dynamics that have been widely used. It will also discuss the linear correlation between computational results and the polymer performance tests through laboratory experiments to examine to what extent these methods can be relied upon to obtain polymers with high molecular recognition. Based on the literature search, density functional theory (DFT) with various hybrid functions and basis sets is most often used as a theoretical method to provide a shorter MIP manufacturing process as well as good analytical performance as recognition material.