Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
Add more filters











Publication year range
1.
Front Plant Sci ; 15: 1437118, 2024.
Article in English | MEDLINE | ID: mdl-39372861

ABSTRACT

Introduction: Single-cell RNA-seq (scRNA-seq) technologies have been widely used to reveal the diversity and complexity of cells, and pioneering studies on scRNA-seq in plants began to emerge since 2019. However, existing studies on plants utilized scRNA-seq focused only on the gene expression regulation. As an essential post-transcriptional mechanism for regulating gene expression, alternative polyadenylation (APA) generates diverse mRNA isoforms with distinct 3' ends through the selective use of different polyadenylation sites in a gene. APA plays important roles in regulating multiple developmental processes in plants, such as flowering time and stress response. Methods: In this study, we developed a pipeline to identify and integrate APA sites from different scRNA-seq data and analyze APA dynamics in single cells. First, high-confidence poly(A) sites in single root cells were identified and quantified. Second, three kinds of APA markers were identified for exploring APA dynamics in single cells, including differentially expressed poly(A) sites based on APA site expression, APA markers based on APA usages, and APA switching genes based on 3' UTR (untranslated region) length change. Moreover, cell type annotations of single root cells were refined by integrating both the APA information and the gene expression profile. Results: We comprehensively compiled a single-cell APA atlas from five scRNA-seq studies, covering over 150,000 cells spanning four major tissue branches, twelve cell types, and three developmental stages. Moreover, we quantified the dynamic APA usages in single cells and identified APA markers across tissues and cell types. Further, we integrated complementary information of gene expression and APA profiles to annotate cell types and reveal subtle differences between cell types. Discussion: This study reveals that APA provides an additional layer of information for determining cell identity and provides a landscape of APA dynamics during Arabidopsis root development.

5.
Front Immunol ; 14: 1278534, 2023.
Article in English | MEDLINE | ID: mdl-38124749

ABSTRACT

The application of B-cell epitope identification to develop therapeutic antibodies and vaccine candidates is well established. However, the validation of epitopes is time-consuming and resource-intensive. To alleviate this, in recent years, multiple computational predictors have been developed in the immunoinformatics community. Brewpitopes is a pipeline that curates bioinformatic B-cell epitope predictions obtained by integrating different state-of-the-art tools. We used additional computational predictors to account for subcellular location, glycosylation status, and surface accessibility of the predicted epitopes. The implementation of these sets of rational filters optimizes in vivo antibody recognition properties of the candidate epitopes. To validate Brewpitopes, we performed a proteome-wide analysis of SARS-CoV-2 with a particular focus on S protein and its variants of concern. In the S protein, we obtained a fivefold enrichment in terms of predicted neutralization versus the epitopes identified by individual tools. We analyzed epitope landscape changes caused by mutations in the S protein of new viral variants that were linked to observed immune escape evidence in specific strains. In addition, we identified a set of epitopes with neutralizing potential in four SARS-CoV-2 proteins (R1AB, R1A, AP3A, and ORF9C). These epitopes and antigenic proteins are conserved targets for viral neutralization studies. In summary, Brewpitopes is a powerful pipeline that refines B-cell epitope bioinformatic predictions during public health emergencies in a high-throughput capacity to facilitate the optimization of experimental validation of therapeutic antibodies and candidate vaccines.


Subject(s)
Epitopes, B-Lymphocyte , Viral Vaccines , Humans , Epitopes, B-Lymphocyte/genetics , Epitopes, T-Lymphocyte , Emergencies , Public Health , SARS-CoV-2
7.
Front Immunol ; 14: 1158295, 2023.
Article in English | MEDLINE | ID: mdl-36993970

ABSTRACT

Unlike conventional major histocompatibility complex (MHC) class I and II molecules reactive T cells, the unconventional T cell subpopulations recognize various non-polymorphic antigen-presenting molecules and are typically characterized by simplified patterns of T cell receptors (TCRs), rapid effector responses and 'public' antigen specificities. Dissecting the recognition patterns of the non-MHC antigens by unconventional TCRs can help us further our understanding of the unconventional T cell immunity. The small size and irregularities of the released unconventional TCR sequences are far from high-quality to support systemic analysis of unconventional TCR repertoire. Here we present UcTCRdb, a database that contains 669,900 unconventional TCRs collected from 34 corresponding studies in humans, mice, and cattle. In UcTCRdb, users can interactively browse TCR features of different unconventional T cell subsets in different species, search and download sequences under different conditions. Additionally, basic and advanced online TCR analysis tools have been integrated into the database, which will facilitate the study of unconventional TCR patterns for users with different backgrounds. UcTCRdb is freely available at http://uctcrdb.cn/.


Subject(s)
Histocompatibility Antigens Class I , Receptors, Antigen, T-Cell , Humans , Animals , Mice , Cattle , T-Lymphocyte Subsets , Antigens , Databases, Nucleic Acid
8.
Front Bioinform ; 3: 1332902, 2023.
Article in English | MEDLINE | ID: mdl-38259432

ABSTRACT

No-boundary thinking enables the scientific community to reflect in a thoughtful manner and discover new opportunities, create innovative solutions, and break through barriers that might have otherwise constrained their progress. This concept encourages thinking without being confined by traditional rules, limitations, or established norms, and a mindset that is not limited by previous work, leading to fresh perspectives and innovative outcomes. So, where do we see the field of artificial intelligence (AI) in bioinformatics going in the next 30 years? That was the theme of a "No-Boundary Thinking" Session as part of the Mid-South Computational Bioinformatics Society's (MCBIOS) 19th annual meeting in Irving, Texas. This session addressed various areas of AI in an open discussion and raised some perspectives on how popular tools like ChatGPT can be integrated into bioinformatics, communicating with scientists in different fields to properly utilize the potential of these algorithms, and how to continue educational outreach to further interest of data science and informatics to the next-generation of scientists.

9.
Front Genet ; 13: 1057408, 2022.
Article in English | MEDLINE | ID: mdl-36324507
10.
Front Bioeng Biotechnol ; 10: 1016408, 2022.
Article in English | MEDLINE | ID: mdl-36324897

ABSTRACT

Nanopore technology enables portable, real-time sequencing of microbial populations from clinical and ecological samples. An emerging healthcare application for Nanopore includes point-of-care, timely identification of antibiotic resistance genes (ARGs) to help developing targeted treatments of bacterial infections, and monitoring resistant outbreaks in the environment. While several computational tools exist for classifying ARGs from sequencing data, to date (2022) none have been developed for mobile devices. We present here KARGAMobile, a mobile app for portable, real-time, easily interpretable analysis of ARGs from Nanopore sequencing. KARGAMobile is the porting of an existing ARG identification tool named KARGA; it retains the same algorithmic structure, but it is optimized for mobile devices. Specifically, KARGAMobile employs a compressed ARG reference database and different internal data structures to save RAM usage. The KARGAMobile app features a friendly graphical user interface that guides through file browsing, loading, parameter setup, and process execution. More importantly, the output files are post-processed to create visual, printable and shareable reports, aiding users to interpret the ARG findings. The difference in classification performance between KARGAMobile and KARGA is minimal (96.2% vs. 96.9% f-measure on semi-synthetic datasets of 1 million reads with known resistance ground truth). Using real Nanopore experiments, KARGAMobile processes on average 1 GB data every 23-48 min (targeted sequencing - metagenomics), with peak RAM usage below 500MB, independently from input file sizes, and an average temperature of 49°C after 1 h of continuous data processing. KARGAMobile is written in Java and is available at https://github.com/Ruiz-HCI-Lab/KargaMobile under the MIT license.

11.
Front Toxicol ; 4: 893924, 2022.
Article in English | MEDLINE | ID: mdl-35812168

ABSTRACT

Research in environmental health is becoming increasingly reliant upon data science and computational methods that can more efficiently extract information from complex datasets. Data science and computational methods can be leveraged to better identify relationships between exposures to stressors in the environment and human disease outcomes, representing critical information needed to protect and improve global public health. Still, there remains a critical gap surrounding the training of researchers on these in silico methods. We aimed to address this gap by developing the inTelligence And Machine lEarning (TAME) Toolkit, promoting trainee-driven data generation, management, and analysis methods to "TAME" data in environmental health studies. Training modules were developed to provide applications-driven examples of data organization and analysis methods that can be used to address environmental health questions. Target audiences for these modules include students, post-baccalaureate and post-doctorate trainees, and professionals that are interested in expanding their skillset to include recent advances in data analysis methods relevant to environmental health, toxicology, exposure science, epidemiology, and bioinformatics/cheminformatics. Modules were developed by study coauthors using annotated script and were organized into three chapters within a GitHub Bookdown site. The first chapter of modules focuses on introductory data science, which includes the following topics: setting up R/RStudio and coding in the R environment; data organization basics; finding and visualizing data trends; high-dimensional data visualizations; and Findability, Accessibility, Interoperability, and Reusability (FAIR) data management practices. The second chapter of modules incorporates chemical-biological analyses and predictive modeling, spanning the following methods: dose-response modeling; machine learning and predictive modeling; mixtures analyses; -omics analyses; toxicokinetic modeling; and read-across toxicity predictions. The last chapter of modules was organized to provide examples on environmental health database mining and integration, including chemical exposure, health outcome, and environmental justice indicators. Training modules and associated data are publicly available online (https://uncsrp.github.io/Data-Analysis-Training-Modules/). Together, this resource provides unique opportunities to obtain introductory-level training on current data analysis methods applicable to 21st century science and environmental health.

13.
Front Oncol ; 11: 738801, 2021.
Article in English | MEDLINE | ID: mdl-34804927

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is a highly malignant tumor with poor prognosis and limited therapeutic options. Alternating electrical fields with low intensity called "Tumor Treating Fields" (TTFields) are a new, non-invasive approach with almost no side effects and phase 3 trials are ongoing in advanced PDAC. We evaluated TTFields in combination with mild hyperthermia. Three established human PDAC cell lines and an immortalized pancreatic duct cell line were treated with TTFields and hyperthermia at 38.5°C, followed by microscopy, assays for MTT, migration, colony and sphere formation, RT-qPCR, FACS, Western blot, microarray and bioinformatics, and in silico analysis using the online databases GSEA, KEGG, Cytoscape-String, and Kaplan-Meier Plotter. Whereas TTFields and hyperthermia alone had weak effects, their combination strongly inhibited the viability of malignant, but not those of nonmalignant cells. Progression features and the cell cycle were impaired, and autophagy was induced. The identified target genes were key players in autophagy, the cell cycle and DNA repair. The expression profiles of part of these target genes were significantly involved in the survival of PDAC patients. In conclusion, the combination of TTFields with mild hyperthermia results in greater efficacy without increased toxicity and could be easily clinically approved as supporting therapy.

14.
Front Genet ; 12: 739470, 2021.
Article in English | MEDLINE | ID: mdl-34497636

ABSTRACT

BACKGROUND: Gastric cancer is one of the most serious gastrointestinal malignancies with bad prognosis. Ferroptosis is an iron-dependent form of programmed cell death, which may affect the prognosis of gastric cancer patients. Long non-coding RNAs (lncRNAs) can affect the prognosis of cancer through regulating the ferroptosis process, which could be potential overall survival (OS) prediction factors for gastric cancer. METHODS: Ferroptosis-related lncRNA expression profiles and the clinicopathological and OS information were collected from The Cancer Genome Atlas (TCGA) and the FerrDb database. The differentially expressed ferroptosis-related lncRNAs were screened with the DESeq2 method. Through co-expression analysis and functional annotation, we then identified the associations between ferroptosis-related lncRNAs and the OS rates for gastric cancer patients. Using Cox regression analysis with the least absolute shrinkage and selection operator (LASSO) algorithm, we constructed a prognostic model based on 17 ferroptosis-related lncRNAs. We also evaluated the prognostic power of this model using Kaplan-Meier (K-M) survival curve analysis, receiver operating characteristic (ROC) curve analysis, and decision curve analysis (DCA). RESULTS: A ferroptosis-related "lncRNA-mRNA" co-expression network was constructed. Functional annotation revealed that the FOXO and HIF-1 signaling pathways were dysregulated, which might control the prognosis of gastric cancer patients. Then, a ferroptosis-related gastric cancer prognostic signature model including 17 lncRNAs was constructed. Based on the RiskScore calculated using this model, the patients were divided into a High-Risk group and a low-risk group. The K-M survival curve analysis revealed that the higher the RiskScore, the worse is the obtained prognosis. The ROC curve analysis showed that the area under the ROC curve (AUC) of our model is 0.751, which was better than those of other published models. The multivariate Cox regression analysis results showed that the lncRNA signature is an independent risk factor for the OS rates. Finally, using nomogram and DCA, we also observed a preferable clinical practicality potential for prognosis prediction of gastric cancer patients. CONCLUSION: Our prognostic signature model based on 17 ferroptosis-related lncRNAs may improve the overall survival prediction in gastric cancer.

15.
Front Genet ; 12: 721873, 2021.
Article in English | MEDLINE | ID: mdl-34408776

ABSTRACT

Background: Triple-negative breast cancer (TNBC) is a special subtype of breast cancer with poor prognosis. DNA damage response (DDR) is one of the hallmarks of this cancer. However, the association of DDR genes with the prognosis of TNBC is still unclear. Methods: We identified differentially expressed genes (DEGs) between normal and TNBC samples from The Cancer Genome Atlas (TCGA). DDR genes were obtained from the Molecular Signatures Database through six DDR gene sets. After the expression of six differential genes were verified by quantitative real-time polymerase chain reaction (qRT-PCR), we then overlapped the DEGs with DDR genes. Based on univariate and LASSO Cox regression analyses, a prognostic model was constructed to predict overall survival (OS). Kaplan-Meier analysis and receiver operating characteristic curve were used to assess the performance of the prognostic model. Cox regression analysis was applied to identify independent prognostic factors in TNBC. The Human Protein Atlas was used to study the immunohistochemical data of six DEGs. The prognostic model was validated using an independent dataset. Gene Ontology and the Kyoto Encyclopedia of Genes and Genomes analysis were performed by using gene set enrichment analysis (GSEA). Single-sample gene set enrichment analysis was employed to estimate immune cells related to this prognostic model. Finally, we constructed a transcriptional factor (TF) network and a competing endogenous RNA regulatory network. Results: Twenty-three differentially expressed DDR genes were detected between TNBC and normal samples. The six-gene prognostic model we developed was shown to be related to OS in TNBC using univariate and LASSO Cox regression analyses. All the six DEGs were identified as significantly up-regulated in the tumor samples compared to the normal samples in qRT-PCR. The GSEA analysis indicated that the genes in the high-risk group were mainly correlated with leukocyte migration, cytokine interaction, oxidative phosphorylation, autoimmune diseases, and coagulation cascade. The mutation data revealed the mutated genes were different. The gene-TF regulatory network showed that Replication Factor C subunit 4 occupied the dominant position. Conclusion: We identified six gene markers related to DDR, which can predict prognosis and serve as an independent biomarker for TNBC patients.

17.
Front Mol Biosci ; 8: 668184, 2021.
Article in English | MEDLINE | ID: mdl-34041266

ABSTRACT

This article is dedicated to the memory of Cyrus Chothia, who was a leading light in the world of protein structure evolution. His elegant analyses of protein families and their mechanisms of structural and functional evolution provided important evolutionary and biological insights and firmly established the value of structural perspectives. He was a mentor and supervisor to many other leading scientists who continued his quest to characterise structure and function space. He was also a generous and supportive colleague to those applying different approaches. In this article we review some of his accomplishments and the history of protein structure classifications, particularly SCOP and CATH. We also highlight some of the evolutionary insights these two classifications have brought. Finally, we discuss how the expansion and integration of protein sequence data into these structural families helps reveal the dark matter of function space and can inform the emergence of novel functions in Metazoa. Since we cover 25 years of structural classification, it has not been feasible to review all structure based evolutionary studies and hence we focus mainly on those undertaken by the SCOP and CATH groups and their collaborators.

19.
Front Genet ; 11: 568546, 2020.
Article in English | MEDLINE | ID: mdl-33193663

ABSTRACT

G-quadruplexes (G4s) are a class of stable structural nucleic acid secondary structures that are known to play a role in a wide spectrum of genomic functions, such as DNA replication and transcription. The classical understanding of G4 structure points to four variable length guanine strands joined by variable length nucleotide stretches. Experiments using G4 immunoprecipitation and sequencing experiments have produced a high number of highly probable G4 forming genomic sequences. The expense and technical difficulty of experimental techniques highlights the need for computational approaches of G4 identification. Here, we present PENGUINN, a machine learning method based on Convolutional neural networks, that learns the characteristics of G4 sequences and accurately predicts G4s outperforming state-of-the-art methods. We provide both a standalone implementation of the trained model, and a web application that can be used to evaluate sequences for their G4 potential.

SELECTION OF CITATIONS
SEARCH DETAIL