Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 97
Filtrar
1.
Comput Struct Biotechnol J ; 23: 1864-1876, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38707536

RESUMO

In current genomic research, the widely used methods for predicting antimicrobial resistance (AMR) often rely on prior knowledge of known AMR genes or reference genomes. However, these methods have limitations, potentially resulting in imprecise predictions owing to incomplete coverage of AMR mechanisms and genetic variations. To overcome these limitations, we propose a pan-genome-based machine learning approach to advance our understanding of AMR gene repertoires and uncover possible feature sets for precise AMR classification. By building compacted de Brujin graphs (cDBGs) from thousands of genomes and collecting the presence/absence patterns of unique sequences (unitigs) for Pseudomonas aeruginosa, we determined that using machine learning models on unitig-centered pan-genomes showed significant promise for accurately predicting the antibiotic resistance or susceptibility of microbial strains. Applying a feature-selection-based machine learning algorithm led to satisfactory predictive performance for the training dataset (with an area under the receiver operating characteristic curve (AUC) of > 0.929) and an independent validation dataset (AUC, approximately 0.77). Furthermore, the selected unitigs revealed previously unidentified resistance genes, allowing for the expansion of the resistance gene repertoire to those that have not previously been described in the literature on antibiotic resistance. These results demonstrate that our proposed unitig-based pan-genome feature set was effective in constructing machine learning predictors that could accurately identify AMR pathogens. Gene sets extracted using this approach may offer valuable insights into expanding known AMR genes and forming new hypotheses to uncover the underlying mechanisms of bacterial AMR.

2.
Expert Opin Drug Metab Toxicol ; : 1-8, 2024 May 27.
Artigo em Inglês | MEDLINE | ID: mdl-38742542

RESUMO

INTRODUCTION: This review explores the transformative impact of machine learning (ML) on carcinogenicity prediction within drug development. It discusses the historical context and recent advancements, emphasizing the significance of ML methodologies in overcoming challenges related to data interpretation, ethical considerations, and regulatory acceptance. AREAS COVERED: The review comprehensively examines the integration of ML, deep learning, and diverse artificial intelligence (AI) approaches in various aspects of drug development safety assessments. It explores applications ranging from early-phase compound screening to clinical trial optimization, highlighting the versatility of ML in enhancing predictive accuracy and efficiency. EXPERT OPINION: Through the analysis of traditional approaches such as in vivo rodent bioassays and in vitro assays, the review underscores the limitations and resource intensity associated with these methods. It provides expert insights into how ML offers innovative solutions to address these challenges, revolutionizing safety assessments in drug development.

3.
J Imaging Inform Med ; 2024 Apr 30.
Artigo em Inglês | MEDLINE | ID: mdl-38689151

RESUMO

Recurrences are frequent in nasopharyngeal carcinoma (NPC) despite high remission rates with treatment, leading to considerable morbidity. This study aimed to develop a prediction model for NPC survival by harnessing both pre- and post-treatment magnetic resonance imaging (MRI) radiomics in conjunction with clinical data, focusing on 3-year progression-free survival (PFS) as the primary outcome. Our comprehensive approach involved retrospective clinical and MRI data collection of 276 eligible NPC patients from three independent hospitals (180 in the training cohort, 46 in the validation cohort, and 50 in the external cohort) who underwent MRI scans twice, once within 2 months prior to treatment and once within 10 months after treatment. From the contrast-enhanced T1-weighted images before and after treatment, 3404 radiomics features were extracted. These features were not only derived from the primary lesion but also from the adjacent lymph nodes surrounding the tumor. We conducted appropriate feature selection pipelines, followed by Cox proportional hazards models for survival analysis. Model evaluation was performed using receiver operating characteristic (ROC) analysis, the Kaplan-Meier method, and nomogram construction. Our study unveiled several crucial predictors of NPC survival, notably highlighting the synergistic combination of pre- and post-treatment data in both clinical and radiomics assessments. Our prediction model demonstrated robust performance, with an accuracy of AUCs of 0.66 (95% CI: 0.536-0.779) in the training cohort, 0.717 (95% CI: 0.536-0.883) in the testing cohort, and 0.827 (95% CI: 0.684-0.948) in validation cohort in prognosticating patient outcomes. Our study presented a novel and effective prediction model for NPC survival, leveraging both pre- and post-treatment clinical data in conjunction with MRI features. Its constructed nomogram provides potentially significant implications for NPC research, offering clinicians a valuable tool for individualized treatment planning and patient counseling.

4.
Comput Biol Med ; 174: 108408, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38636332

RESUMO

Accurately predicting tumor T-cell antigen (TTCA) sequences is a crucial task in the development of cancer vaccines and immunotherapies. TTCAs derived from tumor cells, are presented to immune cells (T cells) through major histocompatibility complex (MHC), via the recognition of specific portions of their structure known as epitopes. More specifically, MHC class I introduces TTCAs to T-cell receptors (TCR) which are located on the surface of CD8+ T cells. However, TTCA sequences are varied and lead to struggles in vaccine design. Recently, Machine learning (ML) models have been developed to predict TTCA sequences which could aid in fast and correct TTCA identification. During the construction of the TTCA predictor, the peptide encoding strategy is an important step. Previous studies have used biological descriptors for encoding TTCA sequences. However, there have been no studies that use natural language processing (NLP), a potential approach for this purpose. As sentences have their own words with diverse properties, biological sequences also hold unique characteristics that reflect evolutionary information, physicochemical values, and structural information. We hypothesized that NLP methods would benefit the prediction of TTCA. To develop a new identifying TTCA model, we first constructed a based model with widely used ML algorithms and extracted features from biological descriptors. Then, to improve our model performance, we added extracted features from biological language models (BLMs) based on NLP methods. Besides, we conducted feature selection by using Chi-square and Pearson Correlation Coefficient techniques. Then, SMOTE, Up-sampling, and Near-Miss were used to treat unbalanced data. Finally, we optimized Sa-TTCA by the SVM algorithm to the four most effective feature groups. The best performance of Sa-TTCA showed a competitive balanced accuracy of 87.5% on a training set, and 72.0% on an independent testing set. Our results suggest that integrating biological descriptors with natural language processing has the potential to improve the precision of predicting protein/peptide functionality, which could be beneficial for developing cancer vaccines.


Assuntos
Antígenos de Neoplasias , Processamento de Linguagem Natural , Máquina de Vetores de Suporte , Humanos , Antígenos de Neoplasias/imunologia , Antígenos de Neoplasias/química , Antígenos de Neoplasias/genética , Neoplasias/imunologia , Análise de Sequência de Proteína/métodos , Biologia Computacional/métodos
5.
EClinicalMedicine ; 71: 102578, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38606167

RESUMO

Background: Constipation is prevalent worldwide, significantly increasing healthcare costs and diminishing the quality of life in children affected. Current studies have yielded mixed results regarding the factors associated with constipation, and mainly focusing on patients outside of Asia. Moreover, most of these studies lack focus on the paediatric population. This study aimed to identify the prevalence and associated factors of constipation among children in Asia. Methods: In this systematic review and meta-analysis, we systematically searched PubMed, Scopus, and Cochrane for cohort and cross-sectional studies published from database inception up to October 12, 2022, and continued with manual searching until September 2, 2023. Eligible studies were those that included children in Asia aged 0-18 years old suffering from idiopathic constipation, with prevalence value provided in the English abstract. The analysis included clinical and general population. Children with organic constipation, who had undergone gastrointestinal surgery, or with congenital defects were excluded, as these factors affect the incidence of constipation. Data included in the analysis were extracted from published reports only. The extracted data were pooled using random-effects model to analyse the prevalence of constipation in children in Asia. This study is registered with PROSPERO, CRD42022367122. Findings: Out of 4410 systematically searched studies and 36 manually searched ones, a total of 50 studies were included in the final analysis, encompassing data from 311,660 children residing in Asia. The pooled prevalence of constipation was 12.0% (95% CI 9.3-14.6%, I2 = 99.8%). There was no significant difference in constipation prevalence observed by sex and geographical location. Nonetheless, adolescents and children aged 1-9 years exhibited a significantly higher prevalence constipation compared to infants (p < 0.0001) Additionally, significant differences in constipation rates were observed across various diagnostic methods, population sources, and mental health conditions. Interpretation: Despite the high heterogeneity resulting from varying diagnostic tools or definitions used among studies, our review adds to the literature on constipation among children in Asia. It reveals a notably high prevalence of constipation in this demographic. Diagnostic methods, age, and compromised mental health emerged as significant influencers of constipation among children in Asia, highlighting potential strategies to mitigate constipation prevalence in children in Asia. Funding: The National Science and Technology Council, Taiwan.

6.
Int J Mol Sci ; 25(5)2024 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-38473938

RESUMO

The role of the IFI6 gene has been described in several cancers, but its involvement in esophageal cancer (ESCA) remains unclear. This study aimed to identify novel prognostic indicators for ESCA-targeted therapy by investigating IFI6's expression, epigenetic mechanisms, and signaling activities. We utilized public data from the Gene Expression Omnibus (GEO) and the Cancer Genome Atlas (TCGA) to analyze IFI6's expression, clinical characteristics, gene function, pathways, and correlation with different immune cells in ESCA. The TIMER2.0 database was employed to assess the pan-cancer expression of IFI6, while UALCAN was used to examine its expression across tumor stages and histology subtypes. Additionally, the KEGG database helped identify related pathways. Our findings revealed 95 genes positively correlated and 15 genes negatively correlated with IFI6 in ESCA. IFI6 was over-expressed in ESCA and other cancers, impacting patient survival and showing higher expression in tumor tissues than normal tissues. IFI6 was also correlated with CD4+ T cells and B cell receptors (BCRs), both essential in immune response. GO Biological Process (GO BP) enrichment analysis indicated that IFI6 was primarily associated with the Type I interferon signaling pathway and the defense response to viruses. Intriguingly, KEGG pathway analysis demonstrated that IFI6 and its positively correlated genes in ESCA were mostly linked to the Cytosolic DNA-sensing pathway, which plays a crucial role in innate immunity and viral defense, and the RIG-I-like receptor (RLR) signaling pathway, which detects viral infections and activates immune responses. Pathways related to various viral infections were also identified. It is important to note that our study relied on online databases. Given that ESCA consists of two distinct subgroups (ESCC and EAC), most databases combine them into a single category. Future research should focus on evaluating IFI6 expression and its impact on each subgroup to gain more specific insights. In conclusion, inhibiting IFI6 using targeted therapy could be an effective strategy for treating ESCA considering its potential as a biomarker and correlation with immune cell factors.


Assuntos
Neoplasias Esofágicas , Viroses , Humanos , Prognóstico , Multiômica , Linfócitos T CD4-Positivos , Proteínas Mitocondriais
7.
J Imaging Inform Med ; 37(2): 725-733, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38308069

RESUMO

Common pediatric distal forearm fractures necessitate precise detection. To support prompt treatment planning by clinicians, our study aimed to create a multi-class convolutional neural network (CNN) model for pediatric distal forearm fractures, guided by the AO Foundation/Orthopaedic Trauma Association (AO/ATO) classification system for pediatric fractures. The GRAZPEDWRI-DX dataset (2008-2018) of wrist X-ray images was used. We labeled images into four fracture classes (FRM, FUM, FRE, and FUE with F, fracture; R, radius; U, ulna; M, metaphysis; and E, epiphysis) based on the pediatric AO/ATO classification. We performed multi-class classification by training a YOLOv4-based CNN object detection model with 7006 images from 1809 patients (80% for training and 20% for validation). An 88-image test set from 34 patients was used to evaluate the model performance, which was then compared to the diagnosis performances of two readers-an orthopedist and a radiologist. The overall mean average precision levels on the validation set in four classes of the model were 0.97, 0.92, 0.95, and 0.94, respectively. On the test set, the model's performance included sensitivities of 0.86, 0.71, 0.88, and 0.89; specificities of 0.88, 0.94, 0.97, and 0.98; and area under the curve (AUC) values of 0.87, 0.83, 0.93, and 0.94, respectively. The best performance among the three readers belonged to the radiologist, with a mean AUC of 0.922, followed by our model (0.892) and the orthopedist (0.830). Therefore, using the AO/OTA concept, our multi-class fracture detection model excelled in identifying pediatric distal forearm fractures.

8.
Prog Mol Biol Transl Sci ; 203: 83-97, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38360007

RESUMO

Nowadays, information technology (IT) has been holding a significant role in daily life worldwide. The trajectory of data science and bioinformatics promises pioneering personalized therapies, reshaping medical landscapes and patient care. For RNA therapy to reach more patients, a comprehensive understanding of the application of data science and bioinformatics to this therapy is essential. Thus, this chapter has summarized the application of data science and bioinformatics in RNA therapeutics. Data science applications in RNA therapy, such as data integration and analytics, machine learning, and drug development, have been discussed. In addition, aspects of bioinformatics such as RNA design and evaluation, drug delivery system simulation, and databases for personalized medicine have also been covered in this chapter. These insights have shed light on existing evidence and opened potential future directions. From there, scientists can elevate RNA-based therapeutics into an era of tailored treatments and revolutionary healthcare.


Assuntos
Biologia Computacional , Ciência de Dados , Humanos , Medicina de Precisão
10.
J Assist Reprod Genet ; 41(2): 239-252, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37880512

RESUMO

With the rising demand for in vitro fertilization (IVF) cycles, there is a growing need for innovative techniques to optimize procedure outcomes. One such technique is time-lapse system (TLS) for embryo incubation, which minimizes environmental changes in the embryo culture process. TLS also significantly advances predicting embryo quality, a crucial determinant of IVF cycle success. However, the current subjective nature of embryo assessments is due to inter- and intra-observer subjectivity, resulting in highly variable results. To address this challenge, reproductive medicine has gradually turned to artificial intelligence (AI) to establish a standardized and objective approach, aiming to achieve higher success rates. Extensive research is underway investigating the utilization of AI in TLS to predict multiple outcomes. These studies explore the application of popular AI algorithms, their specific implementations, and the achieved advancements in TLS. This review aims to provide an overview of the advances in AI algorithms and their particular applications within the context of TLS and the potential challenges and opportunities for further advancements in reproductive medicine.


Assuntos
Inteligência Artificial , Medicina Reprodutiva , Humanos , Imagem com Lapso de Tempo/métodos , Fertilização in vitro/métodos , Algoritmos
11.
Acad Radiol ; 31(2): 660-683, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37120403

RESUMO

RATIONALE AND OBJECTIVES: Recent advancements in artificial intelligence (AI) render a substantial promise for epidermal growth factor receptor (EGFR) mutation status prediction in non-small cell lung cancer (NSCLC). We aimed to evaluate the performance and quality of AI algorithms that use radiomics features in predicting EGFR mutation status in patient with NSCLC. MATERIALS AND METHODS: We searched PubMed (Medline), EMBASE, Web of Science, and IEEExplore for studies published up to February 28, 2022. Studies utilizing an AI algorithm (either conventional machine learning [cML] and deep learning [DL]) for predicting EGFR mutations in patients with NSLCL were included. We extracted binary diagnostic accuracy data and constructed a bivariate random-effects model to obtain pooled sensitivity, specificity, and 95% confidence interval. This study is registered with PROSPERO, CRD42021278738. RESULTS: Our search identified 460 studies, of which 42 were included. Thirty-five studies were included in the meta-analysis. The AI algorithms exhibited an overall area under the curve (AUC) value of 0.789 and pooled sensitivity and specificity levels of 72.2% and 73.3%, respectively. The DL algorithms outperformed cML in terms of AUC (0.822 vs. 0.775) and sensitivity (80.1% vs. 71.1%), but had lower specificity (70.0% vs. 73.8%, p-value < 0.001) compared to cML. Subgroup analysis revealed that the use of positron-emission tomography/computed tomography, additional clinical information, deep feature extraction, and manual segmentation can improve diagnostic performance. CONCLUSION: DL algorithms can serve as a novel method for increasing predictive accuracy and thus have considerable potential for use in predicting EGFR mutation status in patient with NSCLC. We also suggest that guidelines on using AI algorithms in medical image analysis should be developed with a focus on oncologic radiomics.


Assuntos
Carcinoma Pulmonar de Células não Pequenas , Neoplasias Pulmonares , Humanos , Carcinoma Pulmonar de Células não Pequenas/diagnóstico por imagem , Carcinoma Pulmonar de Células não Pequenas/genética , Neoplasias Pulmonares/diagnóstico por imagem , Neoplasias Pulmonares/genética , Inteligência Artificial , Receptores ErbB/genética , Mutação/genética
12.
Comput Biol Med ; 168: 107662, 2024 01.
Artigo em Inglês | MEDLINE | ID: mdl-37979206

RESUMO

This study introduces VF-Pred, a novel framework developed for the purpose of detecting virulence factors (VFs) through the analysis of genomic data. VFs are crucial for pathogens to successfully infect host tissue and evade the immune system, leading to the onset of infectious diseases. Identifying VFs accurately is of utmost importance in the quest for developing potent drugs and vaccines to counter these diseases. To accomplish this, VF-Pred combines various feature engineering techniques to generate inputs for distinct machine learning classification models. The collective predictions of these models are then consolidated by a final downstream model using an innovative ensembling approach. One notable aspect of VF-Pred is the inclusion of a novel Seq-Alignment feature, which significantly enhances the accuracy of the employed machine learning algorithms. The framework was meticulously trained on 982 features obtained from extensive feature engineering, utilizing a comprehensive ensemble of 25 models. The new downstream ensembling technique adopted by VF-Pred surpasses existing stacking strategies and other ensembling methods, delivering superior performance in VF detection. There have been similar studies done earlier, VF-Pred stands out in comparison showing higher accuracy (83.5 %), higher sensitivity (87 %) towards identification of VFs. Accessible through a user-friendly web page, VF-Pred can be accessed by providing the identifier and protein sequence, enabling the prediction of high or low likelihoods of VFs. Overall, VF-Pred showcases a highly promising methodology for the identification of VFs, potentially paving the way for the development of more effective strategies in the battle against infectious diseases.


Assuntos
Doenças Transmissíveis , Fatores de Virulência , Humanos , Fatores de Virulência/genética , Alinhamento de Sequência , Algoritmos , Aprendizado de Máquina
13.
ACS Omega ; 8(42): 39420-39426, 2023 Oct 24.
Artigo em Inglês | MEDLINE | ID: mdl-37901522

RESUMO

DNA N6-methyladenosine (6 mA) modification carries significant epigenetic information and plays a pivotal role in biological functions, thereby profoundly impacting human development. Precise and reliable detection of 6 mA sites is integral to understanding the mechanisms underpinning DNA modification. The present methods, primarily experimental, used to identify specific molecular sites are often time-intensive and costly. Consequently, the rise of computer-based methods aimed at identifying 6 mA sites provides a welcome alternative. Our research introduces a novel model to discern DNA 6 mA sites in cross-species genomes. This model, developed through machine learning, utilizes extracted sequence information. Hyperparameter tuning was employed to ascertain the most effective feature combination and model implementation, thereby garnering vital information from sequences. Our model demonstrated superior accuracy compared to the existing models when tested using five-fold cross-validation. Thus, our study substantiates the reliability and efficiency of our model as a valuable tool for supplementing experimental research.

14.
Radiol Case Rep ; 18(12): 4404-4408, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37829164

RESUMO

The ingested foreign body is a very unusual etiology of liver abscess. This clinical scenario is infrequently reported in the literature. A 66-year-old male patient presented to the hospital because of abdominal pain along with 7 days of right upper quadrant pain and intermittent low-grade fever. He was living in an epidemiological area of Fasciola infection. Physical examination showed right hypochondria tenderness without guarding or rebounding. Laboratory results were significant for leukocytosis, predominant neutrophils, and increased inflammatory markers. The liver function tests were within normal limits. Abdominal ultrasonography and CT scan were consistent with a hepatic abscess spread from segment 4B to segment 3. The patient was preliminarily diagnosed with a parasitic hepatic abscess. After management with fluid infusion and antibiotics, the patient was discharged in stable condition. Two weeks later, on the follow-up visit, the patient reported intermittent low-grade fever had persisted. After consulting the CT scan, an abnormal high-attenuation linear structure was identified inside the liver lesion, which is suspected of being a foreign body. Laparoscopic surgery was performed, and a fishbone was removed from the abscess cavity. Perforation was not found in the stomach, duodenum, or in the bowel. One week later, their condition was fully resolved. Liver abscess due to a foreign body should be suspected when a patient has radiology findings suggestive of an abscess, but the clinical presentation does not indicate the common etiologies. Meticulous observation on abdominal CT scans or ultrasonography can help with diagnosis and guide treatment.

15.
Curr Med Chem ; 2023 Sep 13.
Artigo em Inglês | MEDLINE | ID: mdl-37711014

RESUMO

Gastric cancer (GC) represents a significant global health burden, ranking as the fifth most common malignancy and the fourth leading cause of cancer-related death worldwide. Despite recent advancements in GC treatment, the five-year survival rate for advanced-stage GC patients remains low. Consequently, there is an urgent need to identify novel drug targets and develop effective therapies. However, traditional drug discovery approaches are associated with high costs, time-consuming processes, and a high failure rate, posing challenges in meeting this critical need. In recent years, there has been a rapid increase in the utilization of artificial intelligence (AI) algorithms and big data in drug discovery, particularly in cancer research. AI has the potential to improve the drug discovery process by analyzing vast and complex datasets from multiple sources, enabling the prediction of compound efficacy and toxicity, as well as the optimization of drug candidates. This review provides an overview of the latest AI algorithms and big data employed in drug discovery for GC. Additionally, we examine the various applications of AI in this field, with a specific focus on therapeutic discovery. Moreover, we discuss the challenges, limitations, and prospects of emerging AI methods, which hold significant promise for advancing GC research in the future.

16.
Nutrition ; 116: 112212, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37776838

RESUMO

OBJECTIVE: Mobile nutrition applications (apps) provide a simple way for individuals to record their diet, but the validity and inherent errors need to be carefully evaluated. The aim of this study was to assess the validity and clarify the sources of measurement errors of image-assisted mobile nutrition apps. METHODS: This was a cross-sectional study with 98 students recruited from School of Nutrition and Health Sciences, Taipei Medical University. A 3-d nutrient intake record by Formosa Food and Nutrient Recording App (FoodApp) was compared with a 24-h dietary recall (24-HDR). A two-stage data modification process, manual data cleaning, and reanalyzing of prepackaged foods were employed to address inherent errors. Nutrient intake levels obtained by the two methods were compared with the recommended daily intake (DRI), Taiwan. Paired t test, Spearman's correlation coefficients, and Bland-Altman plots were used to assess agreement between the FoodApp and 24-HDR. RESULTS: Manual data cleaning identified 166 food coding errors (12%; stage 1), and 426 food codes with missing micronutrients (32%) were reanalyzed (stage 2). Positive linear trends were observed for total energy and micronutrient intake (all Ptrend < 0.05) after the two stages of data modification, but not for dietary fat, carbohydrates, or vitamin D. There were no statistical differences in mean energy and macronutrient intake between the FoodApp and 24-HDR, and this agreement was confirmed by Bland-Altman plots. Spearman's correlation analyses showed strong to moderate correlations (r = 0.834 ∼ 0.386) between the two methods. Participants' nutrient intake tended to be lower than the DRI, but no differences in proportions of adequacy/inadequacy for DRI values were observed between the two methods. CONCLUSIONS: Mitigating errors significantly improved the accuracy of the Formosa FoodApp, indicating its validity and reliability as a self-reporting mobile-based dietary assessment tool. Dietitians and health professionals should be mindful of potential errors associated with self-reporting nutrition apps, and manual data cleaning is vital to obtain reliable nutrient intake data.


Assuntos
Aplicativos Móveis , Humanos , Reprodutibilidade dos Testes , Estudos Transversais , Avaliação Nutricional , Dieta , Ingestão de Energia , Gorduras na Dieta , Registros de Dieta
17.
Brief Bioinform ; 24(5)2023 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-37649385

RESUMO

Protein crystallization is crucial for biology, but the steps involved are complex and demanding in terms of external factors and internal structure. To save on experimental costs and time, the tendency of proteins to crystallize can be initially determined and screened by modeling. As a result, this study created a new pipeline aimed at using protein sequence to predict protein crystallization propensity in the protein material production stage, purification stage and production of crystal stage. The newly created pipeline proposed a new feature selection method, which involves combining Chi-square (${\chi }^{2}$) and recursive feature elimination together with the 12 selected features, followed by a linear discriminant analysisfor dimensionality reduction and finally, a support vector machine algorithm with hyperparameter tuning and 10-fold cross-validation is used to train the model and test the results. This new pipeline has been tested on three different datasets, and the accuracy rates are higher than the existing pipelines. In conclusion, our model provides a new solution to predict multistage protein crystallization propensity which is a big challenge in computational biology.


Assuntos
Algoritmos , Aprendizado de Máquina , Cristalização , Sequência de Aminoácidos , Biologia Computacional
18.
Cancers (Basel) ; 15(15)2023 Jul 31.
Artigo em Inglês | MEDLINE | ID: mdl-37568715

RESUMO

Glioblastoma (GBM) is one of the most progressive and prevalent cancers of the central nervous system. Identifying genetic markers is therefore crucial to predict prognosis and enhance treatment effectiveness in GBM. To this end, we obtained gene expression data of GBM from TCGA and GEO datasets and identified differentially expressed genes (DEGs), which were overlapped and used for survival analysis with univariate Cox regression. Next, the genes' biological significance and potential as immunotherapy candidates were examined using functional enrichment and immune infiltration analysis. Eight prognostic-related DEGs in GBM were identified, namely CRNDE, NRXN3, POPDC3, PTPRN, PTPRN2, SLC46A2, TIMP1, and TNFSF9. The derived risk model showed robustness in identifying patient subgroups with significantly poorer overall survival, as well as those with distinct GBM molecular subtypes and MGMT status. Furthermore, several correlations between the expression of the prognostic genes and immune infiltration cells were discovered. Overall, we propose a survival-derived risk score that can provide prognostic significance and guide therapeutic strategies for patients with GBM.

19.
Med Biol Eng Comput ; 61(10): 2699-2712, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37432527

RESUMO

Lower-grade gliomas (LGG) can eventually progress to glioblastoma (GBM) and death. In the context of the transfer learning approach, we aimed to train and test an MRI-based radiomics model for predicting survival in GBM patients and validate it in LGG patients. From each patient's 704 MRI-based radiomics features, we chose seventeen optimal radiomics signatures in the GBM training set (n = 71) and used these features in both the GBM testing set (n = 31) and LGG validation set (n = 107) for further analysis. Each patient's risk score, calculated based on those optimal radiomics signatures, was chosen to represent the radiomics model. We compared the radiomics model with clinical, gene status models, and combined model integrating radiomics, clinical, and gene status in predicting survival. The average iAUCs of combined models in training, testing, and validation sets were respectively 0.804, 0.878, and 0.802, and those of radiomics models were 0.798, 0.867, and 0.717. The average iAUCs of gene status and clinical models ranged from 0.522 to 0.735 in all three sets. The radiomics model trained in GBM patients can effectively predict the overall survival of GBM and LGG patients, and the combined model improved this ability.


Assuntos
Neoplasias Encefálicas , Glioblastoma , Glioma , Humanos , Neoplasias Encefálicas/diagnóstico por imagem , Neoplasias Encefálicas/genética , Estudos Retrospectivos , Glioma/diagnóstico por imagem , Imageamento por Ressonância Magnética , Aprendizado de Máquina
20.
Brief Funct Genomics ; 2023 Jul 29.
Artigo em Inglês | MEDLINE | ID: mdl-37519050

RESUMO

Lung cancer has been the most common and the leading cause of cancer deaths globally. Besides clinicopathological observations and traditional molecular tests, the advent of robust and scalable techniques for nucleic acid analysis has revolutionized biological research and medicinal practice in lung cancer treatment. In response to the demands for minimally invasive procedures and technology development over the past decade, many types of multi-omics data at various genome levels have been generated. As omics data grow, artificial intelligence models, particularly deep learning, are prominent in developing more rapid and effective methods to potentially improve lung cancer patient diagnosis, prognosis and treatment strategy. This decade has seen genome-based deep learning models thriving in various lung cancer tasks, including cancer prediction, subtype classification, prognosis estimation, cancer molecular signatures identification, treatment response prediction and biomarker development. In this study, we summarized available data sources for deep-learning-based lung cancer mining and provided an update on recent deep learning models in lung cancer genomics. Subsequently, we reviewed the current issues and discussed future research directions of deep-learning-based lung cancer genomics research.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA