RESUMO
The current state of the COVID-19 pandemic is a global health crisis. To fight the novel coronavirus, one of the best-known ways is to block enzymes essential for virus replication. Currently, we know that the SARS-CoV-2 virus encodes about 29 proteins such as spike protein, 3C-like protease (3CLpro), RNA-dependent RNA polymerase (RdRp), Papain-like protease (PLpro), and nucleocapsid (N) protein. SARS-CoV-2 uses human angiotensin-converting enzyme 2 (ACE2) for viral entry and transmembrane serine protease family member II (TMPRSS2) for spike protein priming. Thus in order to speed up the discovery of potential drugs, we develop DockCoV2, a drug database for SARS-CoV-2. DockCoV2 focuses on predicting the binding affinity of FDA-approved and Taiwan National Health Insurance (NHI) drugs with the seven proteins mentioned above. This database contains a total of 3,109 drugs. DockCoV2 is easy to use and search against, is well cross-linked to external databases, and provides the state-of-the-art prediction results in one site. Users can download their drug-protein docking data of interest and examine additional drug-related information on DockCoV2. Furthermore, DockCoV2 provides experimental information to help users understand which drugs have already been reported to be effective against MERS or SARS-CoV. DockCoV2 is available at https://covirus.cc/drugs/.
Assuntos
Antivirais/uso terapêutico , Tratamento Farmacológico da COVID-19 , Bases de Dados de Produtos Farmacêuticos/estatística & dados numéricos , SARS-CoV-2/efeitos dos fármacos , Antivirais/metabolismo , COVID-19/epidemiologia , COVID-19/virologia , Curadoria de Dados/métodos , Mineração de Dados/métodos , Humanos , Internet , Modelos Moleculares , Pandemias , Ligação Proteica/efeitos dos fármacos , Domínios Proteicos , SARS-CoV-2/metabolismo , SARS-CoV-2/fisiologia , Proteínas Virais/química , Proteínas Virais/metabolismo , Replicação Viral/efeitos dos fármacosRESUMO
BACKGROUND: Ankylosing spondylitis (AS) is an autoimmune disease affecting mainly spine and sacroiliac joints and adjacent soft tissues. Genome-wide association studies (GWASs) are used to evaluate genetic associations and to predict genetic risk factors that determine the biological basis of disease susceptibility. We aimed to explore the race-specific SNP susceptibility of AS in Taiwanese individuals and to investigate the association between HLA-B27 and AS susceptibility SNPs in Taiwan. METHODS: Genotyping data were collected from a medical center participating in the Taiwan Precision Medicine Initiative (TPMI) in the northern district of Taiwan. We designed a case-control study to identify AS susceptibility SNPs through GWAS. We searched the genome browser to find the corresponding susceptibility genes and used the GTEx database to confirm the regulation of gene expression. A polygenic risk score approach was also applied to evaluate the genetic variants in the prediction of developing AS. RESULTS: The results showed that the SNPs located on the sixth chromosome were related to higher susceptibility in the AS group. There was no overlap between our results and the susceptibility SNPs found in other races. The 12 tag SNPs located in the MHC region that were found through the linkage disequilibrium method had higher gene expression. Furthermore, Taiwanese people with HLA-B27 positivity had a higher proportion of minor alleles. This might be the reason that the AS prevalence is higher in Taiwan than in other countries. We developed AS polygenic risk score models with six different methods in which those with the top 10% polygenic risk had a fivefold increased risk of developing AS compared to the remaining group with low risk. CONCLUSION: A total of 147 SNPs in the Taiwanese population were found to be statistically significantly associated with AS on the sixth pair of chromosomes and did not overlap with previously published sites in the GWAS Catalog. Whether those genes mapped by AS-associated SNPs are involved in AS and what the pathogenic mechanism of the mapped genes is remain to be further studied.
Assuntos
Estudo de Associação Genômica Ampla , Espondilite Anquilosante , Humanos , Antígeno HLA-B27/genética , Estudos de Casos e Controles , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único/genética , Espondilite Anquilosante/genética , Espondilite Anquilosante/patologiaRESUMO
BACKGROUND: Cluster headache is a highly debilitating neurological disorder with considerable inter-ethnic differences. Genome-wide association studies (GWAS) recently identified replicable genomic loci for cluster headache in Europeans, but the genetic underpinnings for cluster headache in Asians remain unclear. The objective of this study is to investigate the genetic architecture and susceptibility loci of cluster headache in Han Chinese resided in Taiwan. METHODS: We conducted a two-stage genome-wide association study in a Taiwanese cohort enrolled from 2007 through 2022 to identify the genetic variants associated with cluster headache. Diagnosis of cluster headache was retrospectively ascertained with the criteria of International Classification of Headache Disorders, third edition. Control subjects were enrolled from the Taiwan Biobank. Genotyping was conducted with the Axiom Genome-Wide Array TWB chip, followed by whole genome imputation. A polygenic risk score was developed to differentiate patients from controls. Downstream analyses including gene-set and tissue enrichment, linkage disequilibrium score regression, and pathway analyses were performed. RESULTS: We enrolled 734 patients with cluster headache and 9,846 population-based controls. We identified three replicable loci, with the lead SNPs being rs1556780 in CAPN2 (odds ratio = 1.59, 95% CI 1.42â1.78, p = 7.61 × 10-16), rs10188640 in MERTK (odds ratio = 1.52, 95% CI 1.33â1.73, p = 8.58 × 10-13), and rs13028839 in STAB2 (odds ratio = 0.63, 95% CI 0.52â0.78, p = 2.81 × 10-8), with the latter two replicating the findings in European populations. Several previously reported genes also showed significant associations with cluster headache in our samples. Polygenic risk score differentiated patients from controls with an area under the receiver operating characteristic curve of 0.77. Downstream analyses implicated circadian regulation and immunological processes in the pathogenesis of cluster headache. CONCLUSIONS: This study revealed the genetic architecture and novel susceptible loci of cluster headache in Han Chinese residing in Taiwan. Our findings support the common genetic contributions of cluster headache across ethnicities and provide novel mechanistic insights into the pathogenesis of cluster headache.
Assuntos
Cefaleia Histamínica , Estudo de Associação Genômica Ampla , Humanos , Cefaleia Histamínica/genética , Predisposição Genética para Doença , Taiwan , Estudos Retrospectivos , Povo Asiático/genética , ChinaRESUMO
Immune Checkpoint Blockade (ICB) has revolutionized cancer treatment, however the mechanisms determining patient response remain poorly understood. Here, we used machine learning to predict ICB response from germline and somatic biomarkers and interpreted the learned model to uncover putative mechanisms driving superior outcomes. Patients with higher infiltration of T follicular helper cells had responses even in the presence of defects in the class-I Major Histocompatibility Complex (MHC-I). Further investigation uncovered different ICB responses in tumors when responses were reliant on MHC-I versus MHC-II neoantigens. Despite similar response rates, MHC-II reliant responses were associated with significantly longer durable clinical benefit (Discovery: Median OS=63.6 vs. 34.5 months P=0.0074; Validation: Median OS=37.5 vs. 33.1 months, P=0.040). Characteristics of the tumor immune microenvironment reflected MHC neoantigen reliance, and analysis of immune checkpoints revealed LAG3 as a potential target in MHC-II but not MHC-I reliant responses. This study highlights the value of interpretable machine learning models in elucidating the biological basis of therapy responses.
RESUMO
Immune Checkpoint Blockade (ICB) has revolutionized cancer treatment, however mechanisms determining patient response remain poorly understood. Here we used machine learning to predict ICB response from germline and somatic biomarkers and interpreted the learned model to uncover putative mechanisms driving superior outcomes. Patients with higher T follicular helper infiltrates were robust to defects in the class-I Major Histocompatibility Complex (MHC-I). Further investigation uncovered different ICB responses in MHC-I versus MHC-II neoantigen reliant tumors across patients. Despite similar response rates, MHC-II reliant responses were associated with significantly longer durable clinical benefit (Discovery: Median OS=63.6 vs. 34.5 months P=0.0074; Validation: Median OS=37.5 vs. 33.1 months, P=0.040). Characteristics of the tumor immune microenvironment reflected MHC neoantigen reliance, and analysis of immune checkpoints revealed LAG3 as a potential target in MHC-II but not MHC-I reliant responses. This study highlights the value of interpretable machine learning models in elucidating the biological basis of therapy responses.
RESUMO
Polygenic risk score (PRS) predictions often show bias toward the population of available genome-wide association studies (GWASs), which is typically of European ancestry. This study aimed to assess the performance differences of ancestry-specific PRS and test the implementation of multi-ancestry PRS to enhance the generalizability of low-density lipoprotein (LDL) cholesterol predictions in the East Asian (EAS) population. In this study, we computed ancestry-specific and multi-ancestry PRSs for LDL using data obtained from the Global Lipid Genetics Consortium, while accounting for population-specific linkage disequilibrium patterns using the PRS-CSx method in the United Kingdom Biobank dataset (UKB, n = 423,596) and Taiwan Biobank dataset (TWB, n = 68,978). Population-specific PRSs were able to predict LDL levels better within the target population, whereas multi-ancestry PRSs were more generalizable. In the TWB dataset, covariate-adjusted R 2 values were 9.3% for ancestry-specific PRS, 6.7% for multi-ancestry PRS, and 4.5% for European-specific PRS. Similar trends (8.6%, 7.8%, and 6.2%) were observed in the smaller EAS population of the UKB (n = 1,480). Consistent with R 2 values, PRS stratification in EAS regions (TWB) effectively captured a heterogenous variability in LDL blood cholesterol levels across PRS strata. The mean difference in LDL levels between the lowest and highest EAS-specific PRS (EAS_PRS) deciles was 0.82, compared to 0.59 for European-specific PRS (EUR_PRS) and 0.76 for multi-ancestry PRS. Notably, the mean LDL values in the top decile of multi-ancestry PRS were comparable to those of EAS_PRS (3.543 vs. 3.541, p = 0.86). Our analysis of the PRS prediction model for LDL cholesterol further supports the issue of PRS generalizability across populations. Our targeted analysis of the EAS population revealed that integrating non-European genotyping data with a powerful European-based GWAS can enhance the generalizability of LDL PRS.
RESUMO
The selection of peptides presented by MHC molecules is crucial for antigen discovery. Previously, several predictors have shown impressive performance on binding affinity. However, the decisive MHC residues and their relation to the selection of binding peptides are still unrevealed. Here, we connected HLA alleles with binding motifs via our deep learning-based framework, MHCfovea. MHCfovea expanded the knowledge of MHC-I-binding motifs from 150 to 13,008 alleles. After clustering N-terminal and C-terminal sub-motifs on both observed and unobserved alleles, MHCfovea calculated the hyper-motifs and the corresponding allele signatures on the important positions to disclose the relation between binding motifs and MHC-I sequences. MHCfovea delivered 32 pairs of hyper-motifs and allele signatures (HLA-A: 13, HLA-B: 12, and HLA-C: 7). The paired hyper-motifs and allele signatures disclosed the critical polymorphic residues that determine the binding preference, which are believed to be valuable for antigen discovery and vaccine design when allele specificity is concerned.
Assuntos
Alelos , Aprendizado Profundo , Genes MHC Classe I/genética , Peptídeos/química , Humanos , Ligação ProteicaRESUMO
BACKGROUND: Severe sepsis and septic shock are still the leading causes of death in Intensive Care Units (ICUs), and timely diagnosis is crucial for treatment outcomes. The progression of electronic medical records (EMR) offers the possibility of storing a large quantity of clinical data that can facilitate the development of artificial intelligence (AI) in medicine. However, several difficulties, such as poor structure and heterogenicity of the raw EMR data, are encountered when introducing AI with ICU data. Labor-intensive work, including manual data entry, personal medical records sorting, and laboratory results interpretation may hinder the progress of AI. In this article, we introduce the developing of an AI algorithm designed for sepsis diagnosis using pre-selected features; and compare the performance of the AI algorithm with SOFA score based diagnostic method. MATERIALS AND METHODS: This is a prospective open-label cohort study. A specialized EMR, named TED_ICU, was implemented for continuous data recording. One hundred six clinical features relevant to sepsis diagnosis were selected prospectively. A labeling work to allocate SEPSIS or NON_SEPSIS status for each ICU patient was performed by the in-charge intensivist according to SEPSIS-3 criteria, along with the automatic recording of selected features every day by TED_ICU. Afterward, we use de-identified data to develop the AI algorithm. Several machine learning methods were evaluated using 5-fold cross-validation, and XGBoost, a decision-tree based algorithm was adopted for our AI algorithm development due to best performance. RESULTS: The study was conducted between August 2018 and December 2018 for the first stage of analysis. We collected 1588 instances, including 444 SEPSIS and 1144 NON-SEPSIS, from 434 patients. The 434 patients included 259 (59.6%) male patients and 175 female patients. The mean age was 67.6-year-old, and the mean APACHE II score was 13.8. The SEPSIS cohort had a higher SOFA score and increased use of organ support treatment. The AI algorithm was developed with a shuffle method using 80% of the instances for training and 20% for testing. The established AI algorithm achieved the following: accuracy = 82% ± 1%; sensitivity = 65% ± 5%; specificity = 88% ± 2%; precision = 67% ± 3%; and F1 = 0.66 ± 0.02. The area under the receiver operating characteristic curve (AUROC) was approximately 0.89. The SOFA score was used on the same 1588 instances for sepsis diagnosis, and the result was inferior to our AI algorithm (AUROC = 0.596). CONCLUSION: Using real-time data, collected by EMR, from the ICU daily practice, our AI algorithm established with pre-selected features and XGBoost can provide a timely diagnosis of sepsis with an accuracy greater than 80%. AI algorithm also outperforms the SOFA score in sepsis diagnosis and exhibits practicality as clinicians can deploy appropriate treatment earlier. The early and precise response of this AI algorithm will result in cost reduction, outcome improvement, and benefit for healthcare systems, medical staff, and patients as well.