Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 231
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Bioinformatics ; 40(6)2024 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-38775719

RESUMEN

MOTIVATION: In predicting HIV therapy outcomes, a critical clinical question is whether using historical information can enhance predictive capabilities compared with current or latest available data analysis. This study analyses whether historical knowledge, which includes viral mutations detected in all genotypic tests before therapy, their temporal occurrence, and concomitant viral load measurements, can bring improvements. We introduce a method to weigh mutations, considering the previously enumerated factors and the reference mutation-drug Stanford resistance tables. We compare a model encompassing history (H) with one not using this information (NH). RESULTS: The H-model demonstrates superior discriminative ability, with a higher ROC-AUC score (76.34%) than the NH-model (74.98%). Wilcoxon test results confirm significant improvement of predictive accuracy for treatment outcomes through incorporating historical information. The increased performance of the H-model might be attributed to its consideration of latent HIV reservoirs, probably obtained when leveraging historical information. The findings emphasize the importance of temporal dynamics in acquiring mutations. However, our result also shows that prediction accuracy remains relatively high even when no historical information is available. AVAILABILITY AND IMPLEMENTATION: This analysis was conducted using the Euresist Integrated DataBase (EIDB). For further validation, we encourage reproducing this study with the latest release of the EIDB, which can be accessed upon request through the Euresist Network.


Asunto(s)
Infecciones por VIH , VIH-1 , Mutación , VIH-1/genética , Humanos , Infecciones por VIH/tratamiento farmacológico , Infecciones por VIH/virología , Farmacorresistencia Viral/genética , Carga Viral , Fármacos Anti-VIH/uso terapéutico , Fármacos Anti-VIH/farmacología , Resultado del Tratamiento
2.
Immunity ; 45(5): 1148-1161, 2016 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-27851915

RESUMEN

The impact of epigenetics on the differentiation of memory T (Tmem) cells is poorly defined. We generated deep epigenomes comprising genome-wide profiles of DNA methylation, histone modifications, DNA accessibility, and coding and non-coding RNA expression in naive, central-, effector-, and terminally differentiated CD45RA+ CD4+ Tmem cells from blood and CD69+ Tmem cells from bone marrow (BM-Tmem). We observed a progressive and proliferation-associated global loss of DNA methylation in heterochromatic parts of the genome during Tmem cell differentiation. Furthermore, distinct gradually changing signatures in the epigenome and the transcriptome supported a linear model of memory development in circulating T cells, while tissue-resident BM-Tmem branched off with a unique epigenetic profile. Integrative analyses identified candidate master regulators of Tmem cell differentiation, including the transcription factor FOXP1. This study highlights the importance of epigenomic changes for Tmem cell biology and demonstrates the value of epigenetic data for the identification of lineage regulators.


Asunto(s)
Linfocitos T CD4-Positivos/inmunología , Diferenciación Celular/inmunología , Epigénesis Genética/inmunología , Epigenómica/métodos , Memoria Inmunológica/inmunología , Femenino , Citometría de Flujo , Perfilación de la Expresión Génica/métodos , Humanos , Aprendizaje Automático , Reacción en Cadena de la Polimerasa , Transcriptoma
3.
J Infect Dis ; 2024 Jan 17.
Artículo en Inglés | MEDLINE | ID: mdl-38230877

RESUMEN

BACKGROUND: Torque Teno Virus (TTV) is a non-enveloped, circular single-strand DNA virus and part of the human virome. The replication of TTV was related to the immune status in patients treated with immunosuppressive drugs after organ transplantation. We hypothesize that TTV load could be an additional marker for immune function in people living with HIV (PLWH). METHODS: In this analysis serum samples of PLWH from the RESINA multicenter cohort were reanalysed for TTV. Investigated clinical and epidemiological parameters included Pegivirus (HPgV) load, age, sex, HIV load, CD4+ cell count (CDC 1, 2, 3) and CDC clinical stages (1993 CDC classification system, A, B, C) before initiation of antiretroviral treatment. Regression analysis was used to detect possible associations among parameters. RESULTS: Our analysis confirmed TTV as a strong predictor of CD4+ cell count and CDC class 3. This relationship was used to propose a first classification of TTV load in regard to clinical stage. We found no association with clinical CDC stages A, B and C. HPgV load was inversely correlated with HIV load but not TTV load. CONCLUSIONS: TTV load was associated with immunodeficiency in PLWH. Neither TTV- nor HIV load were predictive for the clinical categories of HIV infection.

4.
J Med Virol ; 95(1): e28389, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36484375

RESUMEN

Human immunodeficiency virus (HIV) can develop resistance to all antiretroviral drugs. Multidrug resistance, however, is a rare event in modern HIV treatment, but can be life-threatening, particular in patients with very long therapy histories and in areas with limited access to novel drugs. To understand the evolution of multidrug resistance, we analyzed the EuResist database to uncover the accumulation of mutations over time. We hypothesize that the accumulation of resistance mutations is not acquired simultaneously and randomly across viral genotypes but rather tends to follow a predetermined order. The knowledge of this order might help to elucidate potential mechanisms of multidrug resistance. Our evolutionary model shows an almost monotonic increase of resistance with each acquired mutation, including less well-known nucleoside reverse transcriptase (RT) inhibitor-related mutations like K223Q, L228H, and Q242H. Mutations within the integrase (IN) (T97A, E138A/K G140S, Q148H, N155H) indicate high probability of multidrug resistance. Hence, these IN mutations also tend to be observed together with mutations in the protease (PR) and RT. We followed up with an analysis of the mutation-specific error rates of our model given the data. We identified several mutations with unusual rates (PR: M41L, L33F, IN: G140S). This could imply the existence of previously unknown virus variants in the viral quasispecies. In conclusion, our bioinformatics model supports the analysis and understanding of multidrug resistance.


Asunto(s)
Farmacorresistencia Viral , Infecciones por VIH , VIH-1 , Humanos , Fármacos Anti-VIH/farmacología , Fármacos Anti-VIH/uso terapéutico , Farmacorresistencia Viral/genética , Genotipo , Infecciones por VIH/tratamiento farmacológico , VIH-1/genética , Mutación , Inhibidores de la Transcriptasa Inversa/farmacología , Inhibidores de la Transcriptasa Inversa/uso terapéutico
5.
Nucleic Acids Res ; 48(8): e46, 2020 05 07.
Artículo en Inglés | MEDLINE | ID: mdl-32103242

RESUMEN

DNA methylation is an epigenetic mark with important regulatory roles in cellular identity and can be quantified at base resolution using bisulfite sequencing. Most studies are limited to the average DNA methylation levels of individual CpGs and thus neglect heterogeneity within the profiled cell populations. To assess this within-sample heterogeneity (WSH) several window-based scores that quantify variability in DNA methylation in sequencing reads have been proposed. We performed the first systematic comparison of four published WSH scores based on simulated and publicly available datasets. Moreover, we propose two new scores and provide guidelines for selecting appropriate scores to address cell-type heterogeneity, cellular contamination and allele-specific methylation. Most of the measures were sensitive in detecting DNA methylation heterogeneity in these scenarios, while we detected differences in susceptibility to technical bias. Using recently published DNA methylation profiles of Ewing sarcoma samples, we show that DNA methylation heterogeneity provides information complementary to the DNA methylation level. WSH scores are powerful tools for estimating variance in DNA methylation patterns and have the potential for detecting novel disease-associated genomic loci not captured by established statistics. We provide an R-package implementing the WSH scores for integration into analysis workflows.


Asunto(s)
Metilación de ADN , Análisis de Secuencia de ADN , Humanos , Sarcoma de Ewing/genética
6.
BMC Public Health ; 22(1): 1167, 2022 06 11.
Artículo en Inglés | MEDLINE | ID: mdl-35690802

RESUMEN

BACKGROUND: Lower respiratory tract infections are among the main causes of death. Although there are many respiratory viruses, diagnostic efforts are focused mainly on influenza. The Respiratory Viruses Network (RespVir) collects infection data, primarily from German university hospitals, for a high diversity of infections by respiratory pathogens. In this study, we computationally analysed a subset of the RespVir database, covering 217,150 samples tested for 17 different viral pathogens in the time span from 2010 to 2019. METHODS: We calculated the prevalence of 17 respiratory viruses, analysed their seasonality patterns using information-theoretic measures and agglomerative clustering, and analysed their propensity for dual infection using a new metric dubbed average coinfection exclusion score (ACES). RESULTS: After initial data pre-processing, we retained 206,814 samples, corresponding to 1,408,657 performed tests. We found that Influenza viruses were reported for almost the half of all infections and that they exhibited the highest degree of seasonality. Coinfections of viruses are frequent; the most prevalent coinfection was rhinovirus/bocavirus and most of the virus pairs had a positive ACES indicating a tendency to exclude each other regarding infection. CONCLUSIONS: The analysis of respiratory viruses dynamics in monoinfection and coinfection contributes to the prevention, diagnostic, treatment, and development of new therapeutics. Data obtained from multiplex testing is fundamental for this analysis and should be prioritized over single pathogen testing.


Asunto(s)
Coinfección , Infecciones del Sistema Respiratorio , Virosis , Virus , Coinfección/epidemiología , Humanos , Lactante , Rhinovirus , Virosis/epidemiología
7.
Nucleic Acids Res ; 47(20): 10580-10596, 2019 11 18.
Artículo en Inglés | MEDLINE | ID: mdl-31584093

RESUMEN

Chromatin accessibility maps are important for the functional interpretation of the genome. Here, we systematically analysed assay specific differences between DNase I-seq, ATAC-seq and NOMe-seq in a side by side experimental and bioinformatic setup. We observe that most prominent nucleosome depleted regions (NDRs, e.g. in promoters) are roboustly called by all three or at least two assays. However, we also find a high proportion of assay specific NDRs that are often 'called' by only one of the assays. We show evidence that these assay specific NDRs are indeed genuine open chromatin sites and contribute important information for accurate gene expression prediction. While technically ATAC-seq and DNase I-seq provide a superb high NDR calling rate for relatively low sequencing costs in comparison to NOMe-seq, NOMe-seq singles out for its genome-wide coverage allowing to not only detect NDRs but also endogenous DNA methylation and as we show here genome wide segmentation into heterochromatic B domains and local phasing of nucleosomes outside of NDRs. In summary, our comparisons strongly suggest to consider assay specific differences for the experimental design and for generalized and comparative functional interpretations.


Asunto(s)
Secuenciación de Inmunoprecipitación de Cromatina/métodos , Secuenciación de Inmunoprecipitación de Cromatina/normas , Células Hep G2 , Humanos , Nucleosomas/química , Nucleosomas/metabolismo , Regiones Promotoras Genéticas
8.
BMC Public Health ; 21(1): 1178, 2021 06 21.
Artículo en Inglés | MEDLINE | ID: mdl-34154549

RESUMEN

BACKGROUND: Non-pharmaceutical measures to control the spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) should be carefully tuned as they can impose a heavy social and economic burden. To quantify and possibly tune the efficacy of these anti-SARS-CoV-2 measures, we have devised indicators based on the abundant historic and current prevalence data from other respiratory viruses. METHODS: We obtained incidence data of 17 respiratory viruses from hospitalized patients and outpatients collected by 37 clinics and laboratories between 2010-2020 in Germany. With a probabilistic model for Bayes inference we quantified prevalence changes of the different viruses between months in the pre-pandemic period 2010-2019 and the corresponding months in 2020, the year of the pandemic with noninvasive measures of various degrees of stringency. RESULTS: We discovered remarkable reductions δ in rhinovirus (RV) prevalence by about 25% (95% highest density interval (HDI) [-0.35,-0.15]) in the months after the measures against SARS-CoV-2 were introduced in Germany. In the months after the measures began to ease, RV prevalence increased to low pre-pandemic levels, e.g. in August 2020 δ=-0.14 (95% HDI [-0.28,0.12]). CONCLUSIONS: RV prevalence is negatively correlated with the stringency of anti-SARS-CoV-2 measures with only a short time delay. This result suggests that RV prevalence could possibly be an indicator for the efficiency for these measures. As RV is ubiquitous at higher prevalence than SARS-CoV-2 or other emerging respiratory viruses, it could reflect the efficacy of noninvasive measures better than such emerging viruses themselves with their unevenly spreading clusters.


Asunto(s)
COVID-19 , Rhinovirus , Teorema de Bayes , Alemania , Humanos , Prevalencia , SARS-CoV-2
9.
Nucleic Acids Res ; 46(W1): W271-W277, 2018 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-29718426

RESUMEN

Identifying resistance to antiretroviral drugs is crucial for ensuring the successful treatment of patients infected with viruses such as human immunodeficiency virus (HIV) or hepatitis C virus (HCV). In contrast to Sanger sequencing, next-generation sequencing (NGS) can detect resistance mutations in minority populations. Thus, genotypic resistance testing based on NGS data can offer novel, treatment-relevant insights. Since existing web services for analyzing resistance in NGS samples are subject to long processing times and follow strictly rules-based approaches, we developed geno2pheno[ngs-freq], a web service for rapidly identifying drug resistance in HIV-1 and HCV samples. By relying on frequency files that provide the read counts of nucleotides or codons along a viral genome, the time-intensive step of processing raw NGS data is eliminated. Once a frequency file has been uploaded, consensus sequences are generated for a set of user-defined prevalence cutoffs, such that the constructed sequences contain only those nucleotides whose codon prevalence exceeds a given cutoff. After locally aligning the sequences to a set of references, resistance is predicted using the well-established approaches of geno2pheno[resistance] and geno2pheno[hcv]. geno2pheno[ngs-freq] can assist clinical decision making by enabling users to explore resistance in viral populations with different abundances and is freely available at http://ngs.geno2pheno.org.


Asunto(s)
Farmacorresistencia Viral/genética , Infecciones por VIH/tratamiento farmacológico , VIH-1/genética , Programas Informáticos , Genoma Viral/genética , Genotipo , Infecciones por VIH/genética , Infecciones por VIH/virología , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Mutación
10.
Nucleic Acids Res ; 45(1): 54-66, 2017 01 09.
Artículo en Inglés | MEDLINE | ID: mdl-27899623

RESUMEN

The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively.


Asunto(s)
Cromatina/metabolismo , ADN/genética , Regulación de la Expresión Génica , Histonas/genética , Aprendizaje Automático , Factores de Transcripción/genética , Algoritmos , Sitios de Unión , Linfocitos T CD4-Positivos/citología , Linfocitos T CD4-Positivos/metabolismo , Línea Celular , Línea Celular Tumoral , Cromatina/química , Ensamble y Desensamble de Cromatina , ADN/metabolismo , Células Hep G2 , Hepatocitos/citología , Hepatocitos/metabolismo , Histonas/metabolismo , Células Madre Embrionarias Humanas/citología , Células Madre Embrionarias Humanas/metabolismo , Humanos , Células K562 , Especificidad de Órganos , Cultivo Primario de Células , Análisis de Componente Principal , Unión Proteica , Factores de Transcripción/metabolismo
11.
BMC Med ; 16(1): 150, 2018 08 27.
Artículo en Inglés | MEDLINE | ID: mdl-30145981

RESUMEN

BACKGROUND: Personalized, precision, P4, or stratified medicine is understood as a medical approach in which patients are stratified based on their disease subtype, risk, prognosis, or treatment response using specialized diagnostic tests. The key idea is to base medical decisions on individual patient characteristics, including molecular and behavioral biomarkers, rather than on population averages. Personalized medicine is deeply connected to and dependent on data science, specifically machine learning (often named Artificial Intelligence in the mainstream media). While during recent years there has been a lot of enthusiasm about the potential of 'big data' and machine learning-based solutions, there exist only few examples that impact current clinical practice. The lack of impact on clinical practice can largely be attributed to insufficient performance of predictive models, difficulties to interpret complex model predictions, and lack of validation via prospective clinical trials that demonstrate a clear benefit compared to the standard of care. In this paper, we review the potential of state-of-the-art data science approaches for personalized medicine, discuss open challenges, and highlight directions that may help to overcome them in the future. CONCLUSIONS: There is a need for an interdisciplinary effort, including data scientists, physicians, patient advocates, regulatory agencies, and health insurance organizations. Partially unrealistic expectations and concerns about data science-based solutions need to be better managed. In parallel, computational methods must advance more to provide direct benefit to clinical practice.


Asunto(s)
Medicina de Precisión/métodos , Humanos , Estudios Prospectivos
12.
Bioinformatics ; 33(13): 2063-2064, 2017 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-28334349

RESUMEN

MOTIVATION: While large amounts of epigenomic data are publicly available, their retrieval in a form suitable for downstream analysis is a bottleneck in current research. The DeepBlue Epigenomic Data Server provides a powerful interface and API for filtering, transforming, aggregating and downloading data from several epigenomic consortia. RESULTS: To make public epigenomic data conveniently available for analysis in R, we developed an R/Bioconductor package that connects to the DeepBlue Epigenomic Data Server, enabling users to quickly gather and transform epigenomic data from selected experiments for analysis in the Bioconductor ecosystem. AVAILABILITY AND IMPLEMENTATION: http://deepblue.mpi-inf.mpg.de/R . REQUIREMENTS: R 3.3, Bioconductor 3.4. CONTACT: felipe.albrecht@mpi-inf.mpg.de or markus.list@mpi-inf.mpg.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Epigenómica/métodos , Programas Informáticos , Humanos
13.
Nucleic Acids Res ; 44(W1): W581-6, 2016 Jul 08.
Artículo en Inglés | MEDLINE | ID: mdl-27084938

RESUMEN

Large amounts of epigenomic data are generated under the umbrella of the International Human Epigenome Consortium, which aims to establish 1000 reference epigenomes within the next few years. These data have the potential to unravel the complexity of epigenomic regulation. However, their effective use is hindered by the lack of flexible and easy-to-use methods for data retrieval. Extracting region sets of interest is a cumbersome task that involves several manual steps: identifying the relevant experiments, downloading the corresponding data files and filtering the region sets of interest. Here we present the DeepBlue Epigenomic Data Server, which streamlines epigenomic data analysis as well as software development. DeepBlue provides a comprehensive programmatic interface for finding, selecting, filtering, summarizing and downloading region sets. It contains data from four major epigenome projects, namely ENCODE, ROADMAP, BLUEPRINT and DEEP. DeepBlue comes with a user manual, examples and a well-documented application programming interface (API). The latter is accessed via the XML-RPC protocol supported by many programming languages. To demonstrate usage of the API and to enable convenient data retrieval for non-programmers, we offer an optional web interface. DeepBlue can be openly accessed at http://deepblue.mpi-inf.mpg.de.


Asunto(s)
Epigenómica , Genoma , Almacenamiento y Recuperación de la Información , Internet , Programas Informáticos , Humanos , Lenguajes de Programación , Interfaz Usuario-Computador
14.
Mol Biol Evol ; 33(10): 2720-34, 2016 10.
Artículo en Inglés | MEDLINE | ID: mdl-27436007

RESUMEN

The widely used model for evolutionary relationships is a bifurcating tree with all taxa/observations placed at the leaves. This is not appropriate if the taxa have been densely sampled across evolutionary time and may be in a direct ancestral relationship, or if there is not enough information to fully resolve all the branching points in the evolutionary tree. In this article, we present a fast distance-based agglomeration method called family-joining (FJ) for constructing so-called generally labeled trees in which taxa may be placed at internal vertices and the tree may contain polytomies. FJ constructs such trees on the basis of pairwise distances and a distance threshold. We tested three methods for threshold selection, FJ-AIC, FJ-BIC, and FJ-CV, which minimize Akaike information criterion, Bayesian information criterion, and cross-validation error, respectively. When compared with related methods on simulated data, FJ-BIC was among the best at reconstructing the correct tree across a wide range of simulation scenarios. FJ-BIC was applied to HIV sequences sampled from individuals involved in a known transmission chain. The FJ-BIC tree was found to be compatible with almost all transmission events. On average, internal branches in the FJ-BIC tree have higher bootstrap support than branches in the leaf-labeled bifurcating tree constructed using RAxML. 36% and 25% of the internal branches in the FJ-BIC tree and RAxML tree, respectively, have bootstrap support greater than 70%. To the best of our knowledge the method presented here is the first attempt at modeling evolutionary relationships using generally labeled trees.


Asunto(s)
Algoritmos , Modelos Genéticos , Filogenia , Estadística como Asunto/métodos , Teorema de Bayes , Evolución Biológica , Simulación por Computador
16.
Nat Methods ; 11(11): 1138-1140, 2014 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-25262207

RESUMEN

RnBeads is a software tool for large-scale analysis and interpretation of DNA methylation data, providing a user-friendly analysis workflow that yields detailed hypertext reports (http://rnbeads.mpi-inf.mpg.de/). Supported assays include whole-genome bisulfite sequencing, reduced representation bisulfite sequencing, Infinium microarrays and any other protocol that produces high-resolution DNA methylation data. Notable applications of RnBeads include the analysis of epigenome-wide association studies and epigenetic biomarker discovery in cancer cohorts.


Asunto(s)
Metilación de ADN , ADN/química , Epigénesis Genética , Genoma Humano , Programas Informáticos , Secuencia de Bases , Humanos , Análisis de Secuencia de ADN/métodos , Sulfitos/química
17.
J Chem Phys ; 146(1): 014105, 2017 Jan 07.
Artículo en Inglés | MEDLINE | ID: mdl-28063433

RESUMEN

New exact equations are derived for the terms contributing to the binding free energy (ΔG0) of a ligand-receptor pair using our recently introduced formalism which we here call perturbation-divergence formalism (PDF). Specifically, ΔG0 equals the sum of the average of the perturbation (pertaining to new interactions) and additional dissipative terms. The average of the perturbation includes the sum of the average receptor-ligand interactions and the average of the change of solvation energies upon association. The Kullback-Leibler (KL) divergence quantifies the energetically dissipative terms, which are due to the configurational changes and, using the chain rule of KL divergence, can be decomposed into (i) dissipation due to limiting the external liberation (translation and rotation) of the ligand relative to the receptor and (ii) dissipation due to conformational (internal) changes inside the receptor and the ligand. We also identify all exactly canceling energetic terms which do not contribute to ΔG0. Furthermore, the PDF provides a new approach towards dimensionality reduction in the representation of the association process and towards relating the dynamic (high dimensional) with the thermodynamic (one-dimensional) changes.

19.
Retrovirology ; 13(1): 85, 2016 Dec 20.
Artículo en Inglés | MEDLINE | ID: mdl-27998283

RESUMEN

BACKGROUND: CCR5-coreceptor antagonists can be used for treating HIV-2 infected individuals. Before initiating treatment with coreceptor antagonists, viral coreceptor usage should be determined to ensure that the virus can use only the CCR5 coreceptor (R5) and cannot evade the drug by using the CXCR4 coreceptor (X4-capable). However, until now, no online tool for the genotypic identification of HIV-2 coreceptor usage had been available. Furthermore, there is a lack of knowledge on the determinants of HIV-2 coreceptor usage. Therefore, we developed a data-driven web service for the prediction of HIV-2 coreceptor usage from the V3 loop of the HIV-2 glycoprotein and used the tool to identify novel discriminatory features of X4-capable variants. RESULTS: Using 10 runs of tenfold cross validation, we selected a linear support vector machine (SVM) as the model for geno2pheno[coreceptor-hiv2], because it outperformed the other SVMs with an area under the ROC curve (AUC) of 0.95. We found that SVMs were highly accurate in identifying HIV-2 coreceptor usage, attaining sensitivities of 73.5% and specificities of 96% during tenfold nested cross validation. The predictive performance of SVMs was not significantly different (p value 0.37) from an existing rules-based approach. Moreover, geno2pheno[coreceptor-hiv2] achieved a predictive accuracy of 100% and outperformed the existing approach on an independent data set containing nine new isolates with corresponding phenotypic measurements of coreceptor usage. geno2pheno[coreceptor-hiv2] could not only reproduce the established markers of CXCR4-usage, but also revealed novel markers: the substitutions 27K, 15G, and 8S were significantly predictive of CXCR4 usage. Furthermore, SVMs trained on the amino-acid sequences of the V1 and V2 loops were also quite accurate in predicting coreceptor usage (AUCs of 0.84 and 0.65, respectively). CONCLUSIONS: In this study, we developed geno2pheno[coreceptor-hiv2], the first online tool for the prediction of HIV-2 coreceptor usage from the V3 loop. Using our method, we identified novel amino-acid markers of X4-capable variants in the V3 loop and found that HIV-2 coreceptor usage is also influenced by the V1/V2 region. The tool can aid clinicians in deciding whether coreceptor antagonists such as maraviroc are a treatment option and enables epidemiological studies investigating HIV-2 coreceptor usage. geno2pheno[coreceptor-hiv2] is freely available at http://coreceptor-hiv2.geno2pheno.org .


Asunto(s)
Técnicas de Genotipaje , Proteína gp120 de Envoltorio del VIH/genética , Infecciones por VIH/virología , VIH-2/genética , Fragmentos de Péptidos/genética , Receptores CXCR4/metabolismo , Máquina de Vectores de Soporte , Antagonistas de los Receptores CCR5/uso terapéutico , Sistemas de Apoyo a Decisiones Clínicas , Genotipo , Proteína gp120 de Envoltorio del VIH/química , Infecciones por VIH/tratamiento farmacológico , VIH-2/metabolismo , Humanos , Internet , Fragmentos de Péptidos/química , Receptores CCR5/metabolismo
20.
Bioinformatics ; 31(4): 616-7, 2015 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-25644272

RESUMEN

UNLABELLED: Speed is of the essence in combating Ebola; thus, computational approaches should form a significant component of Ebola research. As for the development of any modern drug, computational biology is uniquely positioned to contribute through comparative analysis of the genome sequences of Ebola strains and three-dimensional protein modeling. Other computational approaches to Ebola may include large-scale docking studies of Ebola proteins with human proteins and with small-molecule libraries, computational modeling of the spread of the virus, computational mining of the Ebola literature and creation of a curated Ebola database. Taken together, such computational efforts could significantly accelerate traditional scientific approaches. In recognition of the need for important and immediate solutions from the field of computational biology against Ebola, the International Society for Computational Biology (ISCB) announces a prize for an important computational advance in fighting the Ebola virus. ISCB will confer the ISCB Fight against Ebola Award, along with a prize of US$2000, at its July 2016 annual meeting (ISCB Intelligent Systems for Molecular Biology 2016, Orlando, FL). CONTACT: dkovats@iscb.org or rost@in.tum.de.


Asunto(s)
Distinciones y Premios , Investigación Biomédica , Biología Computacional , Fiebre Hemorrágica Ebola/virología , Sociedades Científicas , Bases de Datos Factuales , Ebolavirus/genética , Ebolavirus/patogenicidad , Humanos , Agencias Internacionales
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA