Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 42
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Bioinformatics ; 39(1)2023 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-36440918

RESUMEN

SUMMARY: It has been observed in different kinds of networks, such as social or biological ones, a typical behavior inspired by the general principle 'similarity breeds connections'. These networks are defined as homophilic as nodes belonging to the same class preferentially interact with each other. In this work, we present HONTO (HOmophily Network TOol), a user-friendly open-source Python3 package designed to evaluate and analyze homophily in complex networks. The tool takes in input from the network along with a partition of its nodes into classes and yields a matrix whose entries are the homophily/heterophily z-score values. To complement the analysis, the tool also provides z-score values of nodes that do not interact with any other node of the same class. Homophily/heterophily z-scores values are presented as a heatmap allowing a visual at-a-glance interpretation of results. AVAILABILITY AND IMPLEMENTATION: Tool's source code is available at https://github.com/cumbof/honto under the MIT license, installable as a package from PyPI (pip install honto) and conda-forge (conda install -c conda-forge honto), and has a wrapper for the Galaxy platform available on the official Galaxy ToolShed (Blankenberg et al., 2014) at https://toolshed.g2.bx.psu.edu/view/fabio/honto.


Asunto(s)
Programas Informáticos , Humanos
2.
Int J Mol Sci ; 25(9)2024 May 03.
Artículo en Inglés | MEDLINE | ID: mdl-38732207

RESUMEN

Prediction of binding sites for transcription factors is important to understand how the latter regulate gene expression and how this regulation can be modulated for therapeutic purposes. A consistent number of references address this issue with different approaches, Machine Learning being one of the most successful. Nevertheless, we note that many such approaches fail to propose a robust and meaningful method to embed the genetic data under analysis. We try to overcome this problem by proposing a bidirectional transformer-based encoder, empowered by bidirectional long-short term memory layers and with a capsule layer responsible for the final prediction. To evaluate the efficiency of the proposed approach, we use benchmark ChIP-seq datasets of five cell lines available in the ENCODE repository (A549, GM12878, Hep-G2, H1-hESC, and Hela). The results show that the proposed method can predict TFBS within the five different cell lines very well; moreover, cross-cell predictions provide satisfactory results as well. Experiments conducted across cell lines are reinforced by the analysis of five additional lines used only to test the model trained using the others. The results confirm that prediction across cell lines remains very high, allowing an extensive cross-transcription factor analysis to be performed from which several indications of interest for molecular biology may be drawn.


Asunto(s)
Aprendizaje Profundo , Factores de Transcripción , Humanos , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , Sitios de Unión , Biología Computacional/métodos , Células HeLa , Unión Proteica , Secuenciación de Inmunoprecipitación de Cromatina/métodos , Línea Celular
3.
J Theor Biol ; 526: 110806, 2021 10 07.
Artículo en Inglés | MEDLINE | ID: mdl-34111456

RESUMEN

The genetic code consists in a set of rules used by living organisms to translate genomic information, contained in genes, into proteins; every amino acid is coded by a set of nucleotide triplets or codons. We refer to codon choice as the choice of a given codon, among the synonymous available ones, to code a given amino acid occurrence. The aim of this work is to shed light on the pivotal role that codon choice plays in regulating the timing of translation process, through patterns of low and high translation efficiency codons. A translation efficiency value, namely codon score, was associated to each codon through a formula based on the number of tRNAs gene copies able to translate the given codon. By using codon scores, those k-mers of the proteome of Saccharomyces cerevisiae, showing low and high average scores associated to the correspondent codons, were computed. The analysis of distribution of both low and high average score k-mers clearly showed that, in particular for higher k-mer size, they occur much more than expected, strongly suggesting a functional role. Moreover performed analysis highlighted that significant k-mers preferentially occur in some protein folding classes, such as those containing alpha helices, and in some functional classes mainly involved in transcription process while codon choice seems to have a very low impact in proteins associated to energy production and metabolism. The relationship between secondary structures and significant k-mers was investigated, revealing that low score k-mers tend to preferentially occur in coil or close to coil regions and almost never in beta sheets, while high score k-mers preferentially occur in alpha helices, avoiding beta sheets, and close to coil regions for high k-mer sizes. Finally the analysis of distribution of significant codon patterns along the proteins highlighted a relevant enrichment of low average score k-mers at the 5' end of protein-coding sequences in the region from 5th to 25th amino acid.


Asunto(s)
Proteínas , Saccharomyces cerevisiae , Codón/genética , Biosíntesis de Proteínas/genética , Pliegue de Proteína , Estructura Secundaria de Proteína , Proteínas/genética , Saccharomyces cerevisiae/genética
4.
Genomics ; 111(6): 1620-1628, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-30453062

RESUMEN

Nucleosomes are not uniformly distributed along DNA and their positioning (termed "nucleosomal landscape") can be derived using data available for several genomes. In this study we analyzed DNA helical rise profiles through a tetranucleotide code, and we defined the nucleosomal landscape of several sequences forming dinucleosomes and of the sequences of huntingtin, myotonic dystrophy type 1 and fragile mental retardation 2 genes, which contained several repeated sequences. We also analyzed the profiles of some sequences interacting with transcription factors or with RNA polymerase II. In the genomes of Cenorhabditis elegans, Mus musculus and Homo sapiens we found profiles with extremely low helical rise values, characteristic of nucleosome free regions. We defined these regions as "holes" and found that their presence correlates with lamina associated domains sequences. Altogether, this study shows that DNA helical rise profile may have a role in gene expression modulation and in shaping chromosomal structure.


Asunto(s)
Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , ADN de Helmintos/genética , ARN Polimerasa II/genética , Factores de Transcripción/genética , Animales , Humanos , Ratones
5.
J Theor Biol ; 391: 13-20, 2016 Feb 21.
Artículo en Inglés | MEDLINE | ID: mdl-26656109

RESUMEN

Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambiguous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one. In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones.


Asunto(s)
Evolución Molecular , Modelos Genéticos , Proteínas/química , Proteínas/genética , Secuencia de Aminoácidos
6.
J Chem Inf Model ; 54(1): 159-68, 2014 Jan 27.
Artículo en Inglés | MEDLINE | ID: mdl-24289204

RESUMEN

The identification of modules in protein structures has major relevance in structural biology, with consequences in protein stability and functional classification, adding new perspectives in drug design. In this work, we present the comparison between a topological (spectral clustering) and a geometrical (k-means) approach to module identification, in the frame of a multiscale analysis of the protein architecture principles. The global consistency of an adjacency matrix based technique (spectral clustering) and a method based on full rank geometrical information (k-means) give a proof-of-concept of the relevance of protein contact networks in structure determination. The peculiar "small-world" character of protein contact graphs is established as well, pointing to average shortest path as a mesoscopic crucial variable to maximize the efficiency of within-molecule signal transmission. The specific nature of protein architecture indicates topological approach as the most proper one to highlight protein functional domains, and two new representations linking sequence and topological role of aminoacids are demonstrated to be of use for protein structural analysis. Here we present a case study regarding azurin, a small copper protein implied in the Pseudomonas aeruginosa respiratory chain. Its pocket molecular shape and its electron transfer function have challenged the method, highlighting its potentiality to catch jointly the structure and function features of protein structures through their decomposition into modules.


Asunto(s)
Modelos Moleculares , Proteínas/química , Azurina/química , Azurina/metabolismo , Biología Computacional , Simulación por Computador , Bases de Datos de Proteínas , Transporte de Electrón , Conformación Proteica , Dominios y Motivos de Interacción de Proteínas , Mapeo de Interacción de Proteínas/estadística & datos numéricos , Pseudomonas aeruginosa/metabolismo
7.
Gene ; 922: 148556, 2024 Sep 05.
Artículo en Inglés | MEDLINE | ID: mdl-38754568

RESUMEN

COVID-19 emergency has pushed the international scientific community to use every resource to combat the spread of the virus, to understand its biology and predict its possible evolution in terms of new variants. Since the first SARS-CoV-2 virus nucleotide and amino acid sequences were made available, information theory was used to study how viral information content was changing over time and then trace the evolution of its mutational landscape. In this work we analyzed SARS-CoV-2 sequences collected mainly in the USA in a period from March 2020 until December 2022 and computed mutation profiles of viral proteins over time through an entropy-based approach using Shannon Entropy and Hellinger distance. This representation allows an at-a-glance view of the mutational landscape of viral proteins over time and can provide new insights on the evolution of the virus from different points of view. Non-structural proteins typically showed flat mutation profiles, characterized by a very low Average mutation Entropy, while accessory and structural proteins showed mostly non uniform and high mutation profiles, often coupled with the predominance of variants. Interestingly NSP2 protein, whose function is currently still debated, falls in the same branch of NSP14 and NSP10 in the phylogenetic tree of mutations constructed through correlations of mutation profiles, suggesting a co-evolution of those proteins and a possible functional link with each other. To the best of our knowledge this is the first study based on a massive amount of data (n = 107,939,973) that analyzes from an entropy point of view the mutational landscape of SARS-CoV-2 over time and depicts a mutational temporal profile of each protein of the virus.


Asunto(s)
COVID-19 , Entropía , Mutación , SARS-CoV-2 , SARS-CoV-2/genética , COVID-19/virología , COVID-19/genética , Humanos , Estados Unidos , Evolución Molecular , Proteínas Virales/genética , Proteínas no Estructurales Virales/genética , Glicoproteína de la Espiga del Coronavirus/genética
8.
J Comput Biol ; 31(5): 416-428, 2024 05.
Artículo en Inglés | MEDLINE | ID: mdl-38687334

RESUMEN

A Coding DNA Sequence (CDS) is a fraction of DNA whose nucleotides are grouped into consecutive triplets called codons, each one encoding an amino acid. Because most amino acids can be encoded by more than one codon, the same amino acid chain can be obtained by a very large number of different CDSs. These synonymous CDSs show different features that, also depending on the organism the transcript is expressed in, could affect translational efficiency and yield. The identification of optimal CDSs with respect to given transcript indicators is in general a challenging task, but it has been observed in recent literature that integer linear programming (ILP) can be a very flexible and efficient way to achieve it. In this article, we add evidence to this observation by proposing a new ILP model that simultaneously optimizes different well-grounded indicators. With this model, we efficiently find solutions that dominate those returned by six existing codon optimization heuristics.


Asunto(s)
Algoritmos , Codón , Modelos Genéticos , Programación Lineal , Codón/genética , Secuencia de Bases/genética , ADN/genética , Biología Computacional/métodos
9.
J Immunol Methods ; 517: 113474, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-37068621

RESUMEN

BACKGROUND: Class I Major Histocompatibility Complex plays a critical role in the adaptive immune response by binding to peptides processed by Proteasome and Transporter associated with antigen processing complex and presenting them on the cell surface to cytotoxic T-cells. Understanding the process of peptide presentation and studying how presented peptides are distributed in the huge space of all potential epitopes could have a dramatic impact in the context of vaccine design, transplantation, autoimmunity, and cancer development. METHODS: In the present work we propose a graph-driven approach to investigate the landscape of both self (human) and viral (254 organisms) peptides presented on cell surface through class I Major Histocompatibility Complex considering specific HLAs. For each considered HLA (N = 89) we designed a network, namely Peptide Hamming Graph, where nodes are peptides predicted to be presented by a given HLA and an edge is set when the Hamming distance between two peptides is equal or smaller than 2 (i.e. the same amino acid occurs in at least 7 positions of the two sequences). RESULTS: Through the analysis of Peptide Hamming Graphs we studied how predicted presented peptides are distributed in the whole configurational space for different HLAs, identifying sets of viral peptides that can constitute a potential target for the immune system. In particular we selected connected components of the graph made exclusively of viral peptides and sets of viral peptides with high node degree interacting exclusively with viral neighbours. CONCLUSIONS: This work constitutes an innovative approach to study potential cytotoxic T-cell epitopes relying on a network approach, overcoming the classical paradigm based on the identification of potential epitopes only considering their features as single peptides. T-cell cross-reactivity plays a focal role for the efficacy of this strategy increasing the probability of recognition, and consequently a stronger immune response, of presented peptides far from self, sharing a common pattern in terms of sequence similarity.


Asunto(s)
Antígenos HLA , Péptidos , Humanos , Presentación de Antígeno , Antígenos de Histocompatibilidad , Epítopos de Linfocito T
10.
Viruses ; 15(5)2023 05 18.
Artículo en Inglés | MEDLINE | ID: mdl-37243274

RESUMEN

SARS-CoV-2 and its many variants have caused a worldwide emergency. Host cells colonised by SARS-CoV-2 present a significantly different gene expression landscape. As expected, this is particularly true for genes that directly interact with virus proteins. Thus, understanding the role that transcription factors can play in driving differential regulation in patients affected by COVID-19 is a focal point to unveil virus infection. In this regard, we have identified 19 transcription factors which are predicted to target human proteins interacting with Spike glycoprotein of SARS-CoV-2. Transcriptomics RNA-Seq data derived from 13 human organs are used to analyse expression correlation between identified transcription factors and related target genes in both COVID-19 patients and healthy individuals. This resulted in the identification of transcription factors showing the most relevant impact in terms of most evident differential correlation between COVID-19 patients and healthy individuals. This analysis has also identified five organs such as the blood, heart, lung, nasopharynx and respiratory tract in which a major effect of differential regulation mediated by transcription factors is observed. These organs are also known to be affected by COVID-19, thereby providing consistency to our analysis. Furthermore, 31 key human genes differentially regulated by the transcription factors in the five organs are identified and the corresponding KEGG pathways and GO enrichment are also reported. Finally, the drugs targeting those 31 genes are also put forth. This in silico study explores the effects of transcription factors on human genes interacting with Spike glycoprotein of SARS-CoV-2 and intends to provide new insights to inhibit the virus infection.


Asunto(s)
COVID-19 , Humanos , COVID-19/genética , SARS-CoV-2 , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Regulación de la Expresión Génica , Glicoproteínas/genética
11.
Phys Rev E ; 108(5-1): 054130, 2023 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-38115426

RESUMEN

Homophily is the principle whereby "similarity breeds connections." We give a quantitative formulation of this principle within networks. Given a network and a labeled partition of its vertices, the vector indexed by each class of the partition, whose entries are the number of edges of the subgraphs induced by the corresponding classes, is viewed as the observed outcome of the random vector described by picking labeled partitions at random among labeled partitions whose classes have the same cardinalities as the given one. This is the recently introduced random coloring model for network homophily. In this perspective, the value of any homophily score Θ, namely, a nondecreasing real-valued function in the sizes of subgraphs induced by the classes of the partition, evaluated at the observed outcome, can be thought of as the observed value of a random variable. Consequently, according to the score Θ, the input network is homophillic at the significance level α whenever the one-sided tail probability of observing a value of Θ at least as extreme as the observed one is smaller than α. Since, as we show, even approximating α is an NP-hard problem, we resort to classical tails inequality to bound α from above. These upper bounds, obtained by specializing Θ, yield a class of quantifiers of network homophily. Computing the upper bounds requires the knowledge of the covariance matrix of the random vector, which was not previously known within the random coloring model. In this paper we close this gap. Interestingly, the matrix depends on the input partition only through the cardinalities of its classes and depends on the network only through its degrees. Furthermore all the covariances have the same sign, and this sign is a graph invariant. Plugging this structure into the bounds yields a meaningful, easy to compute class of indices for measuring network homophily. As demonstrated in real-world network applications, these indices are effective and reliable, and may lead to discoveries that cannot be captured by the current state of the art.

12.
J Chem Inf Model ; 52(2): 474-82, 2012 Feb 27.
Artículo en Inglés | MEDLINE | ID: mdl-22235848

RESUMEN

The analysis of a large database of protein structures by means of topological and shape indexes inspired by complex network and fractal analysis shed light on some organizational principles of proteins. Proteins appear much more similar to "fractal" sponges than to closely packed spheres, casting doubts on the tenability of the hydrophobic core concept. Principal component analysis highlighted three main order parameters shaping the protein universe: (1) "size", with the consequent generation of progressively less dense and more empty structures at an increasing number of residues, (2) "microscopic structuring", linked to the existence of a spectrum going from the prevalence of heterologous (different hydrophobicity) to the prevalence of homologous (similar hydrophobicity) contacts, and (3) "fractal shape", an organizing protein data set along a continuum going from approximately linear to very intermingled structures. Perhaps the time has come for seriously taking into consideration the real relevance of time-honored principles like the hydrophobic core and hydrophobic effect.


Asunto(s)
Bases de Datos de Proteínas , Interacciones Hidrofóbicas e Hidrofílicas , Proteínas/química , Conformación Proteica
13.
Virus Res ; 317: 198814, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35588940

RESUMEN

Adaptive immune response is triggered when specific pathogen peptides called epitopes are recognised as exogenous according to the paradigm of self/non-self. To be recognized by immune cells, epitopes have to be exposed (presented) on the surface of the cell. Predicting if a peptide is exposed is important to shed light on the rules that govern immune response and, thus, identify potential targets and design vaccine and drugs. We focused on peptides exposed on cell surface and made accessible to immune system through the MHC Class I complex. Before this can happen, three successive selection steps have to take place: a) Proteasome cleveage, b) TAP Transport, and c) binding to MHC-class I. Starting from a set of 211 host human reference viruses, we computed the set of unique peptides occurring in the correspondent proteomes. Then, we obtained the probability values of Proteasome Cleveage, TAP Transport and Binding to MHC Class I associated to those peptides through established prediction software tools. Such values were analysed in conjunction with two other features that could play a major role: the distance from self, strictly linked to the concept of nullomers, and the sequence entropy, measuring the complexity of the peptide amino acid composition. The analysis confirmed and extended previous results on a larger, more significant and consistent data set; we showed that the higher the distances from self, the higher the score of TAP Transport and binding to MHC class I; no significant association was instead found between distance from self and Proteasome Cleveage. Additionally, amino acid peptide composition entropy was significantly associated with the other features. In particular, higher entropies were linked with higher scores of Proteasome Cleveage, TAP Transport, Binding to MHC Class I, and higher distance from self. The relationship among the three selection steps provided evidence of a tight inter-correlation, clearly suggesting it could be the product of a co-evolutive process. We believe that these results give new insights on the complex processes that regulate peptide presentation through MHC class I, and unveil the mechanisms the allow the immune system to distinguish self and viral non-self peptides.


Asunto(s)
Complejo de la Endopetidasa Proteasomal , Virus , Transportadoras de Casetes de Unión a ATP/genética , Aminoácidos , Presentación de Antígeno , Entropía , Epítopos , Antígenos de Histocompatibilidad Clase I/metabolismo , Humanos , Péptidos , Complejo de la Endopetidasa Proteasomal/metabolismo , Virus/metabolismo
14.
Infect Genet Evol ; 97: 105154, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34808395

RESUMEN

The pandemic of COVID-19 has been haunting us for almost the past two years. Although, the vaccination drive is in full swing throughout the world, different mutations of the SARS-CoV-2 virus are making it very difficult to put an end to the pandemic. The second wave in India, one of the worst sufferers of this pandemic, can be mainly attributed to the Delta variant i.e. B.1.617.2. Thus, it is very important to analyse and understand the mutational trajectory of SARS-CoV-2 through the study of the 26 virus proteins. In this regard, more than 17,000 protein sequences of Indian SARS-CoV-2 genomes are analysed using entropy-based approach in order to find the monthly mutational trajectory. Furthermore, Hellinger distance is also used to show the difference of the mutation events between the consecutive months for each of the 26 SARS-CoV-2 protein. The results show that the mutation rates and the mutation events of the viral proteins though changing in the initial months, start stabilizing later on for mainly the four structural proteins while the non-structural proteins mostly exhibit a more constant trend. As a consequence, it can be inferred that the evolution of the new mutative configurations will eventually reduce.


Asunto(s)
COVID-19/epidemiología , Genoma Viral , Tasa de Mutación , SARS-CoV-2/genética , Glicoproteína de la Espiga del Coronavirus/genética , Proteínas no Estructurales Virales/genética , Proteínas Estructurales Virales/genética , COVID-19/virología , Entropía , Monitoreo Epidemiológico , Evolución Molecular , Expresión Génica , Humanos , India/epidemiología , Filogenia , SARS-CoV-2/clasificación , SARS-CoV-2/patogenicidad , Glicoproteína de la Espiga del Coronavirus/metabolismo , Proteínas no Estructurales Virales/clasificación , Proteínas no Estructurales Virales/metabolismo , Proteínas Estructurales Virales/clasificación , Proteínas Estructurales Virales/metabolismo
15.
Sci Rep ; 12(1): 9757, 2022 06 13.
Artículo en Inglés | MEDLINE | ID: mdl-35697749

RESUMEN

We present a new method for assessing and measuring homophily in networks whose nodes have categorical attributes, namely when the nodes of networks come partitioned into classes (colors). We probe this method in two different classes of networks: (i) protein-protein interaction (PPI) networks, where nodes correspond to proteins, partitioned according to their functional role, and edges represent functional interactions between proteins (ii) Pokec on-line social network, where nodes correspond to users, partitioned according to their age, and edges respresent friendship between users.Similarly to other classical and well consolidated approaches, our method compares the relative edge density of the subgraphs induced by each class with the corresponding expected relative edge density under a null model. The novelty of our approach consists in prescribing an endogenous null model, namely, the sample space of the null model is built on the input network itself. This allows us to give exact explicit expression for the [Formula: see text]-score of the relative edge density of each class as well as other related statistics. The [Formula: see text]-scores directly quantify the statistical significance of the observed homophily via Cebysëv inequality. The expression of each [Formula: see text]-score is entered by the network structure through basic combinatorial invariant such as the number of subgraphs with two spanning edges. Each [Formula: see text]-score is computed in [Formula: see text] time for a network with n nodes and m edges. This leads to an overall efficient computational method for assesing homophily. We complement the analysis of homophily/heterophily by considering [Formula: see text]-scores of the number of isolated nodes in the subgraphs induced by each class, that are computed in O(nm) time. Theoretical results are then exploited to show that, as expected, both the analyzed network classes are significantly homophilic with respect to the considered node properties.

16.
Infect Genet Evol ; 101: 105294, 2022 07.
Artículo en Inglés | MEDLINE | ID: mdl-35513162

RESUMEN

This study aimed at updating previous data on HIV-1 integrase variability, by using effective bioinformatics methods combining different statistical instruments from simple entropy and mutation rate to more specific approaches such as Hellinger distance. A total of 2133 HIV-1 integrase sequences were analyzed in: i) 1460 samples from drug-naïve [DN] individuals; ii) 386 samples from drug-experienced but INI-naïve [IN] individuals; iii) 287 samples from INI-experienced [IE] individuals. Within the three groups, 76 amino acid positions were highly conserved (≤0.2% variation, Hellinger distance: <0.25%), with 35 fully invariant positions; while, 80 positions were conserved (>0.2% to <1% variation, Hellinger distance: <1%). The H12-H16-C40-C43 and D64-D116-E152 motifs were all well conserved. Some residues were affected by dramatic changes in their mutation distributions, especially between DN and IE samples (Hellinger distance ≥1%). In particular, 15 positions (D6, S24, V31, S39, L74, A91, S119, T122, T124, T125, V126, K160, N222, S230, C280) showed a significant decrease of mutation rate in IN and/or IE samples compared to DN samples. Conversely, 8 positions showed significantly higher mutation rate in samples from treated individuals (IN and/or IE) compared to DN. Some of these positions, such as E92, T97, G140, Y143, Q148 and N155, were already known to be associated with resistance to integrase inhibitors; other positions including S24, M154, V165 and D270 are not yet documented to be associated with resistance. Our study confirms the high conservation of HIV-1 integrase and identified highly invariant positions using robust and innovative methods. The role of novel mutations located in the critical region of HIV-1 integrase deserves further investigation.


Asunto(s)
Infecciones por VIH , Inhibidores de Integrasa VIH , Integrasa de VIH , VIH-1 , Farmacorresistencia Viral/genética , Infecciones por VIH/tratamiento farmacológico , Integrasa de VIH/química , Inhibidores de Integrasa VIH/farmacología , VIH-1/genética , Humanos , Mutación
17.
PLoS Comput Biol ; 6(12): e1001032, 2010 Dec 16.
Artículo en Inglés | MEDLINE | ID: mdl-21187905

RESUMEN

Two T helper (Th) cell subsets, namely Th1 and Th2 cells, play an important role in inflammatory diseases. The two subsets are thought to counter-regulate each other, and alterations in their balance result in different diseases. This paradigm has been challenged by recent clinical and experimental data. Because of the large number of genes involved in regulating Th1 and Th2 cells, assessment of this paradigm by modeling or experiments is difficult. Novel algorithms based on formal methods now permit the analysis of large gene regulatory networks. By combining these algorithms with in silico knockouts and gene expression microarray data from human T cells, we examined if the results were compatible with a counter-regulatory role of Th1 and Th2 cells. We constructed a directed network model of genes regulating Th1 and Th2 cells through text mining and manual curation. We identified four attractors in the network, three of which included genes that corresponded to Th0, Th1 and Th2 cells. The fourth attractor contained a mixture of Th1 and Th2 genes. We found that neither in silico knockouts of the Th1 and Th2 attractor genes nor gene expression microarray data from patients with immunological disorders and healthy subjects supported a counter-regulatory role of Th1 and Th2 cells. By combining network modeling with transcriptomic data analysis and in silico knockouts, we have devised a practical way to help unravel complex regulatory network topology and to increase our understanding of how network actions may differ in health and disease.


Asunto(s)
Biología Computacional/métodos , Redes Reguladoras de Genes , Células TH1/fisiología , Células Th2/fisiología , Algoritmos , Simulación por Computador , Bases de Datos Genéticas , Perfilación de la Expresión Génica , Técnicas de Inactivación de Genes , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos , Fenotipo , Células TH1/metabolismo , Células Th2/metabolismo
18.
Comput Biol Chem ; 92: 107480, 2021 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-33826970

RESUMEN

Epigenetics and DNA methylation play a pivotal role in many processes of the cell and we often observe that an aberrant methylation pattern characterizes pathologies. In this work we investigate the role that the flanking sequences of CGs play in the methylation process in human. We built four different CG datasets: methylated, unmethylated, and two randomly extracted ones. We evaluated features associated to the flanking sequences of those CG sets, for different size around the CG, through five measures accounting for different aspects of sequence composition complexity and structure. The analysis performed through those measures revealed evident different behaviors between methylated and unmethylated probe sets. Major differences were observed for GC content and CG dinucleotide frequency in a window size of 300-400 bp and for CG self-attraction in 3K bp. It is remarkable as the effect of methylated CG lasts much more than expected far from the CG.


Asunto(s)
Islas de CpG/genética , ADN/genética , ADN/metabolismo , Metilación de ADN/genética , Entropía , Humanos
19.
J Immunol Methods ; 481-482: 112787, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-32335161

RESUMEN

Alarms periodically emerge for viral pneumonia infections due to coronavirus. In all cases, these are zoonoses passing the barrier between species and infect humans. The legitimate concern of the international community is due to the fact that the new identified coronavirus, named SARS-CoV-2 (previously called 2019-nCoV), has a quite high mortality rate, around 2%, and a strong ability to spread, with an estimated reproduction number higher than 2. Even though all countries are doing their utmost to stop the pandemic, the only reliable solution to tackle the infection is the rapid development of a vaccine. For this purpose, the means of bioinformatics, applied in the context of reverse-vaccinology paradigm, can be of fundamental help to select the most promising peptides able to trigger an effective immune response. In this short report, using the concept of nullomer and introducing a distance from human self, we provide a list of peptides that could deserve experimental investigation in the view of a potential vaccine for SARS-CoV-2.


Asunto(s)
Betacoronavirus/inmunología , Biología Computacional , Epítopos/inmunología , COVID-19 , Vacunas contra la COVID-19 , Infecciones por Coronavirus/inmunología , Infecciones por Coronavirus/prevención & control , Genes MHC Clase I , Humanos , Pandemias , Péptidos/inmunología , Neumonía Viral , SARS-CoV-2 , Programas Informáticos , Proteínas Virales/inmunología , Vacunas Virales/inmunología
20.
PLoS One ; 15(12): e0243285, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33284846

RESUMEN

More than twenty years ago the reverse vaccinology paradigm came to light trying to design new vaccines based on the analysis of genomic information in order to select those pathogen peptides able to trigger an immune response. In this context, focusing on the proteome of Trypanosoma cruzi, we investigated the link between the probabilities for pathogen peptides to be presented on a cell surface and their distance from human self. We found a reasonable but, as far as we know, undiscovered property: the farther the distance between a peptide and the human-self the higher the probability for that peptide to be presented on a cell surface. We also found that the most distant peptides from human self bind, on average, a broader collection of HLAs than expected, implying a potential immunological role in a large portion of individuals. Finally, introducing a novel quantitative indicator for a peptide to measure its potential immunological role, we proposed a pool of peptides that could be potential epitopes and that can be suitable for experimental testing. The software to compute peptide classes according to the distance from human self is free available at http://www.iasi.cnr.it/~dsantoni/nullomers.


Asunto(s)
Enfermedad de Chagas/inmunología , Antígenos de Histocompatibilidad Clase I/inmunología , Péptidos/inmunología , Proteínas Protozoarias/inmunología , Trypanosoma cruzi/inmunología , Secuencia de Aminoácidos , Epítopos/química , Epítopos/inmunología , Humanos , Péptidos/química , Proteoma/química , Proteoma/inmunología , Proteínas Protozoarias/química , Trypanosoma cruzi/química
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA