Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
J Biol Chem ; 297(2): 100913, 2021 08.
Artículo en Inglés | MEDLINE | ID: mdl-34175310

RESUMEN

Trypanosomatid parasites are responsible for various human diseases, such as sleeping sickness, animal trypanosomiasis, or cutaneous and visceral leishmaniases. The few available drugs to fight related parasitic infections are often toxic and present poor efficiency and specificity, and thus, finding new molecular targets is imperative. Aminoacyl-tRNA synthetases (aaRSs) are essential components of the translational machinery as they catalyze the specific attachment of an amino acid onto cognate tRNA(s). In trypanosomatids, one gene encodes both cytosolic- and mitochondrial-targeted aaRSs, with only three exceptions. We identify here a unique specific feature of aaRSs from trypanosomatids, which is that most of them harbor distinct insertion and/or extension sequences. Among the 26 identified aaRSs in the trypanosome Leishmania tarentolae, 14 contain an additional domain or a terminal extension, confirmed in mature mRNAs by direct cDNA nanopore sequencing. Moreover, these RNA-Seq data led us to address the question of aaRS dual localization and to determine splice-site locations and the 5'-UTR lengths for each mature aaRS-encoding mRNA. Altogether, our results provided evidence for at least one specific mechanism responsible for mitochondrial addressing of some L. tarentolae aaRSs. We propose that these newly identified features of trypanosomatid aaRSs could be developed as relevant drug targets to combat the diseases caused by these parasites.


Asunto(s)
Aminoácidos/metabolismo , Aminoacil-ARNt Sintetasas/metabolismo , Leishmania/enzimología , Leishmaniasis/patología , ARN de Transferencia/genética , Secuencia de Aminoácidos , Aminoacil-ARNt Sintetasas/química , Aminoacil-ARNt Sintetasas/genética , Animales , Citosol/metabolismo , Humanos , Leishmania/aislamiento & purificación , Leishmaniasis/enzimología , Leishmaniasis/parasitología , Mitocondrias/metabolismo , Filogenia , ARN de Transferencia/metabolismo , Homología de Secuencia de Aminoácido
2.
BMC Bioinformatics ; 22(1): 561, 2021 Nov 23.
Artículo en Inglés | MEDLINE | ID: mdl-34814826

RESUMEN

BACKGROUND: Ab initio prediction of splice sites is an essential step in eukaryotic genome annotation. Recent predictors have exploited Deep Learning algorithms and reliable gene structures from model organisms. However, Deep Learning methods for non-model organisms are lacking. RESULTS: We developed Spliceator to predict splice sites in a wide range of species, including model and non-model organisms. Spliceator uses a convolutional neural network and is trained on carefully validated data from over 100 organisms. We show that Spliceator achieves consistently high accuracy (89-92%) compared to existing methods on independent benchmarks from human, fish, fly, worm, plant and protist organisms. CONCLUSIONS: Spliceator is a new Deep Learning method trained on high-quality data, which can be used to predict splice sites in diverse organisms, ranging from human to protists, with consistently high accuracy.


Asunto(s)
Algoritmos , Redes Neurales de la Computación , Animales , Genoma , Humanos
3.
Hum Mutat ; 38(10): 1316-1324, 2017 10.
Artículo en Inglés | MEDLINE | ID: mdl-28608363

RESUMEN

Numerous mutations in each of the mitochondrial aminoacyl-tRNA synthetases (aaRSs) have been implicated in human diseases. The mutations are autosomal and recessive and lead mainly to neurological disorders, although with pleiotropic effects. The processes and interactions that drive the etiology of the disorders associated with mitochondrial aaRSs (mt-aaRSs) are far from understood. The complexity of the clinical, genetic, and structural data requires concerted, interdisciplinary efforts to understand the molecular biology of these disorders. Toward this goal, we designed MiSynPat, a comprehensive knowledge base together with an ergonomic Web server designed to organize and access all pertinent information (sequences, multiple sequence alignments, structures, disease descriptions, mutation characteristics, original literature) on the disease-linked human mt-aaRSs. With MiSynPat, a user can also evaluate the impact of a possible mutation on sequence-conservation-structure in order to foster the links between basic and clinical researchers and to facilitate future diagnosis. The proposed integrated view, coupled with research on disease-related mt-aaRSs, will help to reveal new functions for these enzymes and to open new vistas in the molecular biology of the cell. The purpose of MiSynPat, freely available at http://misynpat.org, is to constitute a reference and a converging resource for scientists and clinicians.


Asunto(s)
Aminoacil-ARNt Sintetasas/genética , Bases de Datos Genéticas , Mitocondrias/enzimología , Mutación/genética , Secuencia de Aminoácidos , Aminoacil-ARNt Sintetasas/química , Evolución Molecular , Enfermedades Genéticas Congénitas/genética , Humanos , Mitocondrias/genética , Estructura Molecular , Conformación Proteica
4.
Nucleic Acids Res ; 40(Web Server issue): W71-5, 2012 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22641855

RESUMEN

A major challenge in the post-genomic era is a better understanding of how human genetic alterations involved in disease affect the gene products. The KD4v (Comprehensible Knowledge Discovery System for Missense Variant) server allows to characterize and predict the phenotypic effects (deleterious/neutral) of missense variants. The server provides a set of rules learned by Induction Logic Programming (ILP) on a set of missense variants described by conservation, physico-chemical, functional and 3D structure predicates. These rules are interpretable by non-expert humans and are used to accurately predict the deleterious/neutral status of an unknown mutation. The web server is available at http://decrypthon.igbmc.fr/kd4v.


Asunto(s)
Enfermedad/genética , Mutación Missense , Polimorfismo de Nucleótido Simple , Programas Informáticos , Estudios de Asociación Genética , Humanos , Internet , Bases del Conocimiento , Fenotipo , Proteínas/química , Proteínas/genética
5.
Hum Mutat ; 31(2): 127-35, 2010 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-19921752

RESUMEN

Understanding how genetic alterations affect gene products at the molecular level represents a first step in the elucidation of the complex relationships between genotypic and phenotypic variations, and is thus a major challenge in the postgenomic era. Here, we present SM2PH-db (http://decrypthon.igbmc.fr/sm2ph), a new database designed to investigate structural and functional impacts of missense mutations and their phenotypic effects in the context of human genetic diseases. A wealth of up-to-date interconnected information is provided for each of the 2,249 disease-related entry proteins (August 2009), including data retrieved from biological databases and data generated from a Sequence-Structure-Evolution Inference in Systems-based approach, such as multiple alignments, three-dimensional structural models, and multidimensional (physicochemical, functional, structural, and evolutionary) characterizations of mutations. SM2PH-db provides a robust infrastructure associated with interactive analysis tools supporting in-depth study and interpretation of the molecular consequences of mutations, with the more long-term goal of elucidating the chain of events leading from a molecular defect to its pathology. The entire content of SM2PH-db is regularly and automatically updated thanks to a computational grid data federation facilities provided in the context of the Decrypthon program.


Asunto(s)
Bases de Datos de Proteínas , Enfermedades Genéticas Congénitas/genética , Mutación Missense/genética , Programas Informáticos , Humanos , Internet , Fenotipo , Proteínas , Interfaz Usuario-Computador
6.
Bioinformatics ; 24(2): 276-8, 2008 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-18037684

RESUMEN

UNLABELLED: With the establishment of high-throughput (HT) screening methods there is an increasing need for automatic analysis methods. Here we present RReportGenerator, a user-friendly portal for automatic routine analysis using the statistical platform R and Bioconductor. RReportGenerator is designed to analyze data using predefined analysis scenarios via a graphical user interface (GUI). A report in pdf format combining text, figures and tables is automatically generated and results may be exported. To demonstrate suitable analysis tasks we provide direct web access to a collection of analysis scenarios for summarizing data from transfected cell arrays (TCA), segmentation of CGH data, and microarray quality control and normalization. AVAILABILITY: RReportGenerator, a user manual and a collection of analysis scenarios are available under a GNU public license on http://www-bio3d-igbmc.u-strasbg.fr/~wraff


Asunto(s)
Algoritmos , Gráficos por Computador , Documentación/métodos , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Programas Informáticos , Interfaz Usuario-Computador , Interpretación Estadística de Datos
7.
BMC Bioinformatics ; 8: 62, 2007 Feb 23.
Artículo en Inglés | MEDLINE | ID: mdl-17319945

RESUMEN

BACKGROUND: The post-genomic era is characterised by a torrent of biological information flooding the public databases. As a direct consequence, similarity searches starting with a single query sequence frequently lead to the identification of hundreds, or even thousands of potential homologues. The huge volume of data renders the subsequent structural, functional and evolutionary analyses very difficult. It is therefore essential to develop new strategies for efficient sampling of this large sequence space, in order to reduce the number of sequences to be processed. At the same time, it is important to retain the most pertinent sequences for structural and functional studies. RESULTS: An exhaustive analysis on a large scale test set (284 protein families) was performed to compare the efficiency of four different sampling methods aimed at selecting the most pertinent sequences. These four methods sample the proteins detected by BlastP searches and can be divided into two categories: two customisable methods where the user defines either the maximal number or the percentage of sequences to be selected; two automatic methods in which the number of sequences selected is determined by the program. We focused our analysis on the potential information content of the sampled sets of sequences using multiple alignment of complete sequences as the main validation tool. The study considered two criteria: the total number of sequences in BlastP and their associated E-values. The subsequent analyses investigated the influence of the sampling methods on the E-value distributions, the sequence coverage, the final multiple alignment quality and the active site characterisation at various residue conservation thresholds as a function of these criteria. CONCLUSION: The comparative analysis of the four sampling methods allows us to propose a suitable sampling strategy that significantly reduces the number of homologous sequences required for alignment, while at the same time maintaining the relevant information concerning the active site residues.


Asunto(s)
Algoritmos , Bases de Datos de Proteínas , Almacenamiento y Recuperación de la Información/métodos , Proteínas/química , Proteínas/metabolismo , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Secuencia Conservada , Sistemas de Administración de Bases de Datos , Datos de Secuencia Molecular , Homología de Secuencia de Aminoácido , Relación Estructura-Actividad
8.
Nucleic Acids Res ; 31(13): 3829-32, 2003 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-12824430

RESUMEN

PipeAlign is a protein family analysis tool integrating a five step process ranging from the search for sequence homologues in protein and 3D structure databases to the definition of the hierarchical relationships within and between subfamilies. The complete, automatic pipeline takes a single sequence or a set of sequences as input and constructs a high-quality, validated MACS (multiple alignment of complete sequences) in which sequences are clustered into potential functional subgroups. For the more experienced user, the PipeAlign server also provides numerous options to run only a part of the analysis, with the possibility to modify the default parameters of each software module. For example, the user can choose to enter an existing multiple sequence alignment for refinement, validation and subsequent clustering of the sequences. The aim is to provide an interactive workbench for the validation, integration and presentation of a protein family, not only at the sequence level, but also at the structural and functional levels. PipeAlign is available at http://igbmc.u-strasbg.fr/PipeAlign/.


Asunto(s)
Proteínas/clasificación , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Internet , Proteínas/química , Control de Calidad , Alineación de Secuencia , Programas Informáticos/normas , Interfaz Usuario-Computador
9.
Biochimie ; 100: 18-26, 2014 May.
Artículo en Inglés | MEDLINE | ID: mdl-24120687

RESUMEN

Mammalian mitochondrial aminoacyl-tRNA synthetases are nuclear-encoded enzymes that are essential for mitochondrial protein synthesis. Due to an endosymbiotic origin of the mitochondria, many of them share structural domains with homologous bacterial enzymes of same specificity. This is also the case for human mitochondrial aspartyl-tRNA synthetase (AspRS) that shares the so-called bacterial insertion domain with bacterial homologs. The function of this domain in the mitochondrial proteins is unclear. Here, we show by bioinformatic analyses that the sequences coding for the bacterial insertion domain are less conserved in opisthokont and protist than in bacteria and viridiplantae. The divergence suggests a loss of evolutionary pressure on this domain for non-plant mitochondrial AspRSs. This discovery is further connected with the herein described occurrence of alternatively spliced transcripts of the mRNAs coding for some mammalian mitochondrial AspRSs. Interestingly, the spliced transcripts alternately lack one of the four exons that code for the bacterial insertion domain. Although we showed that the human alternative transcript is present in all tested tissues; co-exists with the full-length form, possesses 5'- and 3'-UTRs, a poly-A tail and is bound to polysomes, we were unable to detect the corresponding protein. The relaxed selective pressure combined with the occurrence of alternative splicing, involving a single structural sub-domain, favors the hypothesis of the loss of function of this domain for AspRSs of mitochondrial location. This evolutionary divergence is in line with other characteristics, established for the human mt-AspRS, that indicate a functional relaxation of non-viridiplantae mt-AspRSs when compared to bacterial and plant ones, despite their common ancestry.


Asunto(s)
Aspartato-ARNt Ligasa/química , Mitocondrias/genética , Proteínas Mitocondriales/química , Biosíntesis de Proteínas , ARN Mensajero/química , Empalme Alternativo , Alveolados/enzimología , Alveolados/genética , Secuencia de Aminoácidos , Amebozoos/enzimología , Amebozoos/genética , Animales , Archaea/enzimología , Archaea/genética , Aspartato-ARNt Ligasa/genética , Aspartato-ARNt Ligasa/metabolismo , Secuencia de Bases , Evolución Molecular , Hongos/enzimología , Hongos/genética , Expresión Génica , Humanos , Mitocondrias/enzimología , Proteínas Mitocondriales/genética , Proteínas Mitocondriales/metabolismo , Modelos Moleculares , Datos de Secuencia Molecular , Mutagénesis Insercional , Estructura Terciaria de Proteína , ARN Mensajero/genética , ARN Mensajero/metabolismo , Selección Genética , Alineación de Secuencia , Viridiplantae/enzimología , Viridiplantae/genética
10.
Database (Oxford) ; 2012: bas018, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22491796

RESUMEN

The elucidation of the complex relationships linking genotypic and phenotypic variations to protein structure is a major challenge in the post-genomic era. We present MSV3d (Database of human MisSense Variants mapped to 3D protein structure), a new database that contains detailed annotation of missense variants of all human proteins (20 199 proteins). The multi-level characterization includes details of the physico-chemical changes induced by amino acid modification, as well as information related to the conservation of the mutated residue and its position relative to functional features in the available or predicted 3D model. Major releases of the database are automatically generated and updated regularly in line with the dbSNP (database of Single Nucleotide Polymorphism) and SwissVar releases, by exploiting the extensive Décrypthon computational grid resources. The database (http://decrypthon.igbmc.fr/msv3d) is easily accessible through a simple web interface coupled to a powerful query engine and a standard web service. The content is completely or partially downloadable in XML or flat file formats. Database URL: http://decrypthon.igbmc.fr/msv3d.


Asunto(s)
Bases de Datos de Proteínas , Mutación Missense , Proteínas/química , Proteínas/genética , Sustitución de Aminoácidos , Sistemas de Administración de Bases de Datos , Humanos , Internet , Modelos Moleculares , Polimorfismo de Nucleótido Simple , Conformación Proteica
11.
Acta Crystallogr D Biol Crystallogr ; 59(Pt 12): 2094-103, 2003 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-14646067

RESUMEN

Structural refinement of proteins involves the minimization of a target function that combines X-ray data with a set of restraints enforcing stereochemistry and packing. Electrostatic interactions are not ordinarily included in the target function, partly because they cannot be calculated reliably without a description of dielectric screening by solvent in the crystal. With the recent development of accurate implicit solvent models to describe this screening, the question arises as to whether a more detailed target function including electrostatic and solvation terms can yield more accurate structures or somewhat different structures of equivalent accuracy. The Generalized Born (GB) model is one such model that describes the solvent as a dielectric continuum, taking into account its heterogeneous distribution within the crystal. It is used here for X-ray refinements of three protein structures with experimental diffraction data to 2.4, 2.9 and 3.2 A, respectively. In each case, a higher resolution structure is available for comparison. The new target function includes stereochemical restraints, van der Waals, Coulomb and solvation interactions, along with the usual X-ray pseudo-energy term, which employs the likelihood estimator of Pannu and Read. Multiple simulated-annealing refinements were performed in torsion-angle space with a conventional target function and the new GB target function, yielding ensembles of refined structures. The new target function yields structures of similar accuracy, as measured by the free R factor, map/model correlations and deviations from the high-resolution structures. About 10% of side-chain conformations differ between the two sets of refinements, in the sense that the two ensembles of conformations do not completely overlap. Over 75% of the differences correspond to surface side chains. For one of the proteins, the GB set has a greater dispersion, indicating that for this case the conventional target function overestimates the true precision. As GB parameterization continues to improve, we expect that this approach will become increasingly useful.


Asunto(s)
Cristalografía por Rayos X/métodos , Proteínas/química , Aspartato-ARNt Ligasa/química , Simulación por Computador , Antígenos de Histocompatibilidad Clase I/química , Transferasas de Hidroximetilo y Formilo/química , Modelos Químicos , Modelos Moleculares , Conformación Proteica , Solventes/química , Electricidad Estática
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA