Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 201
Filtrar
1.
Proc Biol Sci ; 291(2027): 20240985, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-39081174

RESUMO

Land plants (embryophytes) came about in a momentous evolutionary singularity: plant terrestrialization. This event marks not only the conquest of land by plants but also the massive radiation of embryophytes into a diverse array of novel forms and functions. The unique suite of traits present in the earliest land plants is thought to have been ushered in by a burst in genomic novelty. Here, we asked the question of how these bursts were possible. For this, we explored: (i) the initial emergence and (ii) the reshuffling of domains to give rise to hallmark environmental response genes of land plants. We pinpoint that a quarter of the embryophytic genes for stress physiology are specific to the lineage, yet a significant portion of this novelty arises not de novo but from reshuffling and recombining of pre-existing domains. Our data suggest that novel combinations of old genomic substrate shaped the plant terrestrialization toolkit, including hallmark processes in signalling, biotic interactions and specialized metabolism.


Assuntos
Evolução Biológica , Embriófitas , Domínios Proteicos , Embriófitas/genética
2.
Genome Med ; 16(1): 88, 2024 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-38992748

RESUMO

BACKGROUND: One of the major hurdles in clinical genetics is interpreting the clinical consequences associated with germline missense variants in humans. Recent significant advances have leveraged natural variation observed in large-scale human populations to uncover genes or genomic regions that show a depletion of natural variation, indicative of selection pressure. We refer to this as "genetic constraint". Although existing genetic constraint metrics have been demonstrated to be successful in prioritising genes or genomic regions associated with diseases, their spatial resolution is limited in distinguishing pathogenic variants from benign variants within genes. METHODS: We aim to identify missense variants that are significantly depleted in the general human population. Given the size of currently available human populations with exome or genome sequencing data, it is not possible to directly detect depletion of individual missense variants, since the average expected number of observations of a variant at most positions is less than one. We instead focus on protein domains, grouping homologous variants with similar functional impacts to examine the depletion of natural variations within these comparable sets. To accomplish this, we develop the Homologous Missense Constraint (HMC) score. We utilise the Genome Aggregation Database (gnomAD) 125 K exome sequencing data and evaluate genetic constraint at quasi amino-acid resolution by combining signals across protein homologues. RESULTS: We identify one million possible missense variants under strong negative selection within protein domains. Though our approach annotates only protein domains, it nonetheless allows us to assess 22% of the exome confidently. It precisely distinguishes pathogenic variants from benign variants for both early-onset and adult-onset disorders. It outperforms existing constraint metrics and pathogenicity meta-predictors in prioritising de novo mutations from probands with developmental disorders (DD). It is also methodologically independent of these, adding power to predict variant pathogenicity when used in combination. We demonstrate utility for gene discovery by identifying seven genes newly significantly associated with DD that could act through an altered-function mechanism. CONCLUSIONS: Grouping variants of comparable functional impacts is effective in evaluating their genetic constraint. HMC is a novel and accurate predictor of missense consequence for improved variant interpretation.


Assuntos
Mutação de Sentido Incorreto , Humanos , Domínios Proteicos , Predisposição Genética para Doença
3.
J Theor Biol ; 592: 111878, 2024 Sep 07.
Artigo em Inglês | MEDLINE | ID: mdl-38901778

RESUMO

Heaps' or Herdan-Heaps' law is a linguistic law describing the relationship between the vocabulary/dictionary size (type) and word counts (token) to be a power-law function. Its existence in genomes with certain definition of DNA words is unclear partly because the dictionary size in genome could be much smaller than that in a human language. We define a DNA word as a coding region in a genome that codes for a protein domain. Using human chromosomes and chromosome arms as individual samples, we establish the existence of Heaps' law in the human genome within limited range. Our definition of words in a genomic or proteomic context is different from other definitions such as over-represented k-mers which are much shorter in length. Although an approximate power-law distribution of protein domain sizes due to gene duplication and the related Zipf's law is well known, their translation to the Heaps' law in DNA words is not automatic. Several other animal genomes are shown herein also to exhibit range-limited Heaps' law with our definition of DNA words, though with various exponents. When tokens were randomly sampled and sample sizes reach to the maximum level, a deviation from the Heaps' law was observed, but a quadratic regression in log-log type-token plot fits the data perfectly. Investigation of type-token plot and its regression coefficients could provide an alternative narrative of reusage and redundancy of protein domains as well as creation of new protein domains from a linguistic perspective.


Assuntos
DNA , Genoma Humano , Humanos , DNA/genética , Animais , Linguística , Domínios Proteicos
4.
Mol Cell Proteomics ; 23(7): 100796, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38851451

RESUMO

Protein O-linked mannose (O-Man) glycosylation is an evolutionary conserved posttranslational modification that fulfills important biological roles during embryonic development. Three nonredundant enzyme families, POMT1/POMT2, TMTC1-4, and TMEM260, selectively coordinate the initiation of protein O-Man glycosylation on distinct classes of transmembrane proteins, including α-dystroglycan, cadherins, and plexin receptors. However, a systematic investigation of their substrate specificities is lacking, in part due to the ubiquitous expression of O-Man glycosyltransferases in cells, which precludes analysis of pathway-specific O-Man glycosylation on a proteome-wide scale. Here, we apply a targeted workflow for membrane glycoproteomics across five human cell lines to extensively map O-Man substrates and genetically deconstruct O-Man initiation by individual and combinatorial knockout of O-Man glycosyltransferase genes. We established a human cell library for the analysis of substrate specificities of individual O-Man initiation pathways by quantitative glycoproteomics. Our results identify 180 O-Man glycoproteins, demonstrate new protein targets for the POMT1/POMT2 pathway, and show that TMTC1-4 and TMEM260 pathways widely target distinct Ig-like protein domains of plasma membrane proteins involved in cell-cell and cell-extracellular matrix interactions. The identification of O-Man on Ig-like folds adds further knowledge on the emerging concept of domain-specific O-Man glycosylation which opens for functional studies of O-Man-glycosylated adhesion molecules and receptors.


Assuntos
Manose , Humanos , Glicosilação , Manose/metabolismo , Especificidade por Substrato , Glicoproteínas/metabolismo , Proteômica/métodos , Linhagem Celular , Glicosiltransferases/metabolismo , Glicosiltransferases/genética , Processamento de Proteína Pós-Traducional , Engenharia Celular/métodos
5.
Front Microbiol ; 15: 1230997, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38690360

RESUMO

A rapid increase in antimicrobial resistant bacterial infections around the world is causing a global health crisis. The Gram-negative bacterium Acinetobacter baumannii is categorized as a Priority 1 pathogen for research and development of new antimicrobials by the World Health Organization due to its numerous intrinsic antibiotic resistance mechanisms and ability to quickly acquire new resistance determinants. Specialized phage enzymes, called depolymerases, degrade the bacterial capsule polysaccharide layer and show therapeutic potential by sensitizing the bacterium to phages, select antibiotics, and serum killing. The functional domains responsible for the capsule degradation activity are often found in the tail fibers of select A. baumannii phages. To further explore the functional domains associated with depolymerase activity, tail-associated proteins of 71 sequenced and fully characterized phages were identified from published literature and analyzed for functional domains using InterProScan. Multisequence alignments and phylogenetic analyses were conducted on the domain groups and assessed in the context of noted halo formation or depolymerase characterization. Proteins derived from phages noted to have halo formation or a functional depolymerase, but no functional domain hits, were modeled with AlphaFold2 Multimer, and compared to other protein models using the DALI server. The domains associated with depolymerase function were pectin lyase-like (SSF51126), tailspike binding (cd20481), (Trans)glycosidases (SSF51445), and potentially SGNH hydrolases. These findings expand our knowledge on phage depolymerases, enabling researchers to better exploit these enzymes for therapeutic use in combating the antimicrobial resistance crisis.

6.
Protein Sci ; 33(6): e4988, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38757367

RESUMO

Identifying unknown functional properties of proteins is essential for understanding their roles in both health and disease states. The domain composition of a protein can reveal critical information in this context, as domains are structural and functional units that dictate how the protein should act at the molecular level. The expensive and time-consuming nature of wet-lab experimental approaches prompted researchers to develop computational strategies for predicting the functions of proteins. In this study, we proposed a new method called Domain2GO that infers associations between protein domains and function-defining gene ontology (GO) terms, thus redefining the problem as domain function prediction. Domain2GO uses documented protein-level GO annotations together with proteins' domain annotations. Co-annotation patterns of domains and GO terms in the same proteins are examined using statistical resampling to obtain reliable associations. As a use-case study, we evaluated the biological relevance of examples selected from the Domain2GO-generated domain-GO term mappings via literature review. Then, we applied Domain2GO to predict unknown protein functions by propagating domain-associated GO terms to proteins annotated with these domains. For function prediction performance evaluation and comparison against other methods, we employed Critical Assessment of Function Annotation 3 (CAFA3) challenge datasets. The results demonstrated the high potential of Domain2GO, particularly for predicting molecular function and biological process terms, along with advantages such as producing interpretable results and having an exceptionally low computational cost. The approach presented here can be extended to other ontologies and biological entities to investigate unknown relationships in complex and large-scale biological data. The source code, datasets, results, and user instructions for Domain2GO are available at https://github.com/HUBioDataLab/Domain2GO. Additionally, we offer a user-friendly online tool at https://huggingface.co/spaces/HUBioDataLab/Domain2GO, which simplifies the prediction of functions of previously unannotated proteins solely using amino acid sequences.


Assuntos
Anotação de Sequência Molecular , Domínios Proteicos , Proteínas , Proteínas/química , Proteínas/metabolismo , Proteínas/genética , Bases de Dados de Proteínas , Biologia Computacional/métodos , Ontologia Genética , Humanos , Software
7.
Int J Mol Sci ; 25(5)2024 Feb 29.
Artigo em Inglês | MEDLINE | ID: mdl-38474085

RESUMO

Mowat-Wilson syndrome (MWS) is a rare genetic neurodevelopmental congenital disorder associated with various defects of the zinc finger E-box binding homeobox 2 (ZEB2) gene. The ZEB2 gene is autosomal dominant and encodes six protein domains including the SMAD-binding protein, which functions as a transcriptional corepressor involved in the conversion of neuroepithelial cells in early brain development and as a mediator of trophoblast differentiation. This review summarizes reported ZEB2 gene variants, their types, and frequencies among the 10 exons of ZEB2. Additionally, we summarized their corresponding encoded protein defects including the most common variant, c.2083 C>T in exon 8, which directly impacts the homeodomain (HD) protein domain. This single defect was found in 11% of the 298 reported patients with MWS. This review demonstrates that exon 8 encodes at least three of the six protein domains and accounts for 66% (198/298) of the variants identified. More than 90% of the defects were due to nonsense or frameshift changes. We show examples of protein modeling changes that occurred as a result of ZEB2 gene defects. We also report a novel pathogenic variant in exon 8 in a 5-year-old female proband with MWS. This review further explores other genes predicted to be interacting with the ZEB2 gene and their predicted gene-gene molecular interactions with protein binding effects on embryonic multi-system development such as craniofacial, spine, brain, kidney, cardiovascular, and hematopoiesis.


Assuntos
Fácies , Doença de Hirschsprung , Deficiência Intelectual , Microcefalia , Proteínas Repressoras , Feminino , Humanos , Pré-Escolar , Proteínas Repressoras/genética , Homeobox 2 de Ligação a E-box com Dedos de Zinco/genética , Deficiência Intelectual/genética , Proteínas de Homeodomínio/genética , Fatores de Transcrição
8.
J Mol Biol ; : 168551, 2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38548261

RESUMO

CATH (https://www.cathdb.info) classifies domain structures from experimental protein structures in the PDB and predicted structures in the AlphaFold Database (AFDB). To cope with the scale of the predicted data a new NextFlow workflow (CATH-AlphaFlow), has been developed to classify high-quality domains into CATH superfamilies and identify novel fold groups and superfamilies. CATH-AlphaFlow uses a novel state-of-the-art structure-based domain boundary prediction method (ChainSaw) for identifying domains in multi-domain proteins. We applied CATH-AlphaFlow to process PDB structures not classified in CATH and AFDB structures from 21 model organisms, expanding CATH by over 100%. Domains not classified in existing CATH superfamilies or fold groups were used to seed novel folds, giving 253 new folds from PDB structures (September 2023 release) and 96 from AFDB structures of proteomes of 21 model organisms. Where possible, functional annotations were obtained using (i) predictions from publicly available methods (ii) annotations from structural relatives in AFDB/UniProt50. We also predicted functional sites and highly conserved residues. Some folds are associated with important functions such as photosynthetic acclimation (in flowering plants), iron permease activity (in fungi) and post-natal spermatogenesis (in mice). CATH-AlphaFlow will allow us to identify many more CATH relatives in the AFDB, further characterising the protein structure landscape.

9.
PeerJ ; 12: e16063, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38188152

RESUMO

Calcium (Ca2 +) homeostasis is essential in conducting various cellular processes including nerve transmission, muscular movement, and immune response. Changes in Ca2 + concentration in the cytoplasm are significant in bringing about various immune responses such as pathogen clearance and apoptosis. Various key players are involved in calcium homeostasis such as calcium binders, pumps, and channels. Sequence-based evolutionary information has recently been exploited to predict the biophysical behaviors of proteins, giving critical clues about their functionality. Ion channels are reportedly the first channels developed during evolution. Calcium homeostasis modulator protein 6 (CALHM6) is one such channel. Comprised of a single domain called Ca_hom_mod, CALHM6 is a stable protein interacting with various other proteins in calcium regulation. No previous attempt has been made to trace the exact evolutionary events in the domain of CALHM6, leaving plenty of room for exploring its evolution across a wide range of organisms. The current study aims to answer the questions by employing a computational-based strategy that used profile Hidden Markov Models (HMMs) to scan for the CALHM6 domain, integrated the data with a time-calibrated phylogenetic tree using BEAST and Mesquite, and visualized through iTOL. Around 4,000 domains were identified, and 14,000 domain gain, loss, and duplication events were observed at the end which also included various protein domains other than CALHM6. The data were analyzed concerning CALHM6 evolution as well as the domain gain, loss, and duplication of its interacting partners: Calpain, Vinculin, protein S100-A7, Thioredoxin, Peroxiredoxin, and Calmodulin-like protein 5. Duplication events of CALHM6 near higher eukaryotes showed its increasing complexity in structure and function. This in-silico phylogenetic approach applied to trace the evolution of CALHM6 was an effective approach to get a better understanding of the protein CALHM6.


Assuntos
Conservadores da Densidade Óssea , Filogenia , Domínios Proteicos , Cálcio da Dieta , Homeostase , Antagonistas de Hormônios
10.
Microbiol Spectr ; 12(1): e0346423, 2024 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-38038435

RESUMO

IMPORTANCE: We found that in contrast to the best-studied model organisms, such as Escherichia coli and Bacillus subtilis, most bacterial and archaeal species have a CheA protein with a different domain composition. We report variations in CheA architecture, such as domain duplication and acquisition as well as class-specific domain composition. Our results will be of interest to those working on signal transduction in bacteria and archaea and lay the foundation for experimental studies.


Assuntos
Archaea , Proteínas de Escherichia coli , Histidina Quinase/genética , Histidina Quinase/metabolismo , Archaea/genética , Archaea/metabolismo , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Quimiotaxia , Bactérias/metabolismo , Proteínas de Escherichia coli/metabolismo , Escherichia coli/genética , Escherichia coli/metabolismo , Fosforilação
11.
mBio ; : e0238823, 2023 Nov 29.
Artigo em Inglês | MEDLINE | ID: mdl-38018991

RESUMO

IMPORTANCE: Short linear motifs (SLiMs) are 3-10 amino acid long binding motifs in intrinsically disordered protein regions (IDRs) that serve as ubiquitous protein-protein interaction modules in eukaryotic cells. Through molecular mimicry, viruses hijack these sequence motifs to control host cellular processes. It is thought that the small size of SLiMs and the high mutation frequencies of viral IDRs allow rapid host adaptation. However, a salient characteristic of RNA viruses, due to high replication errors, is their obligate existence as mutant swarms. Taking advantage of the uniquely large genomic database of SARS-CoV-2, here, we analyze the role of sequence diversity in the presentation of SLiMs, focusing on the highly abundant, multi-functional nucleocapsid protein. We find that motif mimicry is a highly dynamic process that produces an abundance of motifs transiently present in subsets of mutant species. This diversity allows the virus to efficiently explore eukaryotic motifs and evolve the host-virus interface.

12.
Comput Struct Biotechnol J ; 21: 4743-4758, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37822561

RESUMO

Background: Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in this field by modeling and analyzing the mutational landscape. However, new approaches are required, especially for accurate representation and data-centric analysis of sequence variations. Method: In this study, we propose ASCARIS (Annotation and StruCture-bAsed RepresentatIon of Single amino acid variations), a method for the featurization (i.e., quantitative representation) of single amino acid variations (SAVs), which could be used for a variety of purposes, such as predicting their functional effects or building multi-omics-based integrative models. ASCARIS utilizes the direct and spatial correspondence between the location of the SAV on the sequence/structure and 30 different types of positional feature annotations (e.g., active/lipidation/glycosylation sites; calcium/metal/DNA binding, inter/transmembrane regions, etc.), along with structural features and physicochemical properties. The main novelty of this method lies in constructing reusable numerical representations of SAVs via functional annotations. Results: We statistically analyzed the relationship between these features and the consequences of variations and found that each carries information in this regard. To investigate potential applications of ASCARIS, we trained variant effect prediction models that utilize our SAV representations as input. We carried out an ablation study and a comparison against the state-of-the-art methods and observed that ASCARIS has a competing and complementary performance against widely-used predictors. ASCARIS can be used alone or in combination with other approaches to represent SAVs from a functional perspective. ASCARIS is available as a programmatic tool at https://github.com/HUBioDataLab/ASCARIS and as a web-service at https://huggingface.co/spaces/HUBioDataLab/ASCARIS.

13.
Microorganisms ; 11(8)2023 Jul 28.
Artigo em Inglês | MEDLINE | ID: mdl-37630493

RESUMO

The Candidate Phyla Radiation (CPR) was found to harbor a vast repertoire of genes encoding for enzymes with potential antibiotic resistance activity. Among these, as many as 3349 genes were predicted in silico to contain a metallo-beta-lactamase-like (MBL-like) fold. These proteins were subject to an in silico functional characterization by comparing their protein profiles (presence/absence of conserved protein domains) to other MBLs, including 24 already expressed in vitro, along with those of the beta-lactamase database (BLDB) (n = 761). The sequence similarity network (SSN) was then used to predict the functional clusters of CPR MBL-like sequences. Our findings showed that CPR MBL-like sequences were longer and more diverse than bacterial MBL sequences, with a high content of functional domains. Most CPR MBL-like sequences did not show any SSN connectivity with expressed MBLs, indicating the presence of many potential, yet unidentified, functions in CPR. In conclusion, CPR was shown to have many protein functions and a large sequence variability of MBL-like folds, exceeding all known MBLs. Further experimental and evolutionary studies of this superfamily of hydrolyzing enzymes are necessary to illustrate their functional annotation, origin, and expansion for adaptation or specialization within a given niche or compared to a specific substrate.

14.
PeerJ ; 11: e15715, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37492397

RESUMO

Background: Functionally related genes are well known to be often grouped in close vicinity in the genomes, particularly in prokaryotes. Notwithstanding the diverse evolutionary mechanisms leading to this phenomenon, it can be used to predict functions of uncharacterized genes. Methods: Here, we provide a simple but robust statistical approach that leverages the vast amounts of genomic data available today. Considering a protein domain as a functional unit, one can explore other functional units (domains) that significantly often occur within the genomic neighborhoods of the queried domain. This analysis can be performed across different taxonomic levels. Provisions can also be made to correct for the uneven sampling of the taxonomic space by genomic sequencing projects that often focus on large numbers of very closely related strains, e.g., pathogenic ones. To this end, an optional procedure for averaging occurrences within subtaxa is available. Results: Several examples show this approach can provide useful functional predictions for uncharacterized gene families, and how to combine this information with other approaches. The method is made available as a web server at http://bioinfo.sggw.edu.pl/neighborhood_analysis.


Assuntos
Genoma , Proteínas , Mapeamento Cromossômico/métodos , Proteínas/genética , Genômica/métodos , Sequência de Bases
15.
Vet Microbiol ; 284: 109820, 2023 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-37364454

RESUMO

Duck Tembusu virus (DTMUV) causes severe reduction in egg production and neurological symptoms in ducklings. Vaccination is the primary measure used to prevent DTMUV infections. In this study, self-assembled nanoparticles with the E protein domain III of DTMUV, using ferritin as a carrier (EDⅢ-RFNp), were prepared using a prokaryotic expression system. Ducks were intramuscularly vaccinated with EDⅢ-RFNp, EDⅢ protein, an inactivated vaccine HB strain (InV-HB), and PBS. At 0, 4, and 6 weeks post-primary vaccination, the EDIII protein-specific antibody titre, IL-4, and IFN-γ concentrations in serum were determined by ELISA, and neutralising antibodies titres in sera were determined by virus neutralising assay. Peripheral blood lymphocytes proliferation was determined by CCK-8 kit. Following challenge with the virulent DTMUV strain, the clinical signals and survival rate of the vaccinated ducks were recorded, and DTMUV RNA levels in the blood and tissues of the surviving ducks were determined by real-time quantitative RT-PCR. The near-spherical EDⅢ-RFNp nanoparticles with 13.29 ± 1.43 nm diameter were observed by transmission electron microscope. At 4 and 6 weeks post-primary vaccination, special and Virus neutralisation (VN) antibodies, lymphocyte proliferation (stimulator index, SI), and concentrations of IL-4 and IFN-γ in the EDⅢ-RFNp group were significantly higher than in the EDⅢ and PBS groups. In the DTMUV virulent strain challenge test, the EDⅢ-RFNp-vaccinated ducks showed milder clinical signs and higher survival rates than EDⅢ- and PBS-vaccinated ducks. The DTMUV RNA levels in the blood and tissues of EDⅢ-RFNp-vaccinated ducks were significantly lower than those in EDⅢ- and PBS-vaccinated ducks. Additionally, the EDⅢ protein-special and VN antibodies, SI value, and concentration of IL-4 and IFN-γ in the InV-HB group was significantly higher than that of the PBS group at 4 and 6 weeks post-primary vaccination. InV-HB provided more efficient protection than PBS based on a higher survival rate, milder signals, and lower levels of the DTMUV virus in the blood and tissues. These results indicated that EDⅢ-RFNp effectively protected ducks against DTMUV challenge and could be a vaccine candidate to prevent DTMUV infection.


Assuntos
Infecções por Flavivirus , Flavivirus , Doenças das Aves Domésticas , Animais , Patos , Infecções por Flavivirus/veterinária , Ferritinas , Interleucina-4 , Domínios Proteicos , Anticorpos Antivirais , Flavivirus/genética , Imunidade
16.
Proteins ; 91(12): 1558-1570, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37254889

RESUMO

Processing of CASP15 targets into evaluation units (EUs) and assigning them to evolutionary-based prediction classes is presented in this study. The targets were first split into structural domains based on compactness and similarity to other proteins. Models were then evaluated against these domains and their combinations. The domains were joined into larger EUs if predictors' performance on the combined units was similar to that on individual domains. Alternatively, if most predictors performed better on the individual domains, then they were retained as EUs. As a result, 112 evaluation units were created from 77 tertiary structure prediction targets. The EUs were assigned to four prediction classes roughly corresponding to target difficulty categories in previous CASPs: TBM (template-based modeling, easy or hard), FM (free modeling), and the TBM/FM overlap category. More than a third of CASP15 EUs were attributed to the historically most challenging FM class, where homology or structural analogy to proteins of known fold cannot be detected.


Assuntos
Biologia Computacional , Dobramento de Proteína , Modelos Moleculares , Bases de Dados de Proteínas , Proteínas/química
17.
Biomol NMR Assign ; 17(1): 151-157, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37155029

RESUMO

SASH1 is a scaffold protein with context-dependent biological functions in cell adhesion, tumor metastasis, lung development, and pigmentation. As a member of the SLy protein family, it contains the conserved SLY, SH3, and SAM domains. The 19 kDa SLY domain harbors over 70% of the SASH1 variants associated with pigmentation disorders. However, its solution structure or dynamics have not been investigated yet, and its exact position in the sequence is not clearly defined. Based on the bioinformatic and experimental evidence, we propose renaming this region to the SLy Proteins Associated Disordered Region (SPIDER) and defining the exact position to be amino acids 400-554 of SASH1. We have previously identified a variant in this region linked to a pigmentation disorder, S519N. Here, we used a novel deuteration technique, a suite of TROSY-based 3D NMR experiments, and a high-quality HNN to obtain near complete solution backbone assignment of SASH1's SPIDER. A comparison with the chemical shifts of non-variant (S519) SPIDER shows that the S519N substitution does not alter the free form solution structural propensities of SPIDER. This assignment is the first step to characterize the role of SPIDER in SASH1-mediated cellular functions and provides a model for the future study of sister SPIDER domains in the SLy protein family.


Assuntos
Proteínas Supressoras de Tumor , Linhagem Celular Tumoral , Movimento Celular , Espectroscopia de Ressonância Magnética , Ressonância Magnética Nuclear Biomolecular , Proteínas Supressoras de Tumor/química , Proteínas Supressoras de Tumor/metabolismo
18.
bioRxiv ; 2023 Apr 07.
Artigo em Inglês | MEDLINE | ID: mdl-37066283

RESUMO

Insertion and deletion mutations (indels) are important mechanisms of generating protein diversity. Indels in coding sequences are under considerable selective pressure to maintain reading frames and to preserve protein function, but once generated, indels provide raw material for the acquisition of new protein properties and functions. We reported recently that coding sequence insertions in the Candida albicans NDU1 protein, a mitochondrial protein involved in the assembly of the NADH:ubiquinone oxidoreductase are imperative for respiration, biofilm formation and pathogenesis. NDU1 inserts are specific to CTG-clade fungi, absent in human ortholog and successfully harnessed as drug targets. Here, we present the first comprehensive report investigating indels and clade-defining insertions (CDIs) in fungal proteomes. We investigated 80 ascomycete proteomes encompassing CTG clade species, the Saccharomycetaceae family, the Aspergillaceae family and the Herpotrichiellaceae (black yeasts) family. We identified over 30,000 insertions, 4,000 CDIs and 2,500 clade-defining deletions (CDDs). Insert sizes range from 1 to over 1,000 residues in length, while maximum deletion length is 19 residues. Inserts are strikingly over-represented in protein kinases, and excluded from structural domains and transmembrane segments. Inserts are predicted to be highly disordered. The amino acid compositions of the inserts are highly depleted in hydrophobic residues and enriched in polar residues. An indel in the Saccharomyces cerevisiae Sth1 protein, the catalytic subunit of the RSC (Remodel the Structure of Chromatin) complex is predicted to be disordered until it forms a ß-strand upon interaction. This interaction performs a vital role in RSC-mediated transcriptional regulation, thereby expanding protein function.

19.
In Silico Pharmacol ; 11(1): 11, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37113323

RESUMO

The emergence of resistant bacteria strains against traditional antibiotics and treatments increases each year. Doderlin is a cationic and amphiphilic peptide active against gram-positive, negative and yeast stains. The aim of the present work was prospect potentials receptors associated of antimicrobial activity of Doderlin using in silico bioinformatics tools. To search for potential targets of Doderlin, PharmMapper software was used. Molecular docking between Doderlin and the receptor was performed by PatchDock. Additional interaction and ligand site prediction for each receptor was performed by I-TASSER software. Those PDB Id, 1XDJ (score: 11,746), 1JMH (score: 11,046), 1YR3 (score: 10,578), 1NG3 (score: 10,082) showed highest dock score. Doderlin was found to predicted/real sites co-localize with 1XDJ and 1JMH, enzymes accountable for nitrogenic bases synthesis. The resulting receptor bioprospecting is highly correlated and suggests that Doderlin might act by interfering with DNA metabolism/production of bacteria, altering microorganism homeostasis and growth impairment. Supplementary Information: The online version contains supplementary material available at 10.1007/s40203-023-00149-1.

20.
Mol Microbiol ; 120(1): 45-53, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-36840376

RESUMO

Bacteria are continuously exposed to predation from bacteriophages (phages) and, in response, have evolved a broad range of defence systems. These systems can prevent the replication of phages and other mobile genetic elements (MGE). Defence systems are often encoded together in genomic loci defined as "defence islands", a tendency that has been extensively exploited to identify novel antiphage systems. In the last few years, >100 new antiphage systems have been discovered, and some display homology to components of the immune systems of plants and animals. In many instances, prediction tools have found domains with similar predicted functions present as different combinations within distinct antiphage systems. In this Perspective Article, we review recent reports describing the discovery and the predicted domain composition of several novel antiphage systems. We discuss several examples of similar protein domains adopted by different antiphage systems, including domains of unknown function (DUFs), domains involved in nucleic acid recognition and degradation, and domains involved in NAD+ depletion. We further discuss the potential evolutionary advantages that could have driven the independent acquisition of these domains by different antiphage systems.


Assuntos
Bacteriófagos , Animais , Bacteriófagos/genética , Bactérias/genética , Domínios Proteicos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA