RESUMO
MOTIVATION: While the release of AlphaFold (AF) represented a breakthrough for the prediction of protein complex structures, its sensitivity, especially when using full length protein sequences, still remains limited. Modeling success rates might increase if AF predictions were guided by likely interacting protein fragments. This approach requires available sets of highly confident protein-protein interface types. Computational resources, such as 3did, infer interacting globular domain types from observed contacts in protein structures. Assessing the accuracy of these predicted interface types is difficult because we lack hand-curated reference sets of verified domain-domain interface (DDI) types. RESULTS: To improve protein complex modeling of DDIs by AF, we manually inspected 80 randomly selected DDI types from the 3did resource to generate a first reference set of DDI types. Identified cases of DDI type nonapproval (40%) primarily resulted from inaccurate Pfam domain matches, crystal contacts, and synthetic protein constructs. Using logistic regression, we predicted a subset of 2411 out of 5724 considered DDI types in 3did to be of high confidence, which we subsequently applied to 53 000 human-protein interactions to predict DDIs followed by AF modeling. We obtained highly confident AF models for 604 out of 1129 predicted DDIs. Of note, for 47% of them no confident AF structural model could be obtained using full length protein sequences. AVAILABILITY AND IMPLEMENTATION: Code is available at https://github.com/KatjaLuckLab/DDI_manuscript.
Assuntos
Proteínas , Proteínas/química , Proteínas/metabolismo , Domínios Proteicos , Modelos Moleculares , Bases de Dados de Proteínas , Software , Biologia Computacional/métodos , Conformação ProteicaRESUMO
Structural resolution of protein interactions enables mechanistic and functional studies as well as interpretation of disease variants. However, structural data is still missing for most protein interactions because we lack computational and experimental tools at scale. This is particularly true for interactions mediated by short linear motifs occurring in disordered regions of proteins. We find that AlphaFold-Multimer predicts with high sensitivity but limited specificity structures of domain-motif interactions when using small protein fragments as input. Sensitivity decreased substantially when using long protein fragments or full length proteins. We delineated a protein fragmentation strategy particularly suited for the prediction of domain-motif interfaces and applied it to interactions between human proteins associated with neurodevelopmental disorders. This enabled the prediction of highly confident and likely disease-related novel interfaces, which we further experimentally corroborated for FBXO23-STX1B, STX1B-VAMP2, ESRRG-PSMC5, PEX3-PEX19, PEX3-PEX16, and SNRPB-GIGYF1 providing novel molecular insights for diverse biological processes. Our work highlights exciting perspectives, but also reveals clear limitations and the need for future developments to maximize the power of Alphafold-Multimer for interface predictions.
Assuntos
Proteínas de Transporte , Proteínas , Humanos , Proteínas/metabolismo , Proteínas de Membrana/metabolismoRESUMO
Gut microbiota are essential for host health and survival, but we are still far from understanding the processes involved in shaping their composition and evolution. Controlled experimental work under lab conditions as well as human studies pointed at environmental factors (i.e., diet) as the main determinant of the microbiota with little evidence of genetic effects, while comparative interspecific studies detected significant phylogenetic effects. Different species, however, also differ in diet, feeding behavior, and environmental characteristics of habitats, all of which also vary interspecifically, and, therefore, can potentially explain most of the detected phylogenetic patterns. Here, we take advantage of the reproductive strategy of avian brood parasites and investigate gut microbiotas (esophageal (food and saliva) and intestinal) of great spotted cuckoo (Clamator glandarius) and magpie (Pica pica) nestlings that grow in the same nests. We also estimated diet received by each nestling and explored its association with gut microbiota characteristics. Although esophageal microbiota of magpies and great spotted cuckoos raised within the same environment (nest) did not vary, the microbiota of cloacal samples showed clear interspecific differences. Moreover, diet of great spotted cuckoo and magpie nestlings explained the microbiota composition of esophageal samples, but not of cloaca samples. These results strongly suggest a genetic component determining the intestinal microbiota of host and parasitic bird species, indicating that interspecific differences in gut morphology and physiology are responsible for such interspecific differences.