Search | VHL Regional Portal

AlphaFold2-guided description of CoBaHMA, a novel family of bacterial domains within the heavy-metal-associated superfamily.

Gaschignard, Geoffroy; Millet, Maxime; Bruley, Apolline; Benzerara, Karim; Dezi, Manuela; Skouri-Panet, Feriel; Duprat, Elodie; Callebaut, Isabelle.

Proteins ; 92(6): 776-794, 2024 Jun.

Article in English | MEDLINE | ID: mdl-38258321

ABSTRACT

Three-dimensional (3D) structure information, now available at the proteome scale, may facilitate the detection of remote evolutionary relationships in protein superfamilies. Here, we illustrate this with the identification of a novel family of protein domains related to the ferredoxin-like superfold, by combining (i) transitive sequence similarity searches, (ii) clustering approaches, and (iii) the use of AlphaFold2 3D structure models. Domains of this family were initially identified in relation with the intracellular biomineralization of calcium carbonates by Cyanobacteria. They are part of the large heavy-metal-associated (HMA) superfamily, departing from the latter by specific sequence and structural features. In particular, most of them share conserved basic amino acids (hence their name CoBaHMA for Conserved Basic residues HMA), forming a positively charged surface, which is likely to interact with anionic partners. CoBaHMA domains are found in diverse modular organizations in bacteria, existing in the form of monodomain proteins or as part of larger proteins, some of which are membrane proteins involved in transport or lipid metabolism. This suggests that the CoBaHMA domains may exert a regulatory function, involving interactions with anionic lipids. This hypothesis might have a particular resonance in the context of the compartmentalization observed for cyanobacterial intracellular calcium carbonates.

Subject(s)

Amino Acid Sequence , Bacterial Proteins , Metals, Heavy , Models, Molecular , Bacterial Proteins/chemistry , Bacterial Proteins/metabolism , Bacterial Proteins/genetics , Metals, Heavy/chemistry , Metals, Heavy/metabolism , Protein Domains , Cyanobacteria/metabolism , Cyanobacteria/chemistry , Cyanobacteria/genetics , Ferredoxins/chemistry , Ferredoxins/metabolism , Protein Folding

A sequence-based foldability score combined with AlphaFold2 predictions to disentangle the protein order/disorder continuum.

Bruley, Apolline; Bitard-Feildel, Tristan; Callebaut, Isabelle; Duprat, Elodie.

Proteins ; 91(4): 466-484, 2023 04.

Article in English | MEDLINE | ID: mdl-36306150

ABSTRACT

Order and disorder govern protein functions, but there is a great diversity in disorder, from regions that are-and stay-fully disordered to conditional order. This diversity is still difficult to decipher even though it is encoded in the amino acid sequences. Here, we developed an analytic Python package, named pyHCA, to estimate the foldability of a protein segment from the only information of its amino acid sequence and based on a measure of its density in regular secondary structures associated with hydrophobic clusters, as defined by the hydrophobic cluster analysis (HCA) approach. The tool was designed by optimizing the separation between foldable segments from databases of disorder (DisProt) and order (SCOPe [soluble domains] and OPM [transmembrane domains]). It allows to specify the ratio between order, embodied by regular secondary structures (either participating in the hydrophobic core of well-folded 3D structures or conditionally formed in intrinsically disordered regions) and disorder. We illustrated the relevance of pyHCA with several examples and applied it to the sequences of the proteomes of 21 species ranging from prokaryotes and archaea to unicellular and multicellular eukaryotes, for which structure models are provided in the AlphaFold protein structure database. Cases of low-confidence scores related to disorder were distinguished from those of sequences that we identified as foldable but are still excluded from accurate modeling by AlphaFold2 due to a lack of sequence homologs or to compositional biases. Overall, our approach is complementary to AlphaFold2, providing guides to map structural innovations through evolutionary processes, at proteome and gene scales.

Subject(s)

Proteome , Amino Acid Sequence , Proteome/metabolism , Protein Structure, Secondary , Hydrophobic and Hydrophilic Interactions , Protein Domains

Digging into the 3D Structure Predictions of AlphaFold2 with Low Confidence: Disorder and Beyond.

Bruley, Apolline; Mornon, Jean-Paul; Duprat, Elodie; Callebaut, Isabelle.

Biomolecules ; 12(10)2022 10 13.

Article in English | MEDLINE | ID: mdl-36291675

ABSTRACT

AlphaFold2 (AF2) has created a breakthrough in biology by providing three-dimensional structure models for whole-proteome sequences, with unprecedented levels of accuracy. In addition, the AF2 pLDDT score, related to the model confidence, has been shown to provide a good measure of residue-wise disorder. Here, we combined AF2 predictions with pyHCA, a tool we previously developed to identify foldable segments and estimate their order/disorder ratio, from a single protein sequence. We focused our analysis on the AF2 predictions available for 21 reference proteomes (AFDB v1), in particular on their long foldable segments (>30 amino acids) that exhibit characteristics of soluble domains, as estimated by pyHCA. Among these segments, we provided a global analysis of those with very low pLDDT values along their entire length and compared their characteristics to those of segments with very high pLDDT values. We highlighted cases containing conditional order, as well as cases that could form well-folded structures but escape the AF2 prediction due to a shallow multiple sequence alignment and/or undocumented structure or fold. AF2 and pyHCA can therefore be advantageously combined to unravel cryptic structural features in whole proteomes and to refine predictions for different flavors of disorder.

Subject(s)

Furylfuramide , Proteome , Proteome/chemistry , Amino Acid Sequence , Sequence Alignment , Amino Acids/chemistry , Protein Conformation

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL