RESUMEN
The complex cellular milieu can spontaneously demix, or phase separate, in a process controlled in part by intrinsically disordered (ID) proteins. A protein's propensity to phase separate is thought to be driven by a preference for protein-protein over protein-solvent interactions. The hydrodynamic size of monomeric proteins, as quantified by the polymer scaling exponent (v), is driven by a similar balance. We hypothesized that mean v, as predicted by protein sequence, would be smaller for proteins with a strong propensity to phase separate. To test this hypothesis, we analyzed protein databases containing subsets of proteins that are folded, disordered, or disordered and known to spontaneously phase separate. We find that the phase-separating disordered proteins, on average, had lower calculated values of v compared with their non-phase-separating counterparts. Moreover, these proteins had a higher sequence-predicted propensity for ß-turns. Using a simple, surface area-based model, we propose a physical mechanism for this difference: transient ß-turn structures reduce the desolvation penalty of forming a protein-rich phase and increase exposure of atoms involved in π/sp2 valence electron interactions. By this mechanism, ß-turns could act as energetically favored nucleation points, which may explain the increased propensity for turns in ID regions (IDRs) utilized biologically for phase separation. Phase-separating IDRs, non-phase-separating IDRs, and folded regions could be distinguished by combining v and ß-turn propensity. Finally, we propose a new algorithm, ParSe (partition sequence), for predicting phase-separating protein regions, and which is able to accurately identify folded, disordered, and phase-separating protein regions based on the primary sequence.