Computational methods exploit the sequence signatures of disorder to predict whether a protein is disordered, given its amino acid sequence. The table below, which was originally adapted from[1] and has been recently updated, shows the main features of software for disorder prediction. Note that different software use different definitions of disorder.
Predict the protein intrinsic disorder regions, degree of disorder as well as folding patterns.
Based on five amino acids, the folding variations along sequence are presented by Protein Folding Shape Code (PFSC) in Protein Folding Variation Matrix (PFVM).
Assigns binary order/disorder class and corresponding confidence score for each protein residues using optimized SVM with Radial basis kernel from protein sequence
AA composition, Physical Properties, Helix, strand and coil probability, Accessible surface area, torsion angle fluctuation, monogram, bigram.
Disorder definitions include: missing x-ray atoms (short), Disprot style disorder (long), and NMR flexibility. A probability of disorder is supplied with two decision thresholds which depend on a user preferred false positive rate.
Bi-directional neural networks with diverse and high quality data derived from the Protein Data Bank and DisProt. Compares extremely well with other CASP 9 servers. The method was designed to be very fast.
Regions that lack a well-defined 3D structure under native conditions (REMARK-465)
Meta method, which uses other disorder predictors (like RONN, IUPred, POODLE, and many more). Based on them the consensus is calculated according method accuracy (optimized using ANN, filtering and other techniques). Currently the best available method (first 2 places in last CASP experiment (blind test))
Disorder definitions include: missing x-ray atoms (short) and DisProt style disorder (long). A probability of disorder is supplied with two decision thresholds which depend on the false positive rate. Linear motifs within a disorder segment are determined by simple pattern matching from ELM.
Support Vector Machine and Bi-directional neural networks with high quality and diverse data derived from the Protein Data Bank and Disprot. Structural information is also supplied in the form of homologous templates. Compares extremely well with other CASP 9 servers.
Different types of disorder including random coils, unstructured regions, molten globules, and REMARK-465-based regions.
An ensemble of 3 SVMs specialized for the prediction of short, long and generic disordered regions, which combines three complementary disorder predictors, sequence, sequence profiles, predicted secondary structure, solvent accessibility, backbone dihedral torsion angles, residue flexibility and B-factors. MFDp (unofficially) secured 3rd place in last CASP experiment)
LOOPS (regions devoid of regular secondary structure); HOT LOOPS (highly mobile loops); REMARK465 (regions lacking electron density in crystal structure)
Low-complexity segments that is, “simple sequences” or “compositionally biased regions”.
Locally optimized low-complexity segments are produced at defined levels of stringency and then refined according to the equations of Wootton and Federhen
The transition between structurally ordered and mobile or disordered amino acids intervals under native conditions.
OnD-CRF applies Conditional Random Fields, CRFs, which rely on features generated from the amino acid sequence and from secondary structure prediction.
Regions of different "types". MeDor provides a unified view of multiple disorder predictors.
Meta method, which uses other disorder predictors (like FoldIndex, DisEMBL REMARK465, IUPred, RONN ...) and provides additional features (like HCA plot, Secondary Structure prediction, Transmembrane domains ... ) that all together help the user in defining regions involved in disorder.
No
References
^Ferron F, Longhi S, Canard B, Karlin D (October 2006). "A practical overview of protein disorder prediction methods". Proteins. 65 (1): 1–14. doi:10.1002/prot.21075. PMID16856179. S2CID30231497.
^Hanson J, Paliwal K, Zhou Y (2018). "Accurate Single-Sequence Prediction of Protein Intrinsic Disorder by an Ensemble of Deep Recurrent and Convolutional Architectures". Journal of Chemical Information and Modeling. 58 (11): 2369–2376. doi:10.1021/acs.jcim.8b00636. hdl:10072/382201. PMID30395465. S2CID53235372.
^Ward JJ, Sodhi JS, McGuffin LJ, Buxton BF, Jones DT (March 2004). "Prediction and functional analysis of native disorder in proteins from the three kingdoms of life". J. Mol. Biol. 337 (3): 635–45. CiteSeerX10.1.1.120.5605. doi:10.1016/j.jmb.2004.02.002. PMID15019783.
^Sormanni P, Camilloni C, Fariselli P, Vendruscolo M (February 2015). "The s2D Method: Simultaneous Sequence- Based Prediction of the Statistical Populations of Ordered and Disordered Regions in Proteins". J. Mol. Biol. 427 (4): 982–996. doi:10.1016/j.jmb.2014.12.007. PMID25534081.