The webserver accepts up to 1000 proteins. Each protein requires three lines, and multiple proteins should be placed on consecutive lines.

The format of each input protein is as follows:
  • Line 1: >protein ID
  • Line 2: protein sequence (one-letter amino acid encoding)
  • Line 3: comma-separated disorder predictions

Lines 1 and 2 together should follow the FASTA format. The disorder predictions on Line 3 should be formatted as comma-separated values (CSV).
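The three-line layout can be checked before submission with a short script. The following is a minimal sketch, not the webserver's own validation code: the function name `parse_input` is hypothetical, and it assumes that the disorder predictions are numeric and that Line 3 holds one value per residue, neither of which is stated above. The sample record in the usage note uses made-up values.

```python
MAX_PROTEINS = 1000  # limit stated for the webserver


def parse_input(text):
    """Parse three-line records into (protein_id, sequence, predictions) tuples.

    Hypothetical helper for illustration; blank lines are ignored.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 3 != 0:
        raise ValueError("each protein requires exactly three lines")
    if len(lines) // 3 > MAX_PROTEINS:
        raise ValueError(f"at most {MAX_PROTEINS} proteins are accepted")

    records = []
    for i in range(0, len(lines), 3):
        header, sequence, csv_line = lines[i:i + 3]
        if not header.startswith(">"):
            raise ValueError(f"line {i + 1}: expected '>protein ID'")
        # Assumption: predictions are numeric values.
        predictions = [float(v) for v in csv_line.split(",")]
        # Assumption: one prediction per residue in the sequence.
        if len(predictions) != len(sequence):
            raise ValueError(
                f"{header}: {len(predictions)} predictions "
                f"for {len(sequence)} residues"
            )
        records.append((header[1:].strip(), sequence, predictions))
    return records
```

For instance, `parse_input(">P1\nMKV\n0.1,0.8,0.3\n")` yields a single record with ID `P1`, sequence `MKV`, and three prediction values.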

Here is an example input file.


  • Training dataset - the sequences, disorder predictions, and native disorder annotations for the training dataset
  • Test dataset - the sequences, disorder predictions, native disorder annotations, and predicted quality assessment scores for the test dataset
  • Human proteome - the sequences, disorder predictions, native disorder annotations, predicted quality assessment scores, and annotations of high quality correct predictions for the human proteome
  • README - descriptions of the file formats for the datasets above


