This web page provides datasets associated
with
Mizianty M, Kurgan LA, 2009. Meta Prediction of Protein Crystallization Propensity. Biochemical and Biophysical Research Communications, 390(1):10-15
Three dataset are used:
- TEST144 was developed by Overton
and colleagues (2008)
(originally this set was called TEST) and
can be downloaded from http://compbio.dundee.ac.uk/xtal/
- The TRAINING and TEST500 dataset
can be downloaded from TRAINING set link and TEST500 set link
The datasets are in a tab-separated
text file format with three columns that correspond to sequence id
(from either TargetDBTargetDB or PepcDB), annotation of the class label
(crystallizable or noncrystallizable), and sequence.
|