An SVM approach for the prediction of protein crystallization property using sequence derived properties

 

 

Abstract: X-ray crystallography is the most widely used method for protein 3-dimensional structure determination. Selection of target protein that can yield high quality crystal for X-ray crystallography is a challenging task. Prediction of protein crystallization propensity from sequence information is useful for the selection of target protein for crystallization. In this work, we present a machine learning method [SVMCRYS] to classify ‘amenable to crystallization’ from ‘recalcitrant to crystallization’ using sequence derived features. An overall prediction accuracy of 77.44% was obtained from 10-fold cross-validation and 81.68% from resubstitution test. Our performance comparison study shows that SVMCRYS outperforms other methods SECRET, CRYSTALP, OB-score and ParCrys with accuracy of 89.53% on TEST-RL dataset and 86.80% on TEST dataset. SVMCRYS proves to be useful in the improvement of experimental success rate by selecting correct target protein for crystallization.

 

Click here to download the Executables