Redox-sensitive protein predictor, based on Deep Neural Network
Prediction of redox active proteins is important for analysis of redox signaling pathways.
Model is created with cross-reference on OxICAT data from an article
Model is evaluated on data from an article
The predictor is implemented on F# with Microsoft Cognitive Tool (CNTK). As a base for implementation the code of Mathias Brandewinder was used, as well as general tutorial for creating different NN-architectures with CNTK.
-
Feature feeding
Extract features out of the protein sequence for each Cysteine residue and use them as input value for the NN.
Set level of oxidation of a Cysteine as a corresponding output.
Use an architecture of vanilla Neural Network with several hidden layers -
Sequence feeding
Break the whole protein sequence into peptides such that there were peptides from experiment (containing active Cysteine residue) and rest of the sequence and use them as sequential input value for the NN.
Set level of oxidation of a Cysteine as a corresponding output for Cys-containing peptide or 0 otherwise.
Use an architecture of Recurrent Neural Network with first set of layers for peptide2word embedding.