Repository containing the code for Datastorm 2.0
sc-datastorm-2's Introduction
This is Stat_Collective's repository for our work in Datastorm 2.0.
- Load the dataset directly from CSV into a pandas DataFrame
- The train, validation and test sets are loaded separately.
- Duplicate the dataset for undersampling (skip this for the first fit)
- Check for Missing Data
- Check for ordinal data masked as numerical data
- Plots and charts for Data
- Dummy variables
- Impute variables?
- Feature Selection
- Scaling
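The preprocessing steps listed above could be sketched roughly as follows. This is a minimal illustration, not the team's actual notebook: the column names (`age`, `income`, `grade`, `city`, `label`), the inline CSV text, the low-cardinality threshold of 3, and the choice of median imputation with standard scaling are all assumptions made for the example.

```python
import io

import pandas as pd
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler

# Hypothetical stand-in for the competition training file. In practice this
# would be pd.read_csv(<path>) called once each for train, validation and test.
TRAIN_CSV = io.StringIO(
    "age,income,grade,city,label\n"
    "25,50000,1,Colombo,0\n"
    "32,,2,Kandy,1\n"
    "47,82000,3,Colombo,0\n"
    "51,61000,2,Galle,1\n"
    "38,45000,1,Kandy,0\n"
    "29,73000,3,Galle,1\n"
)
train = pd.read_csv(TRAIN_CSV)

# Check for missing data: number of NaN values per column.
missing_counts = train.isna().sum()

# Flag numeric columns with very few distinct values; these may be ordinal
# codes rather than true numeric measurements (threshold is an assumption).
numeric_cols = train.select_dtypes("number").columns
low_cardinality = [
    c for c in numeric_cols if c != "label" and train[c].nunique() <= 3
]

X = train.drop(columns="label")
y = train["label"]

# Dummy variables for the categorical column.
X = pd.get_dummies(X, columns=["city"], dtype=int)

# Impute missing values with the column median (one possible strategy).
imputer = SimpleImputer(strategy="median")
X_imputed = pd.DataFrame(imputer.fit_transform(X), columns=X.columns)

# Scale features. Fit on the training set only, then reuse the fitted
# imputer and scaler via .transform() on the validation and test sets.
scaler = StandardScaler()
X_scaled = pd.DataFrame(scaler.fit_transform(X_imputed), columns=X.columns)
```

Fitting the imputer and scaler on the training set alone and only calling `.transform()` on the validation and test sets keeps information from those sets out of the preprocessing.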
3.2 Random Forest Classification
3.6 Support Vector Machine
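As a rough sketch of how the two classifiers named above might be fitted with scikit-learn: the synthetic data from `make_classification`, the `train_test_split` call (the real train and validation sets are loaded separately), and every hyperparameter value shown here are assumptions for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic, imbalanced stand-in for the competition data (assumption).
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=4,
    weights=[0.8, 0.2], random_state=0,
)
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Random forest: tree ensembles do not need feature scaling.
rf = RandomForestClassifier(
    n_estimators=100, class_weight="balanced", random_state=0
)
rf.fit(X_train, y_train)

# SVM: sensitive to feature scale, so the scaler is bundled in a pipeline.
svm = make_pipeline(
    StandardScaler(), SVC(kernel="rbf", class_weight="balanced")
)
svm.fit(X_train, y_train)

# Predictions on the validation set for later comparison.
rf_val_pred = rf.predict(X_val)
svm_val_pred = svm.predict(X_val)
```

Using `class_weight="balanced"` is one alternative to undersampling when the classes are imbalanced; whether it helps depends on the data.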
4. Model Evaluation (Hyperparameter Tuning)
- F1 score
- Confusion matrix
- F1 score
- Confusion matrix
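The evaluation step above could look something like the following sketch, which tunes a random forest with a small grid search scored on F1 and then reports the F1 score and confusion matrix on a held-out set. The synthetic data, the parameter grid, and the 3-fold cross-validation are assumptions chosen to keep the example small.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import confusion_matrix, f1_score
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in for the competition data (assumption).
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=4, random_state=0
)
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Small illustrative grid; a real search would cover more values.
param_grid = {"n_estimators": [50, 100], "max_depth": [3, None]}
search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid,
    scoring="f1",  # select hyperparameters by F1 rather than accuracy
    cv=3,
)
search.fit(X_train, y_train)

# Evaluate the refitted best model on the held-out validation set.
val_pred = search.predict(X_val)
val_f1 = f1_score(y_val, val_pred)
val_cm = confusion_matrix(y_val, val_pred)  # rows: true class, cols: predicted

print("best params:", search.best_params_)
print("validation F1:", round(val_f1, 3))
print(val_cm)
```

The same `f1_score` and `confusion_matrix` calls can be repeated on the test set once the hyperparameters are fixed.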