The purpose of this project is to practice:
- Data Pipeline
- Data Quality Control
- Modelling
This project uses Poetry for dependency management.
To install dependencies, run:
poetry install --no-root
This project uses Kedro to manage multiple data pipelines, including:
- Data Engineering
- Feature Engineering
- Modelling (Training)
- Validation
To run the project, run:
kedro run
Or run a single pipeline with:
kedro run --pipeline=data-engineering
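To run several pipelines one after another, the command above can be wrapped in a small script. The following is a minimal sketch, not part of this project: the helper names `kedro_run_command` and `run_pipelines` are hypothetical, and it assumes `kedro` is on the PATH and that the script is started from the project root.

```python
import subprocess


def kedro_run_command(pipeline=None):
    """Build the argument list for `kedro run`, optionally for one pipeline."""
    cmd = ["kedro", "run"]
    if pipeline is not None:
        cmd.append(f"--pipeline={pipeline}")
    return cmd


def run_pipelines(names):
    """Run each named pipeline in order, stopping at the first failure."""
    for name in names:
        # check=True raises CalledProcessError on a non-zero exit code
        subprocess.run(kedro_run_command(name), check=True)
```

For example, `run_pipelines(["data-engineering"])` is equivalent to the single command above. Only `data-engineering` is named in this README, so any other pipeline name passed in must match one registered in the project.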
Tests are in src/tests/test_run.py. Run them with:
kedro test
To configure the coverage threshold, edit the .coveragerc file.
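As an illustration, coverage.py reads the threshold from the `fail_under` option in the `[report]` section. The value below is a made-up example, and the project's actual .coveragerc may contain other settings:

```ini
[report]
# Hypothetical threshold: the run fails if total coverage is below 80%
fail_under = 80
```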