This is a project for data analysis based on Alibaba trace of 2018 (https://github.com/alibaba/clusterdata/tree/master/cluster-trace-v2018).
The architecutre of this project is shown below:
This includes some test and initial work.
This includes the process of converting csv to parquet format.
This includes the logic of drawing graphs using Matplotlib.
This calculate some staging results for the processing.
This includes the graphs drawn by the project.