This repository contains Kubernetes configurations to deploy Clickhouse and Superset on Minikube. Clickhouse serves as an open-source data warehouse, while Superset is utilized for business intelligence purposes, enabling visualization and analysis of data stored in Clickhouse.
Before getting started, ensure you have the following installed:
- Minikube: Local Kubernetes cluster management tool.
- kubectl: Kubernetes command-line tool for interacting with clusters.
- Docker: Containerization platform knowledge for building and managing images.
- Description: StatefulSet deployment for Clickhouse with persistent storage of 10GB.
- Image: Uses
yandex/clickhouse-server:latest
. - Ports: Exposes port 9000 for native Clickhouse communication.
- Description: Deployment of Superset, a BI tool for creating reports and visualizations.
- Image: Uses
apache/superset:latest
. - Ports: Exposes port 8088 for accessing the Superset UI.
- Clone the Repository:
git clone https://github.com/saiguptha2003/datazipAssessment.git cd datazipAssessment
- Deploy ClickHouse and apache superset:
kubectl apply -f clickhouse.yaml kubectl apply -f superset.yaml
- Accessing Superset::
kubectl describe service superset | grep NodePort
In Superset UI, navigate to Data > Databases. Add a new database connection for Clickhouse using the provided connection string.
Ensure Docker is running and accessible to Minikube for pulling images.
Adjust YAML configurations for production deployments as necessary.
Name: V D Panduranga Sai Guptha
Email: [email protected]
GitHub: saiguptha2003
LinkedIn: saiguptha2003