Snowplow is a complete open source event data collection plaform. It support many different type tracker, like android, ios, javascript and etc.
change environment varible in gcloud-config-mustr.sh
to your gcp project and save new file to gcloud-config.sh
.
execute gcloud-init-project.sh
to set up relate gcp services.
$ ./gcloud-init-project.sh
start pipeline:
$ ./start_etl.sh
stop pipeline:
$ ./stop_etl.sh
- create the schema dictionary structure and schema file like below:
schemas/<prefix1>.<prefix2>.<prefix3>/<sub_path>/jsonschema/1-0-0
schemas
└── com.company.699
├── funnel_event
│ └── jsonschema
│ └── 1-0-0
└── user_context
└── jsonschema
└── 1-0-0
- validate the format of schema files are correct.
$ igluctl lint schemas/<prefix1>.<prefix2>.<prefix3>/<sub_path>/jsonschema/*
- upload schema file to GCS
$ gsutil cp schemas/<prefix1>.<prefix2>.<prefix3>/<sub_path>/jsonschema/1-0-0 gs://<YOUR_GCS_BUCKET>/schemas/<prefix1>.<prefix2>.<prefix3>/<sub_path>/jsonschema/1-0-0