Collective Knowledge Framework (CK)

Linux/MacOS: Windows: Coverage:

Project websites

Introduction

Collective Knowledge is a small, cross-platform and community driven Python framework to abstract, automate, share, reproduce and reuse any R&D task in the form of actions, workflows and Python modules. Such actions and workflows have a unified API, CLI, JSON meta-description, web interface and UID to simplify the integration with existing production systems.

Our long-term goal is to enable collaborative, reproducible and production-ready research based on DevOps principles. We want researchers and engineers to focus on innovation while CK takes care of the rest (see CK real use cases, publications and the FOSDEM'19 presentation for more details):

CK actions can be shared and reused across research projects: see the list of available actions and modules.
Standardized CK APIs and meta-descriptions help users to easily connect actions into automated, portable and customizable workflows, and quickly integrate them with practically all major tools, frameworks and Continuous Integration Services: see the list of shared repositories with CK workflows and actions.
CK helps to perform reproducible experiments and generate papers with reusable research components: see the list of articles with CK workflows and the CK-based interactive report with the Raspberry Pi foundation.
CK can be used to crowdsource autotuning and co-design of efficient software, hardware and models for emerging AI, ML and quantum computing workloads in terms of speed, accuracy, energy and costs: see the live CK dashboard with results from different Hackathons, tournaments and crowd-tuning campaigns.

Thanks to your feedback we are brainstorming a new and backward-compatible version of CK - please stay tuned!

Our community

Don't hesitate to ask questions using our public mailing list and the Slack channel.

You are also welcome to help the community fix or improve third-party components (actions, modules, packages, software plugins, workflows) when they fail on new platforms or miss some functionality - you can provide your feedback and report bugs in the respective GitHub repositories!

Documentation

CK wiki - a major revision is planned.

Installation

You can install CK from PyPi via pip install ck or pip install ck --user.

You can then test that it works from the command line via ck version or from your python environment as follows:

$ python

> import ck.kernel as ck
> ck.version({})

CK requires just a few tools in your PATH:

Python 2.7 or 3.3+ with PIP (limitation is mainly due to unitests). CK automatically adapts to Python 2 or 3 and provides extra API to let users write workflows for any Python version;
Git command line client;
wget (Linux/MacOS).

Example of installing dependencies and CK across different platforms:

	Ubuntu	MacOS	Windows
Third-party	`sudo apt-get install python3 python3-pip git wget`	`brew install python3 git wget`	1) Download and install Git from https://git-for-windows.github.io 2) Download and install any Python from https://www.python.org/downloads/windows
PyPi	`sudo pip install ck` or `pip install ck --user` Check that ck is in your PATH: `ck version`	`pip install ck`	`pip install ck` You can also download a CK installer which already includes Git 2.20.1 and Python 3.7.2 Just unzip it and run `install-pip.bat` to install CK via PIP This script will install Python in your dedicated directory and will ask you to add several environment variables to your system (just copy/paste them) - that's all!
GitHub	`git clone http://github.com/ctuning/ck` `export PATH=$PWD/ck/bin:$PATH` `export PYTHONPATH=$PWD/ck:$PYTHONPATH` You can also set CK environment variables and test dependencies using provided script as follows: `git clone http://github.com/ctuning/ck` `. ./set-env.sh`	Similar to Ubuntu	`git clone https://github.com/ctuning/ck.git ck-master` `set PATH={CURRENT PATH}\ck-master\bin;%PATH%` `set PYTHONPATH={CURRENT PATH}\ck-master;%PYTHONPATH%` You can also download a CK installer which already includes Git 2.20.1 and Python 3.7.2 Just unzip it and run `install-github.bat` to install CK from GitHub This script will install Python in your dedicated directory and will ask you to add several environment variables to your system (just copy/paste them)

CK allows very flexible customization to adapt to your platform and different requirements as described here. For example, you can change directories where to store CK repositories or install CK packages, change search paths during software detection (useful for HPC setups) and so on.

If you experience problems with installation and customization, please tell us or open a GitHub issue.

Usage

When you see an archive or a repository with a badge , it means you can reuse its functionality (code, data, models, packages, workflows) via unified CK interfaces, and integrate it with your own projects.

For example, you can pull ck-tensorflow and use different automation tasks such detecting or rebuilding different TensorFlow versions across diverse platforms, running AI/ML workflows and many more:

$ ck pull repo:ck-tensorflow

$ ck detect platform

$ ck install package:lib-tensorflow-1.12.0-cpu

$ ck run program:tensorflow-classification

$ ck show env

Note that tensorflow 1.12.0 can work only with Python version<=3.6 - please select an appropriate Python version during automatic environment detection

You can find a non-exhaustive index of CK-compatible repositories at ReproIndex.com - just follow their READMEs to find out more about shared components and workflows!

Please check our Getting Started Guide to try different CK examples.

Examples of popular and reusable CK-based automation tasks

Pull one of CK repositories with shared benchmarks, data sets, software detection plugins, packages, etc:

$ ck pull repo:ck-crowdtuning

See the list of installed CK repos:

$ ck ls repo | sort

Find where CK repository with benchmarks is installed on your machine and browse it to get familiar with the structure (consistent across all repos):

$ ck where repo:ctuning-programs

Detect your platform properties via extensible CK plugins as follows (needed to unify benchmarking across diverse platforms with Linux, Windows, MacOS and Android):

$ ck detect platform

Check JSON output

$ ck detect platform --out=json

Now detect available compilers on your machine and register virtual environments in the CK:

$ ck detect soft --tags=compiler,gcc
$ ck detect soft --tags=compiler,llvm
$ ck detect soft --tags=compiler,icc

See virtual environments in the CK:

$ ck show env

Find and explore CK env entries:

$ ck search env --tags=compiler

We recommend to setup CK to install new packages inside CK virtual env entries:

$ ck set kernel var.install_to_env=yes

Try to install LLVM binary via CK packages:

$ ck install package --tags=llvm

Check available data sets:

$ ck search dataset
$ ck search dataset --tags=jpeg

Now you can compile and run shared benchmarks with some data sets, benchmark and crowd-tune some C program.

$ ck ls program

Let's check the CK JSON meta for benchmark "cbench-automotive-susan":

$ ck load program:cbench-automotive-susan --min

Now let's compile and run it:

$ ck compile program:cbench-automotive-susan --speed
$ ck run program:cbench-automotive-susan

You can now benchmark this program (CK will execute several times while monitoring the state of the system):

$ ck benchmark program:cbench-automotive-susan

Finally, you can autotune this program using shared CK autotuning scenarios, record results and reply them:

$ ck autotune program:cbench-automotive-susan

You can also crowdtune this program, i.e. autotune it while sharig best results in the public repository:

$ ck crowdtune program:cbench-automotive-susan

You can now add (and later customize) your own program workflow using shared templates as follows:

$ ck add program:my-new-program

When CK asks you to select a template, please choose "C program "Hello world". You can then immediately compile and run your C program as follows:

$ ck compile program:my-new-program --speed
$ ck run program:my-new-program
$ ck run program:my-new-program --env.CK_VAR1=222

Example of a unified image classification via TensorFlow and Caffe

Get shared ck-tensorflow repository with all dependencies:

$ ck pull repo:ck-tensorflow

Now install CPU-version of TensorFlow via CK packages:

$ ck install package --tags=lib,tensorflow,vcpu,vprebuilt,v1.11.0

Check that it's installed fine:

$ ck show env --tags=lib,tensorflow

You can find a path to a given entry (with TF installation) as follows:

$ ck find env:{env UID from above list}

Run CK virtual environment and test TF:

$ ck virtual env --tags=lib,tensorflow
$ ipython
> import tensorflow as tf

Run CK classification workflow example using installed TF:

$ ck run program:tensorflow --cmd_key=classify

Now you can try a more complex example to build Caffe with CUDA support and run classification. Note that CK should automatically detect your CUDA compilers, libraries and other deps or install missing packages:

$ ck pull repo --url=https://github.com/dividiti/ck-caffe
$ ck install package:lib-caffe-bvlc-master-cuda-universal
$ ck run program:caffe --cmd_key=classify

You can see how to install Caffe for Linux, MacOS, Windows and Android via CK here.

Adding your own repository and API:

You can make your existing Git repository compatible with CK as follows:

$ ck pull repo --url={URL of your Git repository}

CK will add .ckr.json file to the root of your repository which you should commit back to your repository - that's all!

You can then add CK components to your repository using it's public Git name, for example my-repo:

For example, you can now add a CK module to prepare APIs:

$ ck add my-repo:module:hello

It will create an entry "module:hello" in the my-repo with a dummy module.py:

$ ls `ck find module:hello`

Now you can add "say" API to the CK python module "hello":

$ ck add_action module:hello --func=say

CK will add a dummy function "say" in the module.py in "module:hello" which you can immediately use (!):

$ ck say hello
$ ck say hello --out=json

Furthermore, you can now create a data entry for your module "hello":

$ ck add hello:world --tags=cool,api

$ ck search hello --tags=api

$ ck say hello:world

$ ck ren hello:world hello:team

$ ck say hello:team

Alternatively, you can create a local (non-shared) repository to gradually organize your code and data in the CK format as follows:

$ ck add repo:my-repo --quiet

$ ck where repo:my-repo

$ ck ls repo:my-*

Such simple approach allowed our partners to gradually abstract complex AI, ML, and quantum experiments via shared CK APIs, crowdsource experiments, and even automatically generate reproducible and interactive articles with reusable research components!

Calling CK functions from other languages

We developed a small OpenME library to connect CK with different languages including C, C++, Fortran and Java.

Trying CK from a Docker image

You can try CK using the following Docker image:

 $ (sudo) docker run -it ctuning/ck-ubuntu-18.04

Note that we added Docker automation to CK to help evaluate artifacts at the conferences, share interactive and reproducible articles, crowdsource experiments and so on.

For example, you can participate in GCC or LLVM crowd-tuning on your machine as follows:

 $ (sudo) docker run ck-crowdtune-gcc
 $ (sudo) docker run ck-crowdtune-llvm

Top optimization results are continuously aggregated in the live CK repository: http://cKnowledge.org/repo .

Contributing

Please follow this guide to add your workflows and components. Note that major revision to simplify this guide based on your feedback is planned in 2019!

Feel free to provide your suggestions using our public mailing list and the Slack channel!

velazquezgilberto / ck Goto Github PK

ck's Introduction

Collective Knowledge Framework (CK)

Project websites

Introduction

Our community

Documentation

Installation

Usage

Examples of popular and reusable CK-based automation tasks

Example of a unified image classification via TensorFlow and Caffe

Adding your own repository and API:

Calling CK functions from other languages

Trying CK from a Docker image

Contributing

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent