Giter Club home page Giter Club logo

jeyn's Introduction

Jeyn

Welcome to jeyn, jeyn is an open source framework that aims at providing an infrastructure for production ML applications and give everything a DS team needs to be able to hit the ground running with as little dev-ops knowledge as possible.

As of writting jeyn is still in its infency and very much not ready to be used in any kind of production context.

core concepts

basic abstractions

Jeyn's python sdk is based around several main abastractions that you will interact with to build your machine learning pipelines

dataset abstractions

  • dataset formula a dataset formula is like a cooking reciepe for a given dataset, it tells other jeyn users how your dataset has been built for instance if it is a streaming pipeline that updates files regularly. Or another dataset passed through a model.
  • dataset batch: If we think of the dataset formula as a cooking reciepe a dataset batch is batch of cookies made with a specific formula. A model will always be trained on a precise batch and record this batch for recproducability purposes.
  • data catalog: it's the schema of any inputs, our outputs of a jeyn dataset or model.

model abstractions.

  • model checkpoint: a model checkpoint is a saved model. A set of weights that have been saved after a training
  • model serializers: a way of going from a python object to a checkpoint's bytes as well as the other way around.
  • machine learning use case: a machine learning use case is basically a way to tell that one or several model checkpoints are more or less "the same". Not in that they'll produce the same output (that's a checkpoint) but that they should be used the same.

artefacts, stores and relationships.

All the machine learning abstractions stated above are built withing jeyn to be converted to two lower level abstractions: artefacts and relationships.

  • an artefact is basically a thing that is saved.
  • a relationship links two artefacts together.

when building your pipeline, your operations always need to be convert their outputs to an artefact. To help you with this jeyn provides various stores (model_store, dataset_store that you can use) to save your models, checkpoints, batches, ect and they will return an artefact you can then output so that jeyn will know how to reload them in subsequent operations.

contributing to jeyn.

jeyn is based on 2 major open source technologies:

  • dapr: a framework that abstracts cloud implementation to be able to have the same code or architecture running in different setps with a change of configuration
  • Argo: to run your machine learning workflows.

getting started.

jeyn's People

Contributors

aredier avatar

Stargazers

 avatar

Watchers

Adrien B. avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.