operate-first / espresso-series
This is where we plan our Espresso Series production.
License: GNU General Public License v3.0
Title
Tools for Data Science Collaboration
User Story
As a Data Scientist, I want to know how to use Git and JupyterLab to collaborate with team members and contribute to upstream repositories.
...
Teaser
In the next video, Francesco will show you how, with the power of Thoth, we can propagate these changes into a shareable, reproducible and interactive content image.
Release Date
TBD
Script URL:
Script URL
No dependency management found for this repository. If you want to keep your dependencies managed, please submit a Pipfile, requirements.in, or requirements-dev.in file.
To generate a Pipfile, use:
$ pipenv install --skip-lock --code ./
$ git add Pipfile
$ git commit -m 'Add Pipfile for dependency management'
Make sure your Pipfile, requirements.in, or requirements-dev.in is placed in the root of your Git repository.
Title
Tag release Content Image for the experiment
Description
Tag release Content Image for the experiment, motivating the use of content images for reusable experiments.
Teaser: in the next video, you will see what happens when a tag release is created.
Release Date
User Story
As a maintainer of an experiment,
I want to make sure new useful contributions are part of the experiment, so that everyone in the community can benefit from these changes directly in the Operate First environment.
Title: Cloud-native Data Science development
Description: Why cloud-native and not a local laptop? Benefits of state-of-the-art development practices applied to the data science domain.
Script URL: https://docs.google.com/document/d/1sq9LHu_HPDo3pr5oZ-xtBb0q8YcK1MDtD-dteAaX2NE/edit#heading=h.8a9yug4suz4n
Title: Git for Operate 1st
Description: Data Scientist Friendly Git Workflows
Release Date: April 25, 2021
Script URL: https://docs.google.com/document/d/1sEwSceqTfyJj_dSphSw7AOMK60EiUt1jjdPI1e47jzs/edit?usp=sharing
User Story
Git workflow for data scientists. Starts with forking/cloning a repo, ends with pushing to the fork.
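The workflow described above (clone a fork, make a change, push back to the fork) can be sketched as an ordered list of git commands. This is only an illustrative sketch, not the content of the episode script: the function name, the URLs, the branch name, the local directory name "repo", and the file path are all hypothetical placeholders. The function only builds the commands, so nothing is executed against a real repository.

```python
import shlex
from typing import List, Sequence


def fork_workflow_commands(
    fork_url: str,
    upstream_url: str,
    branch: str,
    paths: Sequence[str],
    message: str,
) -> List[List[str]]:
    """Return the git commands for a fork-based contribution, in order.

    All arguments are placeholders supplied by the caller; "repo" is an
    assumed name for the local clone directory.
    """
    return [
        # 1. Clone your fork (created beforehand in the GitHub web UI).
        ["git", "clone", fork_url, "repo"],
        # 2. Register the original repository so you can fetch updates from it.
        ["git", "-C", "repo", "remote", "add", "upstream", upstream_url],
        ["git", "-C", "repo", "fetch", "upstream"],
        # 3. Work on a dedicated branch rather than the default branch.
        ["git", "-C", "repo", "checkout", "-b", branch],
        # (edit notebooks or code here, before staging)
        # 4. Stage and commit the changed files.
        ["git", "-C", "repo", "add", *paths],
        ["git", "-C", "repo", "commit", "-m", message],
        # 5. Push the branch to your fork ("origin" is the clone source).
        ["git", "-C", "repo", "push", "--set-upstream", "origin", branch],
    ]


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    for cmd in fork_workflow_commands(
        fork_url="https://github.com/your-user/espresso-series.git",
        upstream_url="https://github.com/operate-first/espresso-series.git",
        branch="my-change",
        paths=["notebook.ipynb"],
        message="Update notebook",
    ):
        print(shlex.join(cmd))
```

Running the module prints the commands so they can be reviewed or pasted into a terminal one at a time. A pull request from the fork to the upstream repository would be the usual next step, which is outside the scope stated in the user story.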
do it
Here is the rough outline of the content I am planning to share in Episode 4.
Please review the document for content, information and overall flow. Once this issue is signed off, I will start on the final script.
Title
Monitoring JupyterHub notebooks
User Story
As a data scientist, I want to know how I can monitor my JupyterHub notebooks running on the Operate First cluster. If I run a machine learning workload in a local environment, I use tools like htop to monitor memory, CPU, and disk usage. If I work on the cluster, I want to know how I can monitor it so that I can debug my notebook.
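As a rough stand-in for the htop-style check mentioned above, a notebook cell can ask the Python standard library for the kernel's own CPU time, peak memory, and disk usage. This is a minimal sketch under stated assumptions, not the cluster monitoring approach the episode covers: the function name and dictionary keys are hypothetical, the `resource` module is Unix-only, and the numbers describe the current kernel process and the filesystem at `path`, not any memory or CPU limits the cluster may enforce on the notebook pod.

```python
import resource
import shutil


def notebook_resource_snapshot(path: str = "/") -> dict:
    """Return basic resource figures for the current Python process.

    `path` selects the filesystem whose disk usage is reported.
    """
    usage = resource.getrusage(resource.RUSAGE_SELF)
    disk = shutil.disk_usage(path)
    return {
        # User plus system CPU time consumed by this process, in seconds.
        "cpu_seconds": usage.ru_utime + usage.ru_stime,
        # Peak resident set size; kilobytes on Linux, bytes on macOS.
        "peak_rss_raw": usage.ru_maxrss,
        # Fraction of the filesystem at `path` that is in use.
        "disk_used_fraction": disk.used / disk.total,
    }


if __name__ == "__main__":
    for name, value in notebook_resource_snapshot().items():
        print(f"{name}: {value}")
```

Calling the function before and after a heavy cell gives a quick sense of how much CPU time and memory that cell consumed.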
Teaser
So we have all the tools to develop a data science notebook, but what happens when different personas work together on a single project?
Release Date
TBD
Title
Add title here
User Story
As PERSONA,
...
Teaser
In the next episode...
Release Date
Add target date for the release
Script URL:
Add link to the script to be reviewed
Title
Four Personae to work on one app
User Story
As a Data Scientist, I depend on work done by the Data Engineer, the MLDevOps Engineer, and the AI DevSecOps Bots, so that I get my model into production.
What are the Personae we talk about?
How do they interact with each other, what is their responsibility?
Can we describe interfaces between the three?
...
Teaser
In the next episode...
Release Date
Add target date for the release
Script URL:
Add link to the script to be reviewed
References
https://docs.google.com/document/d/1h3DRZvfaethUMls0svvovRvbRkO2QfoKtvh7ul1-81A/edit
https://docs.google.com/presentation/d/1XvdawGMIz68GXF7wKCBfIVZ2SBkdqy3bkIPNkkYIfMg/edit#slide=id.gdb4b126888_0_4
No dependency management found for the ubi8 environment. If you want to keep your dependencies managed, please submit a Pipfile, requirements.in, or requirements-dev.in file.
To generate a Pipfile, use:
$ pipenv install --skip-lock --code ./
$ git add Pipfile
$ git commit -m 'Add Pipfile for dependency management'
Make sure your Pipfile, requirements.in, or requirements-dev.in is placed in the root of your Git repository.
/kind feature
/priority important-soon
Title
Please watch S01E05: Tools For Data Science Collaboration to review before publishing.
Release Date
Thurs, May 27
Script URL:
See #3 for script
Watch video here
CC: @durandom @goern @pacospace @MichaelClifford @oindrillac @Shreyanand
Organization for finishing out season 1!
Submit a tweet PR to https://github.com/operate-first/operate-first-twitter/tree/main/tweets to promote the following videos:
Title
Motivation: Why do we need content images? And what are they?
User Story
As a Data Scientist,
I want to create a content image,
so that I can share the current state of an experiment with my colleagues
...
Teaser
In the next episode...
Release Date
TBD
Script URL:
TBD
/triage needs-information
High-level Goals
As a watcher of the Espresso Series, I want the videos to be recognizable via some sort of title slide/branding.
Describe the solution you'd like
Using the Operate First logo, I would like some sort of intro (animated or not) to distinguish this series from other videos on the YouTube channel.
Title
Creating Elyra Pipeline on ODH
User Story
As a Data Scientist, I want to create an AI Pipeline, so that data gets ingested, a model is trained, and the model is persisted into the Git repo.
Teaser
What if I modify a step of the pipeline and add a new dependency?