Giter Club home page Giter Club logo

data-science-examples's Introduction

Data Science with Exasol

This repository contains a collection of examples and tutorials for Data Science and Machine Learning with Exasol. In those examples and tutorials you learn how to explore and prepare your data and build, train and deploy your model with and within Exasol.

Currently, this repository is under development and we will add more and more examples and tutorials in the future.

What's inside:

  • Tutorials: Tutorials show a complete workflow on a realistic use case and data.
  • Examples: Examples only show how to integrate a specific technology, but not a whole data science workflow with it.

Prerequisites:

In general, you need:

  • Exasol, in particular with user-defined functions (UDFs). In most cases Version 6.0 and above with Script Language Container support is required. We provide a Community Edition or Docker images.
  • Many examples or tutorials are provided as Jupyter Notebooks. We recommend to install a Jupyter server with access to the Database and the BucketFS (Documentation can be found in the Exasol User Manual in Section 3.6.4).
  • Furthermore, many examples heavily use pyexasol to communicate with the Database. We recommend to install it on your Jupyter server.

Specific prerequisites are stated in each tutorial.

data-science-examples's People

Contributors

cyroxx avatar hanizaidi110 avatar jakobbraun avatar kaklakariada avatar marlenekress79789 avatar redcatbear avatar tkilias avatar tomuben avatar umitbuyuksahin avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

data-science-examples's Issues

Update to Python3.8 minimal flavor in script-languages tutorial

Background

  1. Python3.6 is deprecated and we should use a new version
  2. There log location has been changed and needs to updated in the tutorial

Acceptance Criteria

  1. Switch from Python 3.6 to Python3.8 in the tutorial (script-languages.ipynb)
  2. Adjust the log location: exaslct.log => main.log

Use connection for BucketFS information in classification examples

Currently, the BucketFS connection information get embedded into the UDF code (classification.ipynb). A much cleaner method is, to provide this information via a connection. Either, by providing the BucketFS URL directly, or by providing a path to a configuration file in the BucketFS.

  • In order to upload an object into BucketFS, the notebook creates and uses an UDF called EXABUCKET_HELPER.
  • Then this UDF is called when an object is going to be uploaded. Instead of creating an UDF to upload an object into BucketFS, we can use bucketfs connection object and bucketfs_python libraries.

Add Example for invoking AzureML model from ExasolDB using UDF

Add a Jupyter Notebook example showcasing invoke a trained AzureML model from Exasol Database via UDF.
This should include all functionality and key-words.

Builds on #40

Might be similar to UseSagemakerModelFromExasol.ipynb .

Tasks

Add Tests to data-science-examples

Test Procedure:

  • Start Exasol Database Docker Container
  • Start Jupyter Notebook Docker Container
  • Tranform Notebook to Script with jupyter nbconvert
  • run script

Change header import in sagemaker example

As mentioned here, we directly import of the header of the scania trucks data set into exasol at the moment, and as a result use
all_columns = exasol.export_to_pandas("SELECT * FROM IDA.TRAIN LIMIT 1;")
for the reading of the header, which is not great.

please update in the three files mentioned below by using this or this:

we import the data using this notebook from sagemaker tutorial for the data import which needs to be changed, and then the data is read in the sagemaker tutorial and in the azureml tutorial

Add script-language example

Requirements:

  • Show how to build a language container
  • Show where logs can be found
  • Show how a flavor looks like
  • Show how to customize a container
  • Show how to upload a container
  • Show how to activate a container
  • Explain caching and hashes
  • Explain content of a container
  • Show utility scripts to investigate container

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.