tko-data-engineering's Introduction

Data Engineering with Snowpark

This repository contains a hands-on lab for data engineering in Snowflake with Snowpark! Here is an overview of what we'll build in this lab:

Preview Features

Note: The following features/tools used in this lab are still in preview

Setup

You will need the following things before beginning:

  • Snowflake
    • A Snowflake Account
    • A Snowflake user created with ACCOUNTADMIN permissions. This user will be used to get things set up in Snowflake.
  • Anaconda
  • SnowSQL
    • SnowSQL installed on your computer. Download it from the SnowSQL Download page, and see the Installing SnowSQL page for more details.

    • Create a SnowSQL configuration for this lab by adding the following section to your ~/.snowsql/config file (replacing the account, username, and password with your values):

      [connections.dev]
      accountname = myaccount
      username = myusername
      password = mypassword
      rolename = HOL_ROLE
      warehousename = HOL_WH
      dbname = HOL_DB
      
  • Visual Studio Code with required extensions
    • Visual Studio Code installed on your computer. Check out the Visual Studio Code homepage for a link to the download page.
    • Python extension installed. Search for and install the "Python" extension in the Extensions pane in VS Code.
    • Snowflake extension installed. Search for and install the "Snowflake" extension in the Extensions pane in VS Code.
  • GitHub account with lab repository forked and cloned locally
    • A GitHub account. If you don't already have a GitHub account, you can create one for free. Visit the Join GitHub page to get started.
    • A forked lab repository. You'll need to create a fork of this lab repository in your GitHub account. Visit the tko-data-engineering GitHub Repository and click on the "Fork" button near the top right. Complete any required fields and click "Create Fork".
    • A local clone of the forked lab repository. To find the clone URL, open your forked repository on GitHub and copy the "HTTPS" link provided near the top of the page. If the repository has at least one file, click the green "Code" button near the top of the page and copy the "HTTPS" link from there. Use that link in VS Code to clone the repo to your computer. See Clone and use a GitHub repository in Visual Studio Code for more details.
  • Anaconda environment
    • Create and activate a conda environment for this lab using the supplied conda_env.yml file. Run these commands from a terminal in the root of your local repository:

      conda env create -f conda_env.yml
      conda activate pysnowpark
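
Before moving on, it can help to sanity-check the SnowSQL configuration described above. The following is a minimal sketch, not part of the lab itself: the function name check_snowsql_config and the set of checks are assumptions made for illustration. It only verifies that the [connections.dev] section exists, that rolename, warehousename, and dbname carry the HOL_ values shown above, and that username and password are no longer the template placeholders. It does not connect to Snowflake.

```python
import configparser
from pathlib import Path

# Values the lab's [connections.dev] section is expected to carry.
EXPECTED = {"rolename": "HOL_ROLE", "warehousename": "HOL_WH", "dbname": "HOL_DB"}
# Template placeholders that should have been replaced with real values.
PLACEHOLDERS = {"myusername", "mypassword"}


def check_snowsql_config(text, connection="dev"):
    """Return a list of problems found in SnowSQL config text (hypothetical helper)."""
    # Disable interpolation so a '%' in a password is not treated specially.
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(text)
    name = f"connections.{connection}"
    if not parser.has_section(name):
        return [f"missing section [{name}]"]
    section = parser[name]
    problems = []
    for key, expected in EXPECTED.items():
        actual = section.get(key, "").strip('"')
        if actual != expected:
            problems.append(f"{key} is {actual!r}, expected {expected!r}")
    for key in ("username", "password"):
        value = section.get(key, "").strip('"')
        if not value or value in PLACEHOLDERS:
            problems.append(f"{key} is missing or still a placeholder")
    return problems


if __name__ == "__main__":
    path = Path.home() / ".snowsql" / "config"
    if not path.exists():
        print(f"{path} not found")
    else:
        issues = check_snowsql_config(path.read_text())
        print("\n".join(issues) if issues else "connections.dev looks complete")
```

Run it from the activated conda environment. An empty problem list only means the file is filled in as described here, not that the credentials are valid.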

tko-data-engineering's People

Contributors

msjones121, sfc-gh-cbaechtold, sfc-gh-jhansen
