Giter Club home page Giter Club logo

data_1_checks's Introduction

data_1_checks

##Knowledge Check Assignments Data Analytics Tuesday Class

Below are instructions from Knowledge Check 1 first class:

  • a statement that prints "hello world!"
  • a list populated with several values. They can be strings / integers / floats etc. Then, print() one of those values either to the prompt or in a cell (if using Jupyter Notebook)
  • a dictionary populated with two keys and two values. Print one of the values just like you did above.
  • a tuple with 4 values. Print one of them.

##Knowledge Check 2 from first class is listed in folder KnowledgeCheck1.py under kc_2.py

Per the example information where data was obtained:

This directory includes a few sample datasets to get you started.

california_housing_data*.csv is California housing data from the 1990 US Census; more information is available at: https://developers.google.com/machine-learning/crash-course/california-housing-data-description

##Knowledge Check 1 from second class (DataC2KC1Sept2.py):

  • Pull in data from an API.
  • Find and print TWO descriptive statistics about your data.
  • Write a query in Pandas to select a particular set of your data.
  • Select and print the SECOND AND THIRD columns of your data frame.
  • Select and print the FIRST 4 rows of you data frame.

The API from Punk API does not need a key. https://api.punkapi.com/v2/beers. Imports include pandas and requests. It was created in Visual Studio using Python.

##Knowledge Check 2 from second class (cleaning data sets)

  • Make a .py (or .ipynb) file that contains the following (your choice of editor does not matter!) and do the following: find and access a data set in any way you want. You can use an API, a CSV, anything.
  • Fix TWO issues with the data set using techniques you've learned in class. Here are some common fixes:
  • Remove null values
  • Fill in null values with 0's or blanks
  • fill in blanks
  • fix character strings that aren't formatted correctly (you could use regex for this)
  • correct column names if they're misnamed
  • correct spelling (for example, you might have a Country column with an entry that says "Unted States of America".) There are hundreds of other things you could fix depending on the issues, so don't worry about whether or not your fix "counts" for the check. It most likely will if you're fixing something. Commit your changes. Push your changes to GitHub, and make sure to turn in the GitHub link into Google Classroom.

data_1_checks's People

Contributors

michellerobertsbu avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.