Giter Club home page Giter Club logo

pro_pandas's Introduction

Pro Pandas

Do you know that 60% to 80% of the work in a Data Science project goes into Data Acquisition, Cleaning, Transformation, Analysis and Feature engineering? Pandas is the foremost and de facto library for Data Analysis in Python. By mastering Pandas, you complete half the journey in Data Science. The basics of all the features provided by Pandas in a beginner-friendly way represented in this file.

Pandas is a Python library for doing data analysis. It's really fast and lets you do exploratory work incredibly quickly.

The goal of this cookbook is to give you some concrete examples for getting started with pandas. The docs are really comprehensive. However, I've often had people tell me that they have some trouble getting started, so these are examples with real-world data, and all the bugs and weirdness that entails.

Quick Start

If you have both conda and git on your system (otherwise, read the next section for more detailed instructions):

Table of Contents

A quick tour of the IPython Notebook Shows off IPython's awesome tab completion and magic functions.

Chapter 1: Reading from a CSV

Reading your data into pandas is pretty much the easiest thing. Even when the encoding is wrong!

Chapter 2: Filtering & Sorting Your Data

It's not totally obvious how to select data from a pandas dataframe. Here I explain the basics (how to take Filtering and get columns)

Chapter 3: Grouping

The groupby/aggregate is seriously my favorite thing about pandas and I use it all the time. You should probably read this.

Chapter 4: Merging and Binding

In real life project we havee to mearge lots of table data or csv files to gather, so this is the very important part while learning pandas.

Chapter 5: Dealing with Null values

There are always null values in dataset you must have knowledge how to deal with Null values, here you will learn it with easy way with fun.

Chapter 6: Data Visualization

With out visulazation you unable to explain things to anyone, if you want to describd data with visulization anyone will understand it easily.

More Related with Pandas check PDF File

๐Ÿผ PANDAS ๐Ÿผ.pdf

pro_pandas's People

Contributors

data-science-vidhya avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.