Giter Club home page Giter Club logo

bikeshare-project's Introduction

Basic Data Exploration with pandas on Bikeshare Data

Basic Udacity project using pandas library in Python for their bikeshare data exploration.

Project Overview:

This project focuses on pandas library usage and simple statistics methods to perform a rudimentary analysis on the bikeshare data from three major U.S. cities - Chicago, Washington, and New York City - to display information such as most popular days or most common stations.

Running the program:

You can input 'python bikeshare.py' on your terminal to run this program. I use Anaconda's command prompt on a Windows 10 machine.

Program Details:

The program takes user input for the city (e.g. Chicago), month for which the user wants to view data (e.g. January; also includes an 'all' option), and day for which the user wants to view data (e.g. Monday; also includes an 'all' option).

Upon receiving the user input, it goes ahead and asks the user if they want to view the raw data (5 rows of data initially) or not. Following the input received, the program prints the following details:

  • Most popular month
  • Most popular day
  • Most popular hour
  • Most popular start station
  • Most popular end station
  • Most popular combination of start and end stations
  • Total trip duration
  • Average trip duration
  • Types of users by number
  • Types of users by gender (if available)
  • The oldest user (if available)
  • The youngest user (if available)
  • The most common birth year amongst users (if available)

Finally, the user is prompted with the choice of restarting the program or not.

Requirements:

  • Language: Python 3.6 or above
  • Libraries: pandas, numpy, time

Project Data:

  • chicago.csv - Stored in the data folder, the chicago.csv file is the dataset containing all bikeshare information for the city of Chicago provided by Udacity.

  • new_york_city.csv - Dataset containing all bikeshare information for the city of New York provided by Udacity.

  • washington.csv - Dataset containing all bikeshare information for the city of Washington provided by Udacity. Note: This does not include the 'Gender' or 'Birth Year' data.

Built with:

  • Python 3.6.6 - The language used to develop this.
  • pandas - One of the libraries used for this.
  • numpy - One of the libraries used for this.
  • time - One of the libraries used for this.

Author:

  • Aritra Chattaraj - Sole author for this program. Mentioned all the help received in 'Acknowledgements' section.

Acknowledgements:

  • xhlow - xhlow's repository helped with understanding the structure and details of certain functions.
  • philribbens - philribben's repository also added to better understanding of the structure for this project.
  • pandas docs - pandas documentation was immensely helpful in understanding the implemention of pandas methods used in this project.
  • Udacity - Udacity's Data Analyst Nanodegree program and their instructors were extremely helpful while I was pursuing this project.
  • Finally, I'd like to mention my college courses on Principles of Econometrics and Intermediary Econometrics for introducing me to data analysis and R programming. The concepts embodied in the pandas library (e.g. data frame) were very similar to the ones used while I was working on my R projects for college assignments.

bikeshare-project's People

Contributors

aritra96 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.