Giter Club home page Giter Club logo

data-fun-eda's Introduction

data-fun-eda

Dataset Description

This Python Exploratory Data Analysis project will analyze the "Dow Jones" dataset from the Seaborn library. It contains the price of the Dow Jones from 1914-12-01 to 1968-12-01. The dataset can be accessed via Github: https://github.com/mwaskom/seaborn-data/blob/master/dowjones.csv

Environment Setup and How to Install and Run the Project

  1. Create and clone repository to VSCode
  • Create a new GitHub repository named datafun-06-eda.
  • Clone the repository to your local machine.
  1. Create and Activate Virtual Environment
  • Create a Project Virtual Environment in the .venv folder.
  • Activate the Project Virtual Environment.
py -m venv .venv
.\.venv\Scripts\Activate
  1. Requirements
  • Install packages
py -m pip install jupyterlab pandas matplotlib seaborn pyarrow
  • Freeze your requirements to requirements.txt.
py -m pip install requests
py -m pip freeze > requirements.txt
  1. Git Ignore
  • Add a useful .gitignore to the root project folder.

Table of Contents

  1. Summary
  2. Dependencies
  3. Usage
  4. Data Acquisition
  5. Initial Data Inspection
  6. Initial Descriptive Statistics
  7. Data Exploration
    • Histograms
    • Dow Jones Performance Status Count
    • Line Chart of Selling Price
    • Bar Chart of Selling Price
    • Histogram of Selling Price
    • Bar Chart of Price Change
  8. Conclusion

Summary

The project analyzes the Dow Jones dataset to gain insights into its historical performance. It employs exploratory data analysis techniques to understand trends, patterns, and fluctuations in the DJIA over the specified period. The goal is to assess the long-term viability of investing in the DJIA.

Dependencies

The project requires the following Python libraries:

  • pandas
  • matplotlib
  • seaborn

Usage

To run the project, ensure you have Python installed along with the required dependencies. Then, execute the Python script nolan_eda.ipynd. Make sure to have the Dow Jones dataset (dowjones) accessible in your working directory or adjust the data acquisition step accordingly.

Data Acquisition

The project loads the Dow Jones dataset using seaborn's load_dataset function. It provides a brief overview of the dataset by displaying the first few rows.

Initial Data Inspection

The initial data inspection step involves examining the structure of the dataset, including its shape and data types, to gain a better understanding of the available information.

Initial Descriptive Statistics

Descriptive statistics are calculated to summarize the numerical aspects of the dataset. These statistics provide insights into central tendencies and variability of the data.

Data Exploration

Various exploratory data analysis techniques are employed to explore different aspects of the Dow Jones dataset, including:

  • Histograms: to visualize the distribution of price changes and selling prices.
  • Dow Jones Performance Status Count: to analyze the frequency of different performance statuses.
  • Line Chart of Selling Price: to observe the trend of selling prices over time.
  • Bar Chart of Selling Price: to compare selling prices across different dates.
  • Histogram of Selling Price: to further examine the distribution of selling prices.
  • Bar Chart of Price Change: to visualize the price changes over time.

Conclusion

The examination indicated that over time, the Dow Jones Industrial Average (DJIA) has displayed a consistent upward trajectory, notwithstanding intermittent bouts of instability. Individuals with a focus on long-range investment objectives might deem the DJIA viable.

data-fun-eda's People

Contributors

crusoe22 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.