Giter Club home page Giter Club logo

prediction's Introduction

Prediction

Description

Predicting solar generation based on weather forecast - a project which was part of Machine Learning course at BITS Pilani

Data Collection

Among the many different data sets we went through, we found the weather data from a Davis Weather station in Amherst, Massachusetts most suitable. The data was collected periodically every five minutes.

Attributes of the dataset

  • Temperature: The temperature is measured in °C.
  • Wind-chill: Wind-chill is the perceived decrease in air temperature felt by the body on exposed skin due to the flow of air. In this dataset it is measured in °C.
  • Humidity: Humidity is the amount of water vapor in the air. Relative humidity, expressed as a percent, measures the current absolute humidity relative to the maximum for that temperature. The dataset has values for relative humidity.
  • Dew-point: The dew point is the temperature at which the water vapor in a sample of air at constant barometric pressure condenses into liquid water at the same rate at which it evaporates. The dataset records dew point in °C.
  • Wind speed: Wind speed is measured in miles/hour (mph)
  • Wind direction: Wind direction is measured in degrees (°).
  • Rain: Rainfall is measured in mm.
  • Pressure: Pressure is measured in inHg (inches of mercury).
  • Solar Intensity: Solar intensity refers to the amount of sunlight hitting the planets surface. Latitude is the North-South position on the planet. Solar intensity varies with latitude based on the shape and angle of the planet. In the dataset the solar intensity is measured in watts/sq. meter (w/m2)
  • Evapotranspiration (ET): It is the sum of the evaporation and plant transpiration from the Earth’s land and ocean surface to the atmosphere. It is expressed in feet.
  • UltraViolet index: It is an international standard measurement of the strength of ultraviolet (UV) radiation from the sun at a particular place and time.
  • Day of the year: It is the specific day of the year numbered from 1 to 365.
  • Time of the day: This is included as a timestamp.

The last two attributes are included because daylight on particular instant of time influences solar intensity and varies throughout the year for a given location.

Algorithms Used

The multivariate linear regression method is implemented using three different algorithmic techniques in MATLAB R2013b.

  • mvn (multivariate normal maximum likelihood estimation)
  • ecm(expectation maximization algorithm)
  • cwls(Covariance-weighted least squares estimation)

The original dataset consisted of thirteen independent variables as discussed above and one response variable. Since the dimension of data was too large, it has been trained on the data collected at noon in the year 2010. Principal Component Regression(PCR) is also implemented that performs a regression analysis on the observed principal components obtained from PCA which identifies those set of parameters that captures high variability. From this analysis, seven features have shown maximum variability in the data. The parameters were Temperature, Humidity, Dew point, Wind speed, Pressure, Day, Evapotranspiration. Thus the model is limited to only these seven parameters as the remaining parameters did not affect much in the variability of the data.

Furthermore a correlation analysis amongst all the parameters was performed. The analysis quantifies how each forecast parameter affects each other and the solar intensity. We found that relative humidity and rainfall are highly correlated with each other and with solar intensity, while temperature, dewpoint, and wind speed are only partially correlated with each other and with solar intensity.

prediction's People

Contributors

agarwaltanmay avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.