Giter Club home page Giter Club logo

meme_stock_nn's Introduction

meme_stock_prices

Presentation: Google Slides Dashboard/Site: Dashboard

Git Hub Branches and Communication

Our main communication took place in the Slack messenger app, when a more detailed question/task/troubleshooting needed to be explained a video conference was set up through skype. Each member worked on their own separate github branch, and in individual files (This avoided excessive overlap during merging). When files needed to be combined the main contributor will be in charge of merging the files and the person with the squared role would confirm the merge.

Team members:

Background

Meme stocks have been gaining popularity in the last two years and have generated investors both profits and losses. The rise of social media fandom and its accompanying chatter has been named the culprit of major catapults in stocks such as GME, TSLA, AMC, and many others.

Overview

In this project we will test whether we can predict if the price of a meme stock will increase or decrease based on the social media hype around it, that is the conversations invoving their mention online. Specifically, we will be analyzing Twitter data to measure how often Tesla stock is mentioned in a 7 day period, and merging this data with the stock price counterpart on an hourly basis. As a baseline we will see if we can create the same results using a Stock index, such as the S&P500 index, in lieu of the Twitter data.

Source data

  1. Historical Stock Data: For the stock data we used the Yfiance API python library. With this python API library we were able to extract historical stock data, however we encountered the following limitation: hourly data was limited to the previous 7 days.

    1. Data Acquire (Hourly Stock Data): % Stock Price change, Volume of stock exchanges for SP500 and TSLA stocks.
  2. Twitter - TSLA stock mention: For twitter TSLA data we used the Twarc API python library. This library uses the Twitter API to collect data. The Twitter API also limited us to hourly data, for the previous 7 days.

    1. Data Acquire (Hourly Tweet Count Data): Tweets that mentioned β€œ#TSLA”, the common way to mention a stock ticker.
    • This analysis includes tweet data from the past 7 days at the start of the project.
    • Data was extracted from twitter API, placed into a jupyter notebook dataframe, and cleaned up to include the date, hour, and tweet counts for the respective time frame.

Questions To Answer

  1. Will a logistical regression model show that the quantity of Tweets mentioning #TSLA affect the price of Tesla stock?
  2. What has a bigger impact on the price of a meme stock such as TSLA? (Tweet mention count data vs. S&P500)

Results

Logistical Regression (Supervised Machine Learning)

  • This analysis was performed to investigate whether TSLA stock increase or decrease based on certain factors: tweet counts mentioning TSLA, SPY day % change, and a combination of all factors.
  • The following image shows our results of the logistical regression model for all three items mentioned above. As evidenced here, all three logistical regression models reflect the same results precision, recall, and f-1 scores, therefore we were unable to draw any solid conclusions.

Screen Shot 2021-11-20 at 2 33 54 PM

Our task then, was to figure out why we were getting the same results for all three tests.

Linear Regression

The linear regression models were performed on the following parameters: Tesla tweet counts, Tesla percent day changes, and lastly Tesla Volume

  • Tesla percent day changes vs #TSLA Tweet counts
    • Results: In this case, this model yielded an R-squared value of 0.0058, which does not indicate a high level of correlation between tweet count and changes in TSLA's percent day increase or decrease. Larger tweet counts would yield a bit more drastic changes according to our outlier data, but they couldn't very well determine whether the Tesla stock would increase or decrease

lin_reg_tsla_tweet

  • Tesla percent day changes vs SPY percent day changes
    • Results: The relationship between SPY percent day changes and TSLA's percent day changes yielded a higher correlation that the tweet count model above, however at 0.1937, it is still not a strong enough model in assessing behavior or making future predictions with confidence.The

lin_reg_tsla_spy

  • Tesla Volume vs Tesla percent day changes
    • Results: From this linear regression test, we found that higher volume was linked to more drastic changes in Tesla stock (either increasing or decreasing), however at lower volume levels, there wasn't a clear picture of correlation

lin_reg_combined

What has the strongest correlation to TSLA (meme stock) price, SP500 or tweet post count?

meme_stock_nn's People

Contributors

rick2stack avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.