Problem statement : Users assessed tweets related to various brands and products, providing evaluations on whether the sentiment conveyed was positive, negative, or neutral. Additionally, if the tweet conveyed any sentiment, contributors identified the specific brand or product targeted by that emotion.
Dataset Columns: tweet_text ,emotion_in_tweet_is_directed_at ,is_there_an_emotion_directed_at_a_brand_or_product.
- Cleaned data/ removed null values and special char.
- Tokenization of elements in tweet texts.
- Grouped apple/google tweets and negative/positive to boolean.
- As it is a classification problem, I Used Scikit learn, and logistic regression.
- Then exported it as a twittermodel.joblib
- Then I deployed this project as a flask app
- f1_score(y_test, pred) -> 0.9236209335219236
- accuracy_score(y_test,pred) -> 0.8646616541353384
- pred_prob[0][1] >= 0.3 -> True