Please read this document first to get started with this problem. This document contains introduction to the problem and data used in the problem and metadata of the files present in this folder.
There are separate files for the codes, input data(both CSV and .mp4), guide to setup the project, methodologies and results.
-
Introduction to the problem and Data--> text file for introduction to problem and data used
-
Code Folder: Contains another 2 folders for notebook(.ipynb) and python scripts (.py)
ner_nltk_spacy
--> Python script for Named Entity Recognitionsent_analysis_rf
--> Python script for Sentiment Analysissk_topic_modeling
--> Python Script for Topic Modeling using LDAspeech2text.py
--> Python script for speech to text conversion
-
csv files folder: Folder contains 3 csv files are used for sentiment analysis, topic modeling, and named entity recognition tasks
dev_sent_emo
--> csv file for devtrain_sent_emo
--> csv file for trainingtest_sent_emo
--> csv file for testingNote
: Please update the paths for the above files at the beginning of every file accordingly
-
Methodoloy and Results: Folder contains 3 .pdf files explaining methodology applied and the results of the respective tasks
NER_Results_Methodology
--> For Named Entity RecognitionSentiment_Analysis_Methodology_Results
--> For Sentiment AnalysisTopic_Modeling_LDA_Methodology_Results
--> For Topic Modeling using LDA
-
requirements.txt
: File to install the dependencies required> pip3 install -r requirements.txt
-
setup.txt
: Guide to set up the assignment -
Input data: Can be found on the link: https://www.kaggle.com/zaber666/meld-dataset