talia178 / nlp_topicmodelling_lda

In this project, I'll visualize text data using WordCloud, employ the LDA model for topic modeling, and compute coherence scores to assess the model's quality and find the optimal number of topics. I'll create an interactive visualization with pyLDAvis and save it as an HTML file for exploration.

Jupyter Notebook 100.00%
lda-model machinelearning-python natural-language-processing python3 topic-modeling

nlp_topicmodelling_lda's Introduction

About Dataset

"Friends" is an American television sitcom, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast starring Jennifer Aniston, Courteney Cox, Lisa Kudrow, Matt LeBlanc, Matthew Perry and David Schwimmer, the show revolves around six friends in their 20s and 30s who live in Manhattan, New York City. The series was produced by Bright/Kauffman/Crane Productions, in association with Warner Bros. Television. The original executive producers were Kevin S. Bright, Kauffman, and Crane.

Kaggle link: https://www.kaggle.com/datasets/sujaykapadnis/friends/data?select=friends.csv

friends.csv variables:

  • text: Dialogue as text
  • speaker: Name of the speaker
  • season: Season Number
  • episode: Episode Number
  • scene: Scene Number
  • utterance: Utterance Number
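As a rough illustration of the column layout above, the sketch below reads rows with Python's standard `csv` module and converts the numeric columns to integers. The helper names (`load_dialogue`, `utterances_per_speaker`) and the sample rows are hypothetical placeholders, not taken from the notebook or the real dataset, and the code assumes every numeric field is present and well formed.

```python
import csv
import io
from collections import Counter

# Columns that hold numbers according to the variable list above.
NUMERIC_COLUMNS = ("season", "episode", "scene", "utterance")

# Made-up placeholder rows that only mimic the column layout of friends.csv.
SAMPLE_CSV = (
    "text,speaker,season,episode,scene,utterance\n"
    "placeholder line one,Speaker A,1,1,1,1\n"
    "placeholder line two,Speaker B,1,1,1,2\n"
    "placeholder line three,Speaker A,1,1,2,1\n"
)


def load_dialogue(handle):
    """Read dialogue rows from an open CSV file object into a list of dicts."""
    rows = []
    for row in csv.DictReader(handle):
        # Assumption: numeric fields are never empty; real data may need cleaning.
        for column in NUMERIC_COLUMNS:
            row[column] = int(row[column])
        rows.append(row)
    return rows


def utterances_per_speaker(rows):
    """Count how many utterances each speaker has."""
    return Counter(row["speaker"] for row in rows)


rows = load_dialogue(io.StringIO(SAMPLE_CSV))
counts = utterances_per_speaker(rows)
```

With the real file, the same helper could be called as `load_dialogue(open("friends.csv", newline="", encoding="utf-8"))`, provided the file matches the layout listed above.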

About the Topic Modelling project

I am a devoted fan of the 'Friends' sitcom, having rewatched the series numerous times. Among the characters, Chandler Bing stands out as my favorite male character. His witty humor never fails to bring a smile to my face. The recent loss of Matthew Perry, the actor who portrayed him, deeply saddened fans around the world. In tribute to him and the entire cast of the series, I undertook a small project using this captivating dataset.

In this project, I visualize the text data using WordCloud, apply an LDA model for topic modeling, and compute coherence scores to assess the model's quality and find the optimal number of topics. I also create an interactive visualization with pyLDAvis and save it as an HTML file for exploration.
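To make the coherence step more concrete, here is a minimal pure-Python sketch of one common measure, UMass coherence, together with a helper that picks the topic count with the highest score. The description above does not say which coherence measure the notebook uses, so the choice of UMass is an assumption, and the names `umass_coherence` and `best_num_topics`, the toy documents, and the example scores are all hypothetical.

```python
import math


def umass_coherence(top_words, documents):
    """UMass coherence of one topic.

    Sums log((D(w_i, w_j) + 1) / D(w_j)) over every pair where w_j is
    ranked above w_i in top_words; D counts the documents containing
    all of the given words. Scores closer to zero are more coherent.
    """
    doc_sets = [set(doc) for doc in documents]

    def doc_freq(*words):
        return sum(1 for doc in doc_sets if all(w in doc for w in words))

    score = 0.0
    for i in range(1, len(top_words)):
        for j in range(i):
            denominator = doc_freq(top_words[j])
            if denominator == 0:
                continue  # skip words that never occur in the corpus
            joint = doc_freq(top_words[i], top_words[j])
            score += math.log((joint + 1) / denominator)
    return score


def best_num_topics(scores_by_k):
    """Return the number of topics whose coherence score is highest."""
    return max(scores_by_k, key=scores_by_k.get)


# Toy tokenized documents, invented purely for illustration.
docs = [
    ["coffee", "couch", "friends"],
    ["coffee", "couch"],
    ["dinosaur", "museum"],
    ["dinosaur", "museum", "coffee"],
]
coherent = umass_coherence(["coffee", "couch"], docs)
mixed = umass_coherence(["coffee", "museum"], docs)

# Hypothetical per-model scores, keyed by number of topics.
chosen_k = best_num_topics({2: -1.5, 3: -0.9, 4: -1.2})
```

In practice one score would be computed for each candidate model (for example by averaging over its topics), and the topic count with the best score kept. Libraries such as gensim offer coherence scoring through `CoherenceModel`, which may be more convenient than a hand-written version.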
