Giter Club home page Giter Club logo

a_machine_learning_approach_to_classify_anti-social_bengali_comments_on_social_media's Introduction

A Machine Learning Approach to Classify Anti-social Bengali Comments on Social Media

This repository is based on my research paper titled "A Machine Learning Approach to Classify Anti-social Bengali Comments on Social Media". The paper is accepted by IEEE. You can found this paper in the following link: https://ieeexplore.ieee.org/document/9836407.

Here, you can found the basic coding part of the research. We have collected 7000+ bengali comments from social media. Please see the dataset folder.

Problem statement of this research

  • Comments: Collect preprocessed comments from social media
  • Annotation: Annotated the comments as social or anti-social
  • Building Model Using the data, build a ML model
  • Classification Model can classify the new comments

Dataset annotation example

  • “আর ভারতে যাইয়া পুজোর উদ্বোধন করে আসো"
    (Travel to India to inaugurate Puja.)
    -Although no obscene words are used in this sentence, it offends a community's religious emotions. So, it is an anti-social comment.

  • “হিরো কনডম সবসময় এক নাম্বার"
    (Hero condom is always number one.)
    -Here no slang word is used or no one is dehumanized. So, it is not an anti-social comment.

  • "মা হিসেবে আপনি জাতির গর্ব কারণ আপনার সন্তানকে আপনি অনেকগুলা বাবা উপহার দিয়েছেন"
    (As a mother you are the pride of the nation because you have given many fathers to your child.)
    -In this scenario, an individual character such as a mother was questioned, which is not socially acceptable. Hence this is termed as anti-social.

  • “এটা ই যেন শেষ আন্দোলন হয়"
    (Hope it will be the last movement.)
    -It yearned for the end of a movement indicating peace. which is socially acceptable.

Contribution of this research

  • Comments on social media were analysed based on socially acceptable or not rather than sentiment analysis.
  • Made a dataset containing 2000 Bengali comments collected from popular social media.
  • Applied 5 traditional and modern machine learning algorithms and compared the performance of the models based on these algorithms
  • Built N-Gram (uni-gram, bi-gram, and tri-gram) language models.

a_machine_learning_approach_to_classify_anti-social_bengali_comments_on_social_media's People

Contributors

mdforhadcse avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.