nlp_flair_texhero_distilbert's Introduction

Project Name

Predict which tweets are about real disasters and which are not, in a couple of lines of code. The data comes from https://www.kaggle.com/c/nlp-getting-started

General info

If you need quick initial results on a typical NLP task, Flair, Texthero and DistilBERT give quite good results out of the box.
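As a minimal sketch of the data preparation step, the Kaggle `train.csv` (columns `id`, `keyword`, `location`, `text`, `target`) can be turned into FastText-style lines (`__label__<class> <text>`), a format Flair can read for text classification. The function name `to_fasttext_lines` and the sample rows are made up for illustration and are not taken from the notebooks.

```python
import csv
import io


def to_fasttext_lines(csv_text):
    """Convert Kaggle-style CSV text into FastText-format lines.

    Hypothetical helper: assumes the columns 'text' and 'target' exist.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    lines = []
    for row in reader:
        # Collapse newlines and repeated whitespace so each tweet stays on one line.
        text = " ".join(row["text"].split())
        lines.append(f"__label__{row['target']} {text}")
    return lines


# Invented sample rows in the same column layout as the Kaggle file.
sample = (
    "id,keyword,location,text,target\n"
    '1,,,"Wildfire spreading near the highway, evacuate now",1\n'
    "2,,,This new album is fire,0\n"
)

for line in to_fasttext_lines(sample):
    print(line)
```

In practice the resulting lines would be written to train, dev and test files before being loaded by the classification library.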

Libraries and useful links

  1. Flair Embeddings
  2. FastText Embeddings
  3. TransformerWordEmbeddings
  4. How to use flair with keras

Status

Project is: in progress.

Inspiration

Project inspired by a Kaggle notebook

result on leaderboard


Second attempt

Rev_B_real_or_not.ipynb: results were worse than with the initial simple automatic approach. This suggests how well optimised the Flair framework already is; tweaking did not give better results. More extensive text cleaning, including expanding abbreviations and other shortcuts, might do better. Things like hero.visualization.wordcloud, kmeans and custom_pipeline were tried.
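The "more extensive text cleaning" idea could look roughly like the sketch below. It uses only the Python standard library rather than Texthero, and the abbreviation table, the regular expressions and the name `clean_tweet` are assumptions for illustration, not what the notebook does.

```python
import re

# Hypothetical abbreviation table; a real one would be much longer.
ABBREVIATIONS = {"u": "you", "pls": "please", "ppl": "people", "b4": "before"}

URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
NON_WORD_RE = re.compile(r"[^a-z0-9\s]")


def clean_tweet(text):
    """Lowercase, drop URLs and mentions, strip punctuation, expand abbreviations."""
    text = text.lower()
    text = URL_RE.sub(" ", text)       # remove links
    text = MENTION_RE.sub(" ", text)   # remove @user mentions
    text = NON_WORD_RE.sub(" ", text)  # replace punctuation and '#' with spaces
    tokens = [ABBREVIATIONS.get(tok, tok) for tok in text.split()]
    return " ".join(tokens)


print(clean_tweet("Pls help ppl b4 the flood hits! http://t.co/abc @user #flood"))
```

Whether this kind of cleaning actually helps the score is untested here; it would need to be checked against the leaderboard.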

Info

Created by [email protected]

Contributors

len-sla
