Giter Club home page Giter Club logo

any-nlp's Introduction

any-nlp

This is a step by step guide how to approach any Natural Language Processing (NLP) problem using Keras, Tensorflow, and Glove Vectors.

This provided me a nice framework to do machine learning problems in the future.

Table of Contents

Description

For a competitive Kaggle challenge I used this notebook to put together and track my submissions.

This was hosted at DS12 Whiskey Multiclass Classification problem

(0, 1, 2) Classify the whiskey based on the the text description

  • Excellent 0
  • Good 1
  • Bad 2

Important directories

  • working (this is where the notebook is located, also where you need to download the glove vector text file to)
  • input (this is the directory with the data)
  • submission (this directory holds some of the submissions that I uploaded to the Kaggle Competition)
  • first-attempt (this directory holds my first attempts at solving the problem)

Functionality

In order to run the notebooks and follow along you will need to download the Glove Vectors, for this competition I used the glove.840B.300d.txt file this can be found on the github stanfordnlp page listed below. You will need to unzip the file in the same directory as the notebook in order to follow along. I did not include in this repository due to the large file size.

https://github.com/stanfordnlp/GloVe

http://nlp.stanford.edu/data/wordvecs/glove.840B.300d.zip

Technology

What are the technologies used?

  • Natural Language Processing
  • Keras preprocessing
  • Keras LSTM, MaxPooling1d,
  • Stanford glove vectors
  • Pandas
  • Numpy
  • Scikit-Learn train+test+split

License

license

Special thanks to Kaggle User Abhishek for the inspiration on how to apply NLP to various problems https://www.kaggle.com/abhishek/approaching-almost-any-nlp-problem-on-kaggle

more work to come soon

any-nlp's People

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.