NLP Made Easy

Simple code notes for explaining NLP building blocks

  • Subword Segmentation Techniques
    • Let's compare several tokenizers: NLTK, BPE, SentencePiece, and the BERT tokenizer.
  • Beam Decoding
    • Beam decoding is essential for seq2seq tasks, but it's notoriously complicated to implement. Here's a relatively easy version that batches the candidates.
  • How to get the last hidden vector of RNNs properly
    • We'll see how to get the last hidden states of RNNs in TensorFlow and PyTorch.
  • TensorFlow seq2seq template based on the g2p task
    • We'll write a simple template for seq2seq using TensorFlow. For demonstration, we tackle the g2p task, which converts graphemes (spelling) to phonemes (pronunciation). It's a good fit for this purpose because it's simple enough to get up and running quickly.
  • PyTorch seq2seq template based on the g2p task
    • We'll write a simple template for seq2seq using PyTorch. For demonstration, we tackle the g2p task, which converts graphemes (spelling) to phonemes (pronunciation). It's a good fit for this purpose because it's simple enough to get up and running quickly.
  • Attention mechanism (work in progress)
  • POS-tagging with BERT Fine-tuning
    • BERT is known to be good at sequence tagging tasks like Named Entity Recognition. Let's see if that holds for POS-tagging.
  • Dropout in a minute
    • Dropout is arguably the most popular regularization technique in deep learning. Let's check again how it works.
  • N-gram LM vs. RNNLM (WIP)
  • Data Augmentation for Quora Question Pairs
    • Let's see if augmenting the training data is effective for the Quora Question Pairs task.
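
As a quick companion to the dropout note in the list above, here is a minimal sketch of inverted dropout in plain Python. It is not taken from the notebooks: the function name `dropout`, its parameters, and the choice to work on a flat list of floats are all assumptions made for illustration. The idea is that each value is zeroed with probability `p` during training, and the survivors are scaled by `1 / (1 - p)` so the expected value stays the same. At inference time the input passes through unchanged.

```python
import random


def dropout(values, p=0.5, training=True, rng=None):
    """Inverted dropout over a flat list of floats (illustrative helper).

    Each value is dropped (set to 0.0) with probability p while training.
    Kept values are divided by (1 - p) so the expected output equals the input.
    """
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    if not training or p == 0.0:
        # Inference mode: identity, no masking and no rescaling.
        return list(values)
    rng = rng or random.Random()
    keep = 1.0 - p
    return [v / keep if rng.random() < keep else 0.0 for v in values]


if __name__ == "__main__":
    activations = [1.0] * 8
    # Training: roughly half the entries become 0.0, the rest become 2.0.
    print(dropout(activations, p=0.5, rng=random.Random(0)))
    # Inference: values are returned unchanged.
    print(dropout(activations, p=0.5, training=False))
```

Scaling at training time (rather than at inference time) is a common design choice because the inference path then needs no knowledge of `p`.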

Contributors

kyubyong
