Conversational AI using TensorFlow

This is an implementation of a multi-head attention model trained on the Cornell movie dataset. It follows the Transformer architecture introduced in 2017, which is used in NLP for various tasks such as translation (you can visit my other repository, Neural Style Translation).

How to use:-

  1. Clone the repository:
$ git clone https://github.com/sarthak7509/ConversationalAi.git
  2. Download the pretrained weights from my drive and place them in the model weights folder.
  3. Run conversationAi.py for a demo, or run app.py to start a local server that returns the chatbot's reply to a POST request.
  4. The training files are included under the training folder; use them to train a custom model.
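
The README does not document the request format that app.py expects, so the following is only a minimal sketch of how a client might call the local server. The URL, port, and the JSON field name "message" are all assumptions made for illustration; check app.py for the real route and payload before using it.

```python
import json
import urllib.request

# Hypothetical values: the README does not state the host, port, route,
# or JSON field name used by app.py.
SERVER_URL = "http://127.0.0.1:5000/"


def build_chat_request(message, url=SERVER_URL):
    """Build (but do not send) a POST request carrying the message as JSON."""
    payload = json.dumps({"message": message}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_bot(message, url=SERVER_URL):
    """Send the request to a running server and return the response body as text."""
    with urllib.request.urlopen(build_chat_request(message, url)) as response:
        return response.read().decode("utf-8")
```

With the server from app.py running, ask_bot("hello") would return whatever body the server sends back. Building the request separately from sending it makes it easy to inspect the payload without a live server.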

How to use WikiQA:-

  1. As of 1st July 2021, I have trained my existing model architecture on the WikiQA dataset provided by Microsoft. The current accuracy is 25%, which is not bad for 20 epochs.
  2. Since the model architecture is the same, you only need to download the latest weights from my drive, place them in the folder model_weight_WkiiQna, and provide their path to the model. As of now, the latest weight file is model2.h5.
  3. The training notebook is provided under the training folder.
  4. Specify the model path in the code wherever it is mentioned.
  5. The tokenizer is already included in the GitHub repository.

Note: the WikiQA bot model weights are already included in the git directory named model_weight_WkiiQna, so there is no need to download them. The current latest weight file is model3.h5.
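
Because the latest weight file changes over time (model2.h5, then model3.h5), a small helper can pick the newest file instead of hard-coding the name. This is a sketch only: the helper name latest_weights is hypothetical, and it assumes the files keep the modelN.h5 naming pattern seen above.

```python
import re
from pathlib import Path

# Assumed naming pattern, inferred from model2.h5 and model3.h5 in this README.
WEIGHT_NAME = re.compile(r"model(\d+)\.h5")


def latest_weights(folder="model_weight_WkiiQna"):
    """Return the modelN.h5 path with the highest N in folder, or None if none exist."""
    best_path, best_number = None, -1
    for path in Path(folder).iterdir():
        match = WEIGHT_NAME.fullmatch(path.name)
        if match and int(match.group(1)) > best_number:
            best_path, best_number = path, int(match.group(1))
    return best_path


# The returned path could then be passed to Keras, for example:
#   model.load_weights(str(latest_weights()))
# where `model` is assumed to be built with the same architecture used in training.
```

Comparing the numeric part as an integer means model10.h5 is ranked above model9.h5, which a plain alphabetical sort would get wrong.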

Note: a talking demo file has been added for a quick demo session of the project.

How to use it:-

$ python talkingdemo.py -b WEIGHT_PATH -t TOKENIZER_PATH -n BOT_NAME(optional)
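
For readers who want to see how these flags fit together, here is a minimal argparse sketch of a parser that accepts the same command line. It is not the actual parser in talkingdemo.py: the destination names, the default bot name "Bot", and treating -b and -t as required are assumptions based only on the usage line above.

```python
import argparse


def build_parser():
    """Build a parser for the -b, -t and -n flags shown in the usage line."""
    parser = argparse.ArgumentParser(description="Talking demo (illustrative sketch)")
    # Assumed required, since only -n is marked optional in the README.
    parser.add_argument("-b", dest="weights", metavar="WEIGHT_PATH", required=True,
                        help="path to the model weights file")
    parser.add_argument("-t", dest="tokenizer", metavar="TOKENIZER_PATH", required=True,
                        help="path to the saved tokenizer")
    # "Bot" is a hypothetical default; the real script may use a different one.
    parser.add_argument("-n", dest="name", metavar="BOT_NAME", default="Bot",
                        help="optional display name for the bot")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(f"{args.name}: weights={args.weights}, tokenizer={args.tokenizer}")
```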

Encoder

Single Encoder Layer

[Figure: single encoder layer]

Encoder

[Figure: encoder]

Decoder

Single Decoder Layer

[Figure: single decoder layer]

Decoder

[Figure: decoder]

Transformer

[Figure: Transformer]
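
The core operation inside every encoder and decoder layer of a Transformer is scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V. The sketch below writes it in plain, dependency-free Python purely to illustrate the arithmetic; it is not taken from this repository, which uses TensorFlow, and the function names are chosen for illustration. Multi-head attention runs several such attention computations in parallel on differently projected queries, keys, and values, then concatenates the results.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    largest = max(row)
    exps = [math.exp(x - largest) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(q, k, v):
    """Compute softmax(q k^T / sqrt(d_k)) v for 2D lists q, k, v (rows are positions)."""
    d_k = len(k[0])
    output = []
    for q_row in q:
        # Similarity of this query with every key, scaled by sqrt(d_k).
        scores = [
            sum(a * b for a, b in zip(q_row, k_row)) / math.sqrt(d_k)
            for k_row in k
        ]
        weights = softmax(scores)
        # Weighted average of the value rows.
        output.append([
            sum(w * v_row[j] for w, v_row in zip(weights, v))
            for j in range(len(v[0]))
        ])
    return output
```

When a query matches all keys equally, the weights are uniform and the output is simply the mean of the value rows.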
