Giter Club home page Giter Club logo

microsoft-rec's Introduction

Recommending System using Microsoft recommenders

TF-IDF model was used to create a recommender. It was the only option available from Microsoft recommenders that dealt with text data. Recommenders usually work on a user-product-rating framework and our dataset is not build like that. It has users and the tags they use.

Maybe, some further investigation can show that other models can be used in this case.

TF-IDF is a standard model used for text data. It basically tokenizes all text, in a standard format and builds the frequency of each token. Building from that it calculates cosine similarity of the vector tokens of a user with another one and recommends accordingly.

Cosine similarity is a distance metric used for vector tokens because of its nature of high-dimensionality. Euclidean distance does not work in this cases and basically cosine distance measures the angle between different vector of tokens.

Obstacles

TF-IDF does not scale well so I took a small sample of 500 users to build a model. In my machine it crashed when I tried to do all users at once.

To train a new model

  • delete model in /model/ and it will create a new model for the new input you have provided

Run to create image

  • docker build --tag microsoft-recommender .

Run to start up container from created image

  • docker run -p 3001:3001 --name backend microsoft-recommender

microsoft-rec's People

Contributors

lucavehbiu avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.