Giter Club home page Giter Club logo

isis-api's Introduction

Isis

A service exposing package tags and package categories to Thoth's recommendation engine.

project2vec

Isis API exposes functionality on top of project2vec - description of a package using a vector. The vector consists of features that the given project provides. These features are aggregated based on keywords found in the Python ecosystem and subsequently they are extracted from project descriptions and other free text descriptions of a project (README files on linked GitHub repos).

These vectors form space in which we can search for similar packages (by computing distance between these vectors) as well as performing feature based queries by for example masking the resulting vectors (specifing features we are interested in).

The figure bellow shows a visualization in TensorBoard after dimensionality space reduction using t-SNE. There can be seen formed clusters of similar packages and search of similar packages in the ecosystem.

TensorBoard project2vec visualization

Deployment

The service is built using OpenShift's s2i. On deployment, there is first run an init container that downloads model from Ceph/S3 (created by one of the flows defined by selinon-worker flows).

isis-api's People

Contributors

fridex avatar goern avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.