Giter Club home page Giter Club logo

rh-steve-grubb / openvino_model_server Goto Github PK

View Code? Open in Web Editor NEW

This project forked from opendatahub-io/openvino_model_server

0.0 0.0 0.0 38.84 MB

A scalable inference server for models optimized with OpenVINO™

Home Page: https://docs.openvino.ai/latest/ovms_what_is_openvino_model_server.html

License: Apache License 2.0

Shell 0.69% C++ 83.81% Python 11.71% C 0.72% Java 0.43% Groovy 0.09% Go 0.49% Makefile 0.57% CMake 0.04% Dockerfile 0.08% Starlark 1.36%

openvino_model_server's Introduction

OpenVINO™ Model Server

Model Server hosts models and makes them accessible to software components over standard network protocols: a client sends a request to the model server, which performs model inference and sends a response back to the client. Model Server offers many advantages for efficient model deployment:

  • Remote inference enables using lightweight clients with only the necessary functions to perform API calls to edge or cloud deployments.
  • Applications are independent of the model framework, hardware device, and infrastructure.
  • Client applications in any programming language that supports REST or gRPC calls can be used to run inference remotely on the model server.
  • Clients require fewer updates since client libraries change very rarely.
  • Model topology and weights are not exposed directly to client applications, making it easier to control access to the model.
  • Ideal architecture for microservices-based applications and deployments in cloud environments – including Kubernetes and OpenShift clusters.
  • Efficient resource utilization with horizontal and vertical inference scaling.

OVMS diagram

OpenVINO™ Model Server (OVMS) is a high-performance system for serving models. Implemented in C++ for scalability and optimized for deployment on Intel architectures, the model server uses the same architecture and API as TensorFlow Serving and KServe while applying OpenVINO for inference execution. Inference service is provided via gRPC or REST API, making deploying new algorithms and AI experiments easy.

OVMS picture

The models used by the server need to be stored locally or hosted remotely by object storage services. For more details, refer to Preparing Model Repository documentation. Model server works inside Docker containers, on Bare Metal, and in Kubernetes environment. Start using OpenVINO Model Server with a fast-forward serving example from the Quickstart guide or explore Model Server features.

Read release notes to find out what’s new.

Key features:

Note: OVMS has been tested on RedHat, and Ubuntu. The latest publicly released docker images are based on Ubuntu and UBI. They are stored in:

Run OpenVINO Model Server

A demonstration on how to use OpenVINO Model Server can be found in our quick-start guide. For more information on using Model Server in various scenarios you can check the following guides:

References

Contact

If you have a question, a feature request, or a bug report, feel free to submit a Github issue.


* Other names and brands may be claimed as the property of others.

openvino_model_server's People

Contributors

dkalinowski avatar dtrawins avatar atobiszei avatar rasapala avatar mzegla avatar michalkulakowski avatar bstrzele avatar mkuczyns11 avatar mgumowsk avatar ficol avatar jasiu86 avatar intel-rrozestw avatar ngrozae avatar ncybulsk avatar dszyfelb avatar joannanosek avatar waldekpi avatar ksankiew avatar sgolebiewski-intel avatar mwilkows avatar jszczepa avatar ravikumarbhattiprolu avatar krzyczar avatar jacob27 avatar ngaloppo avatar pgladkows avatar stevegrubb avatar tsavina avatar ryanloney avatar waitingkuo avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.