
yolov8-triton's Introduction

Overview

This repository provides an ensemble model that combines a YoloV8 model, exported from the Ultralytics repository, with non-maximum suppression (NMS) post-processing. The NMS post-processing code in models/postprocess/1/model.py is adapted from the Ultralytics ONNX Example.

For more information about Triton's Ensemble Models, see the Triton documentation on Architecture.md and its preprocessing examples.
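The core of the post-processing step is greedy non-maximum suppression. As an illustration only (not the actual code in models/postprocess/1/model.py), here is a minimal NumPy sketch of NMS over [x1, y1, x2, y2] boxes:

```python
import numpy as np

def nms(boxes: np.ndarray, scores: np.ndarray, iou_threshold: float = 0.45) -> list[int]:
    """Greedy non-maximum suppression.

    boxes:  (N, 4) array of [x1, y1, x2, y2] corners.
    scores: (N,) confidence scores.
    Returns the indices of the boxes to keep, highest score first.
    """
    order = scores.argsort()[::-1]  # process boxes from highest to lowest score
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(int(i))
        if order.size == 1:
            break
        rest = order[1:]
        # Intersection of the current top box with all remaining boxes
        xx1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        yy1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        xx2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        yy2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.clip(xx2 - xx1, 0, None) * np.clip(yy2 - yy1, 0, None)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        area_r = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_i + area_r - inter)
        # Drop boxes that overlap the kept box too strongly
        order = rest[iou <= iou_threshold]
    return keep
```

The IoU threshold here plays the same role as the NMS threshold mentioned in the Quick Start below: lower values suppress more overlapping detections.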

Directory Structure

models/
    yolov8_onnx/
        1/
            model.onnx
        config.pbtxt
        
    postprocess/
        1/
            model.py
        config.pbtxt
        
    yolov8_ensemble/
        1/
            <Empty Directory>
        config.pbtxt
README.md
main.py
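To make the ensemble wiring concrete, here is a rough sketch of what a yolov8_ensemble/config.pbtxt could look like. The tensor names (images, output0, detections) and shapes are illustrative assumptions, not the repository's actual values; consult the shipped config.pbtxt files for the real ones:

```
name: "yolov8_ensemble"
platform: "ensemble"
max_batch_size: 0
input [
  {
    name: "images"              # assumed input tensor name
    data_type: TYPE_FP32
    dims: [ 1, 3, 640, 640 ]    # update if your input resolution changes
  }
]
output [
  {
    name: "detections"          # assumed final output name
    data_type: TYPE_FP32
    dims: [ -1, 6 ]             # e.g. x1, y1, x2, y2, score, class
  }
]
ensemble_scheduling {
  step [
    {
      model_name: "yolov8_onnx"
      model_version: -1
      input_map { key: "images" value: "images" }
      output_map { key: "output0" value: "raw_output" }
    },
    {
      model_name: "postprocess"
      model_version: -1
      input_map { key: "raw_output" value: "raw_output" }
      output_map { key: "detections" value: "detections" }
    }
  ]
}
```

The ensemble itself holds no weights, which is why yolov8_ensemble/1 is an empty directory: the config only routes tensors from the ONNX model into the Python post-processing model.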

Quick Start

  1. Install Ultralytics and TritonClient:

     pip install ultralytics==8.0.51 tritonclient[all]==2.31.0

  2. Export a model to ONNX format:

     yolo export model=yolov8n.pt format=onnx dynamic=True opset=16

  3. Rename the model file to model.onnx and place it under the /models/yolov8_onnx/1 directory (see directory structure above).

  4. (Optional) Update the score and NMS thresholds in models/postprocess/1/model.py.

  5. (Optional) Update the models/yolov8_ensemble/config.pbtxt file if your input resolution has changed.

  6. Build the Docker container for Triton Inference Server:

     DOCKER_NAME="yolov8-triton"
     docker build -t $DOCKER_NAME .

  7. Run Triton Inference Server:

     DOCKER_NAME="yolov8-triton"
     docker run --gpus all \
         -it --rm \
         --net=host \
         -v ./models:/models \
         $DOCKER_NAME

  8. Run the script with python main.py. The overlay image will be written to output.jpg.
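Before a frame can be sent to the server, it must be resized and padded to the network's input shape. As a rough, dependency-free illustration of the client-side preparation that main.py presumably performs (the real script may use OpenCV and different padding), here is a YOLOv8-style letterbox sketch in NumPy:

```python
import numpy as np

def preprocess(image: np.ndarray, size: int = 640) -> np.ndarray:
    """Letterbox an HxWx3 uint8 image into a 1x3xSxS float32 tensor in [0, 1].

    Uses nearest-neighbour resizing via integer index maps so the sketch
    needs only NumPy; a real client would typically call cv2.resize.
    """
    h, w = image.shape[:2]
    scale = size / max(h, w)
    nh, nw = int(round(h * scale)), int(round(w * scale))
    rows = (np.arange(nh) / scale).astype(int).clip(0, h - 1)
    cols = (np.arange(nw) / scale).astype(int).clip(0, w - 1)
    resized = image[rows][:, cols]
    # Grey padding (114 is the conventional YOLO letterbox fill value)
    canvas = np.full((size, size, 3), 114, dtype=np.uint8)
    top, left = (size - nh) // 2, (size - nw) // 2
    canvas[top:top + nh, left:left + nw] = resized
    tensor = canvas.astype(np.float32) / 255.0
    return tensor.transpose(2, 0, 1)[None]  # HWC -> 1x3xHxW
```

The resulting tensor matches the [1, 3, 640, 640] input shape the ONNX export above produces by default; the same scale and offsets must be inverted to map detections back onto the original image.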

yolov8-triton's People

Contributors

mattwoods, omarabid59


yolov8-triton's Issues

When I start the Docker container, I get this error. Please help me.

I0523 10:54:25.345383 1 server.cc:264] Waiting for in-flight requests to complete.
I0523 10:54:25.345392 1 server.cc:280] Timeout 30: Found 0 model versions that have in-flight inferences
I0523 10:54:25.345416 1 server.cc:295] All models are stopped, unloading models
I0523 10:54:25.345419 1 server.cc:302] Timeout 30: Found 1 live models and 0 in-flight non-inference requests
I0523 10:54:25.345488 1 onnxruntime.cc:2640] TRITONBACKEND_ModelInstanceFinalize: delete instance state
I0523 10:54:25.348370 1 onnxruntime.cc:2586] TRITONBACKEND_ModelFinalize: delete model state
I0523 10:54:25.348400 1 model_lifecycle.cc:579] successfully unloaded 'yolov8_onnx' version 1
I0523 10:54:26.345508 1 server.cc:302] Timeout 29: Found 0 live models and 0 in-flight non-inference requests
error: creating server: Internal - failed to load all models

'yolov8_ensemble' is not found

Hi.
I'm trying to start up your project.
I had to correct the Dockerfile by adding

EXPOSE 8000
EXPOSE 8001
EXPOSE 8002

After starting your server from the Dockerfile, I try to connect to it by running main.py,
but I cannot connect to the running server.
Exception is tritonclient.utils.InferenceServerException: [StatusCode.NOT_FOUND] Request for unknown model: 'yolov8_ensemble' is not found.

My directory tree is shown in a screenshot (not reproduced here).
I've tried both creating and not creating the /1 folder in yolov8_ensemble.

Can you please explain how it should be started, or what I need to correct?

@MattWoods
@omarabid59

CUDA driver version

Hello, I'm new to this field.
When I try to run this repo, I get an error like:
ERROR: This container was built for NVIDIA Driver Release 525.85 or later, but
version 470.199.02 was detected and compatibility mode is UNAVAILABLE.
Can you tell me how to fix it? Should I change the tritonserver version in the Dockerfile?
FROM nvcr.io/nvidia/tritonserver:23.02-py3

Unable to find 'libtriton_tensorrt_plan.so' for model 'object_detection', searched: /models/object_detection/1, /models/object_detection, /opt/tritonserver/backends/tensorrt_plan


I am getting the above error while running the Triton server. I used the YOLOv8 conversion method to convert my custom-trained model into a TensorRT engine, and I am using the NVIDIA Triton image to run the server. I have placed the model inside the repository as model.engine.

Where should I keep the .so file? Mine is named "libyolo_layer.so" — where should I put it?
