Giter Club home page Giter Club logo

Comments (9)

wilderfield avatar wilderfield commented on September 15, 2024

from ml-suite.

esberglu avatar esberglu commented on September 15, 2024

I'm actually hitting this using the demo caffemodel as well, not just our custom model.

from ml-suite.

adki avatar adki commented on September 15, 2024

I did see the same error when I ran YOLO example. It happened when I ran YOLO twice and the second run ended with the error.
So, I stopped F1 instance and then stated F1. Then it runs.
There might be something missing to de-allocate/release resources or cleanup steps after run.

Click to expand
... ...
[XDNN] kernel configuration
[XDNN]   num cores       : 1
[XDNN]   dsp array width : 56
[XDNN]   img mem size    : 5 MB
[XDNN]   version         : 2.2
[XDNN]   8-bit mode      : 0
[XDNN]   Max Image W/H   : 1023
[XDNN]   Max Image Depth : 4095

Loading weights/bias/quant_params to FPGA...
ERROR: Failed to allocate A device memory

from ml-suite.

wilderfield avatar wilderfield commented on September 15, 2024

from ml-suite.

esberglu avatar esberglu commented on September 15, 2024

I was able to run with the demo yolo model after restarting my F1 instance as well.

from ml-suite.

esberglu avatar esberglu commented on September 15, 2024

Some more information. After restarting my instance I am able to run the demo with the default model successfully multiple times in succession. But if I try to use my custom yolo model, I get the error no matter what (after a restart and after successful default model run). Once I hit the error using my custom model, any subsequent runs (default or custom) will also hit the error.

My guess is that there is an issue with the custom model that is causing this failure which puts the system in a bad state which is resolved by a restart. So really there are two issues here. 1) What is wrong with our model that is breaking the demo. 2) Why is the initial failure causing all subsequent runs to fail.

from ml-suite.

wilderfield avatar wilderfield commented on September 15, 2024

from ml-suite.

esberglu avatar esberglu commented on September 15, 2024

We had an issue with the caffemodel datadir having an extra directory in the path when using our custom model. The result was that the weights weren't being found. That's why we would see the issue initially. Still not sure why the initial failure causes all subsequent runs to fail.

from ml-suite.

cjvolzka avatar cjvolzka commented on September 15, 2024

I recently hit this as well experimenting on the yolo demo. I'm not sure if my situation is the same as others but running fpga-clear-local-image -S 0 cleared the loaded AFI image from the FPGA and fixed it without needing to restart the whole instance. (Note: I only have one FPGA in my instance and -S 0 clears the FPGA at the first slot. If you have multiple FPGAs, you might need to adjust the slot number)

from ml-suite.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.