Comments (9)
from ml-suite.
I'm actually hitting this using the demo caffemodel as well, not just our custom model.
from ml-suite.
I did see the same error when I ran YOLO example. It happened when I ran YOLO twice and the second run ended with the error.
So, I stopped F1 instance and then stated F1. Then it runs.
There might be something missing to de-allocate/release resources or cleanup steps after run.
Click to expand
... ...
[XDNN] kernel configuration
[XDNN] num cores : 1
[XDNN] dsp array width : 56
[XDNN] img mem size : 5 MB
[XDNN] version : 2.2
[XDNN] 8-bit mode : 0
[XDNN] Max Image W/H : 1023
[XDNN] Max Image Depth : 4095
Loading weights/bias/quant_params to FPGA...
ERROR: Failed to allocate A device memory
from ml-suite.
from ml-suite.
I was able to run with the demo yolo model after restarting my F1 instance as well.
from ml-suite.
Some more information. After restarting my instance I am able to run the demo with the default model successfully multiple times in succession. But if I try to use my custom yolo model, I get the error no matter what (after a restart and after successful default model run). Once I hit the error using my custom model, any subsequent runs (default or custom) will also hit the error.
My guess is that there is an issue with the custom model that is causing this failure which puts the system in a bad state which is resolved by a restart. So really there are two issues here. 1) What is wrong with our model that is breaking the demo. 2) Why is the initial failure causing all subsequent runs to fail.
from ml-suite.
from ml-suite.
We had an issue with the caffemodel datadir having an extra directory in the path when using our custom model. The result was that the weights weren't being found. That's why we would see the issue initially. Still not sure why the initial failure causes all subsequent runs to fail.
from ml-suite.
I recently hit this as well experimenting on the yolo demo. I'm not sure if my situation is the same as others but running fpga-clear-local-image -S 0
cleared the loaded AFI image from the FPGA and fixed it without needing to restart the whole instance. (Note: I only have one FPGA in my instance and -S 0 clears the FPGA at the first slot. If you have multiple FPGAs, you might need to adjust the slot number)
from ml-suite.
Related Issues (20)
- Update overlays HOT 1
- Unable to select device id HOT 1
- shell firmware for alveo-U50 HOT 1
- overlay bin for U50 HOT 2
- machine hangs after running notebook HOT 1
- notebook doesn't clean up properly after finishing
- overlay bin mismatch (docker/xilinx)
- missing source code
- non-existent script
- googlenet_v1 result discrepancy
- source code for XRT
- Couldn't check accuracy of quantized tensorflow model HOT 2
- How to inference TensorFlow example over FPGA with container and without container ?
- yolov3-tiny no model
- After testing ,I found pool performance on the fpga-card,is that normal? HOT 5
- Benchmark with Alveo Vs CPU VsGPU
- AttributeError: 'module' object has no attribute 'createHandle'
- Error in Compiler steps HOT 1
- Alveo U50 ml-suite Unknown: exceptions.RuntimeError: Could not init FPGA: xclbin HOT 2
- Failure 'origBiasSize == outChans' loading weights to FPGA
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ml-suite.