
Comments (7)

rgeorgej commented on August 28, 2024

When running inside Docker, you need to ensure that the NVIDIA driver module is loaded on the host. We normally run deviceQuery on the host to load the NVIDIA kernel module:

  1. Launch the instance with AMI ami-d6f2e6bc.
  2. Build deviceQuery and run it on the instance with the following commands:

     cd NVIDIA_CUDA-7.0_Samples/1_Utilities/deviceQuery
     make
     ./deviceQuery

  3. Start the container:

     sudo docker run -it --privileged amazon/dsstne /bin/bash
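
The host-side check behind these steps can be sketched as a small shell helper. This is only an illustration: the function name `nvidia_module_loaded` and its optional file argument are hypothetical and not part of DSSTNE. It assumes a Linux host, where `/proc/modules` lists the loaded kernel modules one per line, starting with the module name.

```shell
#!/bin/sh
# Sketch (hypothetical helper, not part of DSSTNE): confirm that the NVIDIA
# kernel module is loaded on the host before starting the container.

# Returns 0 if a module named exactly "nvidia" appears in the modules list.
# The optional argument is the file to read; it defaults to /proc/modules.
nvidia_module_loaded() {
    modules_file="${1:-/proc/modules}"
    # Each line starts with the module name followed by a space, so the
    # trailing space keeps names such as nvidia_uvm from matching.
    [ -r "$modules_file" ] && grep -q '^nvidia ' "$modules_file"
}

# Usage on the host:
#   if nvidia_module_loaded; then
#       sudo docker run -it --privileged amazon/dsstne /bin/bash
#   else
#       echo "nvidia module not loaded; run ./deviceQuery first" >&2
#   fi
```

If the check fails, running deviceQuery as in step 2 should load the module, after which the container can be started as in step 3.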

from amazon-dsstne.

rgeorgej commented on August 28, 2024

Because of the confusion, I have also updated the setup documentation for future reference.

Please let us know if you are still blocked.


hamukazu commented on August 28, 2024

Thank you for a very quick reply.

3. sudo docker run -it --privileged amazon/dsstne /bin/bash

I found the cause: I had not put "sudo" before "docker".

Now everything is working.

I think this should be explained in the setup document. It is not clear to people who are not familiar with Docker or CUDA.


Claire-Kelley commented on August 28, 2024

I am having a similar problem, and hamukazu's solution doesn't work for me.

I am following the instructions in the setup file for running DSSTNE through Docker.

On my AWS instance, ./deviceQuery works: it reports "Detected 4 CUDA Capable device(s)", lists them, and finally says that it passed.
I can also inspect my devices and driver with nvidia-smi, and everything appears to be correct.

I then build the docker image and start it using:

sudo docker run -it --privileged amazon/dsstne /bin/bash

However, when I run the sample code in the Docker container, every step runs until the train step, which fails with:
GpuContext::Startup: Process 0 out of 1 initialized.
cudaGetDeviceCount failed no CUDA-capable device is detected

If I try to list the NVIDIA devices inside the container with nvidia-smi, it fails with:
Failed to initialize NVML: GPU access blocked by the operating system

Do you have any advice about how I might fix this?


rgeorgej commented on August 28, 2024

Did you install the same driver version on the host as the one inside the Docker image?
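
One way to check this could look like the sketch below. It is an illustration only: the helper names `kernel_driver_version` and `same_driver_version` are hypothetical, and it assumes the loaded kernel module reports itself in `/proc/driver/nvidia/version` with a line of the form `NVRM version: NVIDIA UNIX x86_64 Kernel Module  <version>  <date>`, which other driver releases may format differently. The version of the driver libraries inside the image has to be supplied by hand.

```shell
#!/bin/sh
# Sketch (hypothetical helpers): compare the version of the loaded NVIDIA
# kernel module with the driver version installed inside the Docker image.

# Prints the version found on the "NVRM version:" line of the given file.
# The optional argument defaults to /proc/driver/nvidia/version.
kernel_driver_version() {
    version_file="${1:-/proc/driver/nvidia/version}"
    sed -n 's/^NVRM version:.*Kernel Module  *\([0-9][0-9.]*\).*/\1/p' \
        "$version_file"
}

# Returns 0 only if both version strings are non-empty and identical.
same_driver_version() {
    [ -n "$1" ] && [ "$1" = "$2" ]
}

# Usage, with IMAGE_VERSION set by hand to the driver version in the image:
#   if same_driver_version "$(kernel_driver_version)" "$IMAGE_VERSION"; then
#       echo "driver versions match"
#   else
#       echo "driver versions differ" >&2
#   fi
```

A mismatch reported here would point to reinstalling the host driver so that it matches the image, or the other way round.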


rgeorgej commented on August 28, 2024

@Claire-Kelley Are you still blocked on this?


Claire-Kelley commented on August 28, 2024

Sorry for the slow response! Yes, I had installed the same driver version as in the Docker image.

I never did figure out what was causing my particular issue. I think I must have installed something improperly and then been unable to uninstall it. I ended up wiping the instance and starting again with a new one, which worked as expected. Thanks for your help!

