Comments (2)
Hi @dammo001 . Sorry to hear you're having problems.
Are you running this in a SageMaker notebook instance? If so, what type? It mentions at the top that it was tested and run on an ml.p2.xlarge, but maybe we should be more explicit that the ctx = mx.gpu()
statement in the first code block of the "Set Parameters" section requires a GPU instance. If you are running in a CPU instance can you try setting this to ctx = mx.cpu()
or testing on an ml.p2.xlarge instance?
If you're running from a GPU machine that's not a SageMaker notebook instance, perhaps this GitHub issue could help? It looks like reinstalling the CUDA driver is a possible solution.
Thanks!
from amazon-sagemaker-examples.
Closing for now. Feel free to re-open if you have additional questions.
from amazon-sagemaker-examples.
Related Issues (20)
- [Bug Report] RuntimeError: Dataset not found. You can use download=True to download it for pytorch minist horovod
- Dataset not working in example in notebook A Move Amazon SageMaker Autopilot ML models from experimentation to production using Amazon SageMaker Pipelines
- Broken lnks HOT 1
- How do you use the custom generator to train the TensorFlow model on PageMaker?
- [Example Request] Minimal Example for Fine Tuning a LLM with FSDP utilizing the HuggingFace Trainer
- [Bug Report] Forbidden(403) on Introduction to JumpStart - Sentence Pair Classification
- getting error:
- Getting "TypeError: can only join an iterable" while running "print(predictor.predict(test_data).decode("utf-8"))"
- [Bug Report] Example notebook has incorrectly formatted serving.properties
- AttributeError: module 'pandas.core.strings' has no attribute 'StringMethods'
- Inference Recommender Job fails
- [Bug Report]Error with using dgl library in Sagemaker
- Deploy this TheBloke/vicuna-13B-v1.5-GGUF model on AWS
- Parameter validation failed: Unknown parameter in PrimaryContainer HOT 2
- [Bug Report] - README - Train EleutherAI GPT-J with Model Parallel Link Broken
- smddp_deepspeec_example doesn't run because of dependency issues.
- Unable to download model artifacts due to 403 forbidden error
- Alter JupyterLab dockerfile to block target domain / IP from running contiainer
- [Bug Report] RuntimeError when running instruction fine-tuning on mistral 7b, Sagemaker Jumpstart
- Torch not compiled with CUDA enabled when deploying T5 using Triton
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from amazon-sagemaker-examples.