Comments (11)
What should I do if my the newest version of cuda my computer supports is 11.2?
If your hardware supports CUDA 11.2, it should support 11.6. Just follow the pip installation (it should install CUDA and all required libraries into your virtual environment).
from gandlf.
How did you install PyTorch? If you used Conda, can you try using pip [ref]?
pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116
from gandlf.
What should I do if my the newest version of cuda my computer supports is 11.2?
from gandlf.
So that fixed that, but now I'm getting the following:
Looping over training data for penalty calculation: 0%| | 0/2426 [00:00<?, ?it/s]/cbica/projects/DBT_AI/.conda/envs/venv_gandlf_new/lib/python3.8/site-packages/torchio/data/io.py:36: UserWarning: Error loading image with SimpleITK:
Exception thrown in SimpleITK ImageFileReader_Execute: /tmp/SimpleITK-build/ITK-prefix/include/ITK-5.2/itkImportImageContainer.hxx:192:
Failed to allocate memory for image.
Trying NiBabel...
warnings.warn(message)
Looping over training data for penalty calculation: 0%| | 0/2426 [00:00<?, ?it/s]
ERROR:
from gandlf.
Hmmmm, This seems like an interesting error. Can you please let us know your ITK version and the output of
head -n 2 data.csv
for assisting you better?
from gandlf.
subjectID,channel_0,label
0,/cbica/projects/DBT_AI/Data/masks/75712684_PROC_LCC_RC_2/75712684_PROC_LCC_RC_mat.nii.gz,/cbica/projects/DBT_AI/Data/masks/75712684_PROC_LCC_RC_2/75712684_PROC_LCC_RC_mask.nii.gz
ITK verison: 3.8.0
from gandlf.
Can you mention the output of the following command:
# activate gandlf python environment
python -c "import SimpleITK as sitk;image=sitk.ReadImage('/cbica/projects/DBT_AI/Data/masks/75712684_PROC_LCC_RC_2/75712684_PROC_LCC_RC_mat.nii.gz');print(image.GetSize());mask=sitk.ReadImage('/cbica/projects/DBT_AI/Data/masks/75712684_PROC_LCC_RC_2/75712684_PROC_LCC_RC_mask.nii.gz');print(mask.GetSize())"
Also, it would be great if you can post at least the mask so that we can debug further.
from gandlf.
(1996, 2457, 73)
(1996, 2457, 73)
What do you mean by posting the mask?
from gandlf.
This is also at the top of the error file:
No NVIDIA kernel driver module found, skipping CUDA
from gandlf.
(1996, 2457, 73)
(1996, 2457, 73)
Hmm, if the piece of code I replied with is giving this output, it means that the IO is working as expected.
What do you mean by posting the mask?
I meant uploading it here for us to debug. But it doesn't matter, since the IO is working correctly (as seen from the output of the command I sent).
No NVIDIA kernel driver module found, skipping CUDA
This is not unrelated to GaNDLF, and is dependent on the host machine.
from gandlf.
I tried running the same job with 1/5 of the training data and it was able to run without an error, however, I'm getting this:
Epoch Final train loss : 1.0
Epoch Final train dice : 0.0
Epoch Final train dice_per_label : [0.0, 0.0]
Epoch Final train iou : 0.34576352043151853
Epoch Final train f1 : 0.5872102342128753
from gandlf.
Related Issues (20)
- Some code styling through Ruff for linting HOT 7
- Standardize commenting style
- Fix some code style issues reported by codacy HOT 1
- Add a script to generate information useful for debugging
- add WarmupCosineSchdule Scheduler HOT 2
- Memory build-up on various locations HOT 4
- Add DCGAN architecture
- GAN metrics
- Compute utilities for incorporating generative networks
- Add remaining files and functionalities for GANs
- Add unit testing for GANs
- Update GANDLF functionalities for compatibility with GAN pipelines
- AUROC error while running classification of pathology images HOT 5
- [FEATURE] Set `line-length` for `black` in the `project.toml` file HOT 1
- Config documentation for GANs
- Black configuration in pyproject.toml
- [FEATURE] Add the ability to split CSVs for training/validation/testing as a separate script HOT 1
- [FEATURE] Add the ability to generate training/validation/testing CSV with proportional splits HOT 1
- [FEATURE] Add tensorboard support HOT 2
- [FEATURE] speed up CI tests HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gandlf.