Comments (2)
I've fixed this by depending on `tch` version 0.15 and adding `tch::maybe_init_cuda()` to the start of `main()`. This seems to stop the linker from removing the `libtorch_cuda` dependency, which is what causes that error message (at least in my case).
This problem could definitely be documented better; it took me a couple of hours to figure this out.
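For anyone landing here, the workaround described above might look roughly like the sketch below. It assumes `tch = "0.15"` is listed in `Cargo.toml` and that a CUDA-enabled libtorch is installed; the `Cuda::is_available()` check is only added for illustration and is not part of the fix itself.

```rust
// Minimal sketch, assuming `tch = "0.15"` in Cargo.toml and a CUDA build of libtorch.
fn main() {
    // Call this first: it references the CUDA library so the linker
    // keeps the libtorch_cuda dependency instead of dropping it.
    tch::maybe_init_cuda();

    // Illustration only: report whether CUDA is visible to tch.
    println!("CUDA available: {}", tch::Cuda::is_available());
}
```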
from burn.
I agree that when running into issues with `tch`, the actual error is never really clear.
What happens most often is trying to use the CUDA version when the environment variable was set in another shell (so it is not persistent): you run your program and get an error similar to the one you posted. Cargo gets confused, and the re-resolution of `tch-rs` based on changes to the environment variable never seemed to work for me, so I end up cleaning the cache and rebuilding the package.
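A quick way to spot the "set in another shell" case is to check the variables from the shell you are about to build in. The sketch below is only an illustration: the thread does not name the variable, so `LIBTORCH` and `TORCH_CUDA_VERSION` (both read by the `tch-rs` build) are assumed here, and `missing_vars` is a hypothetical helper.

```rust
use std::env;

// Assumed variable names; the thread does not say which one was involved.
const TORCH_VARS: [&str; 2] = ["LIBTORCH", "TORCH_CUDA_VERSION"];

/// Returns the names that are unset or empty in the current environment.
fn missing_vars(names: &[&str]) -> Vec<String> {
    let mut missing = Vec::new();
    for name in names {
        let is_set = env::var(name)
            .map(|value| !value.trim().is_empty())
            .unwrap_or(false);
        if !is_set {
            missing.push((*name).to_string());
        }
    }
    missing
}

fn main() {
    let missing = missing_vars(&TORCH_VARS);
    if missing.is_empty() {
        println!("All checked variables are set in this shell.");
    } else {
        // Exporting the variable in this shell and then rebuilding from a
        // clean cache is the workaround described in the comment above.
        println!("Not set in this shell: {}", missing.join(", "));
    }
}
```

Because these variables are consumed at build time, the check is only meaningful when run from the same shell that later runs the build.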
We tried to improve the setup, but the environment variables are required by `tch-rs`, so it is not straightforward to circumvent (I tried). We could definitely add some documentation for common issues at the very least. The best we can do about the error message from the torch side is probably to match the generic error message and give some tips/cues.
We're open to suggestions!
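As one possible shape for the "match the generic error message and give some tips" idea, here is a rough sketch. The patterns and hint texts are hypothetical, since the exact torch error wording is not quoted in this thread, and `with_hint` is not an existing burn or `tch` function.

```rust
// Hypothetical (pattern, hint) pairs; the real error text may differ.
const HINTS: [(&str, &str); 2] = [
    (
        "cuda",
        "check that the CUDA-related environment variable is set in this shell, then clean and rebuild",
    ),
    (
        "libtorch",
        "check that libtorch is installed and visible to the build",
    ),
];

/// Appends the first matching hint to the error text, or returns it unchanged.
fn with_hint(error: &str) -> String {
    let lowered = error.to_lowercase();
    for (pattern, hint) in HINTS.iter() {
        if lowered.contains(*pattern) {
            return format!("{}\nhint: {}", error, hint);
        }
    }
    error.to_string()
}

fn main() {
    // Made-up error string, used only to exercise the matcher.
    println!("{}", with_hint("Could not run the op with the CUDA backend"));
}
```

Matching on substrings is fragile, so this would complement documentation of common issues, not replace it.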