Comments (3)
Hi,
Running the run.py in both the main directory and in the MultiGPU directory leads me to have an error:
We haven't finished checking the multi-GPU version yet, so we put these files in elegantrl/AgentZoo/ElegantRL-MultiGPU.
Once all the checks are done, we'll update it directly to the elegantrl directory.
I just wanted to make sure you saw that I referred to both the main directory and the MultiGPU directory. Out of the box, elegantrl/run.py does not work for me; it fails with the same out-of-memory error. I tried both to see if either example would work.
I am unable to find a configuration that works. I have tried lowering the net size, batch size, rollout size, etc.
Traceback (most recent call last):
  File "/usr/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap
    self.run()
  File "/usr/lib/python3.8/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "elegantrl/run.py", line 414, in mp_train
    agent.init(net_dim, state_dim, action_dim)
  File "/home/kyle/trading/erl2/ElegantRL/elegantrl/agent.py", line 687, in init
    self.act = ActorPPO(net_dim, state_dim, action_dim).to(self.device)
  File "/home/kyle/.virtualenv/lib/python3.8/site-packages/torch/nn/modules/module.py", line 673, in to
    return self._apply(convert)
  File "/home/kyle/.virtualenv/lib/python3.8/site-packages/torch/nn/modules/module.py", line 387, in _apply
    module._apply(fn)
  File "/home/kyle/.virtualenv/lib/python3.8/site-packages/torch/nn/modules/module.py", line 387, in _apply
    module._apply(fn)
  File "/home/kyle/.virtualenv/lib/python3.8/site-packages/torch/nn/modules/module.py", line 409, in _apply
    param_applied = fn(param)
  File "/home/kyle/.virtualenv/lib/python3.8/site-packages/torch/nn/modules/module.py", line 671, in convert
    return t.to(device, dtype if t.is_floating_point() or t.is_complex() else None, non_blocking)
RuntimeError: CUDA error: out of memory
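The traceback fails while moving a network to the GPU, and lowering the net size did not help, so one possible explanation (an assumption, not something confirmed in this thread) is that the GPU had almost no free memory before training started. A minimal diagnostic sketch along those lines follows. The helper names `estimate_param_bytes`, `query_free_bytes`, and `choose_device` are hypothetical and not part of ElegantRL, and the layer widths are made up for illustration rather than taken from `ActorPPO`. The only PyTorch calls used are `torch.cuda.is_available`, `torch.cuda.device_count`, and `torch.cuda.mem_get_info`; the last one needs a reasonably recent PyTorch release.

```python
try:
    import torch  # optional: only needed to query real GPUs
except ImportError:
    torch = None


def estimate_param_bytes(layer_dims, bytes_per_param=4):
    """Rough size of a fully connected net's weights and biases (float32 by default)."""
    pairs = zip(layer_dims, layer_dims[1:])
    return sum((n_in * n_out + n_out) * bytes_per_param for n_in, n_out in pairs)


def query_free_bytes():
    """Free memory per visible GPU, or [] if CUDA (or torch) is unavailable."""
    if torch is None or not torch.cuda.is_available():
        return []
    free = []
    for i in range(torch.cuda.device_count()):
        try:
            free.append(torch.cuda.mem_get_info(i)[0])  # (free, total) in bytes
        except (RuntimeError, AttributeError):
            free.append(0)  # query failed on this GPU; treat it as full
    return free


def choose_device(required_bytes, free_bytes_per_gpu, headroom_bytes=0):
    """Pick the GPU with the most free memory if the request fits, else 'cpu'."""
    if not free_bytes_per_gpu:
        return "cpu"
    best = max(range(len(free_bytes_per_gpu)), key=free_bytes_per_gpu.__getitem__)
    if free_bytes_per_gpu[best] >= required_bytes + headroom_bytes:
        return f"cuda:{best}"
    return "cpu"


if __name__ == "__main__":
    # Hypothetical layer widths: state_dim=8, two hidden layers of 256, action_dim=2.
    needed = estimate_param_bytes([8, 256, 256, 2])
    # The 512 MiB headroom is an arbitrary allowance for CUDA context overhead.
    print(needed, choose_device(needed, query_free_bytes(), headroom_bytes=512 * 2**20))
```

If this reports "cpu" on a machine with a GPU, checking which other processes hold GPU memory is a reasonable next step before changing any training hyperparameters.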
We have fully upgraded ElegantRL, and it now supports multi-GPU training (1 to 8 GPUs).
We have also optimized the architecture of the library so that it takes up less GPU memory than before.
The problem you mentioned has now been resolved. I'm sorry that we have been busy developing the 80 GPU version (cloud platform) of ElegantRL and were unable to reply to you in time.
I will close this issue in 3 days.