Comments (11)
I have the same problem, has anyone solved it?
from on-policy.
I have the same problem, has anyone solved it?
it should be modified.
from on-policy.
but what to change? the codes below only need x
from on-policy.
but what to change? the codes below only need x
maybe:
def forward(self, x, a=None):
from on-policy.
but what to change? the codes below only need x
maybe:
python '''
def forward(self, x, a=None):
'''
sorry, it doesn't work
Traceback (most recent call last):
File "/home/qss/syo/onpolicy/scripts/train/train_mpe.py", line 169, in
main(sys.argv[1:])
File "/home/qss/syo/onpolicy/scripts/train/train_mpe.py", line 154, in main
runner.run()
File "/home/qss/syo/onpolicy/runner/shared/mpe_runner.py", line 28, in run
values, actions, action_log_probs, rnn_states, rnn_states_critic, actions_env = self.collect(step)
File "/home/qss/.conda/envs/syo/lib/python3.6/site-packages/torch/autograd/grad_mode.py", line 15, in decorate_context
return func(*args, **kwargs)
File "/home/qss/syo/onpolicy/runner/shared/mpe_runner.py", line 121, in collect
raise NotImplementedError
NotImplementedError
from on-policy.
The code above also needs to be changed
# raise NotImplementedError
actions_env = actions
from on-policy.
@sinizu would you mind give me your emal please. there are still some problems i want to talk with you.
from on-policy.
You can develop it just like discrete action implement. But i can not get satisfied result when i use the modified code.
from on-policy.
@sinizu would you mind give me your emal please. there are still some problems i want to talk with you.
You can leave yours. I'll text you when I'm free
from on-policy.
You can leave yours. I'll text you when I'm free
thank a lot! I've successfully changed it to continuous action space. however the result is slightly inferior compared with discrete version.
from on-policy.
@Shenyiou
would you mind share your code or tell what you have changed? That would help a lot
from on-policy.
Related Issues (20)
- Shape of buffered log_probs
- Action mask! HOT 1
- 无法解析导入“wandb” HOT 1
- Why share_obs in shared/mpe_runner.py just repeats obs when self.use_centralized_V set to true HOT 1
- Does the state of the grid world need to be normalized? HOT 1
- shared_buffer.py is an outdated file
- Questions on the episode length of 1000 on gfootball env instead of a maximum env limit of 400 HOT 1
- render_mpe produces much worse performance than training or evaluation HOT 2
- Doubts about the results of rmappo/mappo in SMAC HOT 1
- __init__() got multiple values for argument 'device' HOT 2
- Runtime Error: Tensors on Different Devices (cuda:0 and cpu) HOT 1
- HAPPO为何与mappo的policy文件完全相同
- 模型性能问题 HOT 3
- 请问我该如何保存replay?
- question about reply buffer size in MAPPO
- missing "onpolicy.runner.separated.hanabi_runner_forward" HOT 1
- NAN HOT 2
- When I tried to train the code for smacv2, I encountered this error message——AssertionError: check recurrent policy! HOT 1
- Error when run ./train_mpe_spread.sh HOT 2
- Model... HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from on-policy.