Hi, can MAPPO be used for continuous action space? How can I do this?When I change dis

I have the same problem, has anyone solved it? <p dir="

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

<a class="user-mention notranslate" data-hovercard-type="user" data-hover

You can leave yours. I'll text you when I'm free <p dir

continuous action space about on-policy HOT 11 CLOSED

marlbenchmark commented on July 26, 2024

continuous action space

from on-policy.

Comments (11)

wanghui589 commented on July 26, 2024

I have the same problem, has anyone solved it?

from on-policy.

sinizu commented on July 26, 2024

I have the same problem, has anyone solved it?

on-policy/onpolicy/algorithms/utils/distributions.py

Line 82 in 9677c91

def forward(self, x):

it should be modified.

from on-policy.

Shenyiou commented on July 26, 2024

but what to change? the codes below only need x

from on-policy.

sinizu commented on July 26, 2024

but what to change? the codes below only need x
maybe:

def forward(self, x, a=None):

from on-policy.

Shenyiou commented on July 26, 2024

but what to change? the codes below only need x
maybe:
python '''
def forward(self, x, a=None):
'''

sorry, it doesn't work

Traceback (most recent call last):
File "/home/qss/syo/onpolicy/scripts/train/train_mpe.py", line 169, in
main(sys.argv[1:])
File "/home/qss/syo/onpolicy/scripts/train/train_mpe.py", line 154, in main
runner.run()
File "/home/qss/syo/onpolicy/runner/shared/mpe_runner.py", line 28, in run
values, actions, action_log_probs, rnn_states, rnn_states_critic, actions_env = self.collect(step)
File "/home/qss/.conda/envs/syo/lib/python3.6/site-packages/torch/autograd/grad_mode.py", line 15, in decorate_context
return func(*args, **kwargs)
File "/home/qss/syo/onpolicy/runner/shared/mpe_runner.py", line 121, in collect
raise NotImplementedError
NotImplementedError

from on-policy.

sinizu commented on July 26, 2024

on-policy/onpolicy/runner/shared/mpe_runner.py

Line 121 in 9677c91

raise NotImplementedError

The code above also needs to be changed

# raise NotImplementedError
actions_env = actions

from on-policy.

Shenyiou commented on July 26, 2024

@sinizu would you mind give me your emal please. there are still some problems i want to talk with you.

from on-policy.

chillybird commented on July 26, 2024

You can develop it just like discrete action implement. But i can not get satisfied result when i use the modified code.

from on-policy.

sinizu commented on July 26, 2024

@sinizu would you mind give me your emal please. there are still some problems i want to talk with you.

You can leave yours. I'll text you when I'm free

from on-policy.

Shenyiou commented on July 26, 2024

You can leave yours. I'll text you when I'm free

thank a lot! I've successfully changed it to continuous action space. however the result is slightly inferior compared with discrete version.

from on-policy.

kargarisaac commented on July 26, 2024

@Shenyiou
would you mind share your code or tell what you have changed? That would help a lot

from on-policy.

Recommend Projects

continuous action space about on-policy HOT 11 CLOSED

Comments (11)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent