Comments (5)
Thanks for the questions! We do evaluate QMIX with a concatenation of the default env global state and all agents' local observations, denoted QMIX(MG). We do this to study whether it improves QMIX's performance, and to create a fair comparison with MAPPO(AS), which concatenates the global env state with the local agent observation and does not prune features. These results are shown in Figure 14 in the appendix of our paper. We notice that QMIX(MG) still underperforms MAPPO(AS) on several maps and is comparable to MAPPO(AS) on the others.
It is important to note that MAPPO's performance with the FP and AS states is very similar; FP significantly outperforms AS on only one map (3s5z_vs_3s6z), as seen in Table 1. Our primary purpose in conducting the FP experiments was to test whether reducing the critic's input dimensionality by dropping redundant features can further improve MAPPO's performance; however, this improvement is marginal. You're right that we could have evaluated QMIX with redundant features pruned, but we felt that because we already compared QMIX(MG) with MAPPO(AS), these experiments wouldn't yield many new conclusions.
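To make the two state constructions discussed above concrete, here is a minimal NumPy sketch. The function names, array shapes, and toy sizes are assumptions chosen for illustration and are not taken from the on-policy repository; FP pruning is not shown because which features count as redundant depends on the environment.

```python
import numpy as np

def qmix_mg_state(global_state, local_obs):
    """MG-style input (hypothetical helper): env global state
    concatenated with every agent's local observation."""
    return np.concatenate([global_state, local_obs.reshape(-1)])

def mappo_as_states(global_state, local_obs):
    """AS-style input (hypothetical helper): for each agent, env global
    state concatenated with that agent's own local observation."""
    n_agents = local_obs.shape[0]
    tiled = np.tile(global_state, (n_agents, 1))  # one copy of the state per agent
    return np.concatenate([tiled, local_obs], axis=1)

# Toy sizes, chosen only for illustration.
n_agents, state_dim, obs_dim = 3, 5, 4
global_state = np.arange(state_dim, dtype=np.float32)
local_obs = np.ones((n_agents, obs_dim), dtype=np.float32)

mg = qmix_mg_state(global_state, local_obs)
as_states = mappo_as_states(global_state, local_obs)
print(mg.shape)         # (17,)  = state_dim + n_agents * obs_dim
print(as_states.shape)  # (3, 9) = (n_agents, state_dim + obs_dim)
```

In this sketch the MG input is a single vector shared by all agents, while the AS input has one row per agent.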
from on-policy.
Could you please provide the code for QMIX(MG)? Thanks!
In my opinion, the pruning operation itself is a form of prior human knowledge. However, only MAPPO can take advantage of this knowledge under the CTDE paradigm: because the value function in QMIX also serves as the policy during execution, using the Feature-Pruned Agent-Specific Global States only in training and the local observations in execution would create an inconsistency.
Is that right?
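The concern can be illustrated with a toy sketch. Everything here is hypothetical: linear "networks" with random weights stand in for the real models, and the sizes are arbitrary. It only shows that a separate critic may take a different input from the actor, whereas a Q-function that doubles as the policy must accept whatever is available at execution time.

```python
import numpy as np

rng = np.random.default_rng(0)
obs_dim, fp_dim, n_actions = 4, 7, 3  # toy sizes, illustrative only

# Actor-critic style: separate actor and critic, so their inputs may differ.
actor_w = rng.normal(size=(obs_dim, n_actions))  # used in training and execution
critic_w = rng.normal(size=(fp_dim, 1))          # used in training only

# Value-based style: the Q-function is also the execution-time policy.
# Hypothetically assume it was trained on FP states.
q_w = rng.normal(size=(fp_dim, n_actions))

local_obs = rng.normal(size=obs_dim)
fp_state = rng.normal(size=fp_dim)

action = int(np.argmax(local_obs @ actor_w))  # decentralized execution works
value = (fp_state @ critic_w).item()          # centralized training signal

try:
    np.argmax(local_obs @ q_w)                # execution with local obs only
    q_usable_at_execution = True
except ValueError:
    q_usable_at_execution = False             # input size mismatch

print(q_usable_at_execution)  # False
```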
Thanks for the reply! Another question: does share_obs at line 34 of smac_runner.py represent FP states?
I see. Once again thanks for your reply!
Related Issues (20)
- Shape of buffered log_probs
- Action mask! HOT 1
- Cannot resolve import "wandb" HOT 1
- Why share_obs in shared/mpe_runner.py just repeats obs when self.use_centralized_V set to true HOT 1
- Does the state of the grid world need to be normalized? HOT 1
- shared_buffer.py is an outdated file
- Help needed with FP and AS states!! HOT 1
- Cannot reproduce MPE simple_speaker_listener HOT 2
- Can't the mat algorithm be called yet? HOT 1
- MAPPO simple spread not converging HOT 2
- Runtime Error: Tensors on Different Devices (cuda:0 and cpu) HOT 1
- Why is HAPPO's policy file identical to MAPPO's?
- Model performance issue HOT 3
- How do I save a replay?
- question about reply buffer size in MAPPO
- missing "onpolicy.runner.separated.hanabi_runner_forward" HOT 1
- NAN HOT 2
- When I tried to train the code for smacv2, I encountered this error message——AssertionError: check recurrent policy! HOT 1
- Error when run ./train_mpe_spread.sh HOT 2
- Model... HOT 1