Giter Club home page Giter Club logo

Comments (5)

akashvelu avatar akashvelu commented on July 26, 2024 1

Thanks for the questions! We do evaluate QMIX with a concatenation of the default env global state and all agents' local observations. We do this to study if this helps the performance of QMIX, and to create a fair comparison with MAPPO (AS) which concatenates the global env state with the local agent observation and does not prune features (and hence is a fair comparison with QMIX(MG)). This is seen in Figure 14 in the appendix of our paper. We notice that QMIX (MG) still underperforms MAPPO(AS) on several maps, and is comparable to MAPPO(AS) on the others.

It is important to note that MAPPO's performance with the FP and AS states are very similar; FP significantly outperforms on only 1 map (3s5zvs3s6z) - this is seen in Table 1. Our primary purpose with conducting FP experiments was that reducing the critic's input dimensionality by dropping redundant features can further improve MAPPO's performance - however, this improvement is marginal. You're right that we could have evaluated QMIX by pruning redundant features, but we felt that because we already compared QMIX(MG) with MAPPO(AS), these experiments wouldn't yield many new conclusions.

from on-policy.

Henry668 avatar Henry668 commented on July 26, 2024

And could you please provide the code of QMix(MG)? Thanks!

from on-policy.

Henry668 avatar Henry668 commented on July 26, 2024

In my opinion, the pruning operation itself is a form of prior human knowledge. However, only MAPPO can take advantage of this knowledge under the CTDE paradigm, because the value function in QMix also serves as the policy in execution, and it will raise the inconsistent issue if you use the Feature-Pruned Agent-Specific Global States only in training and use the local observations in execution.
Is that right?

from on-policy.

Henry668 avatar Henry668 commented on July 26, 2024

Thanks for the reply! Another question: Does share_obs in line 34 in smac_runner.py represents FP states?

from on-policy.

Henry668 avatar Henry668 commented on July 26, 2024

I see. Once again thanks for your reply!

from on-policy.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.