Comments (2)
ObsNormEnv is suitable for ordinary image observations, such as those in the Atari and Procgen environments.
In your case, it is important to unify the data range across the different state vectors, which helps keep the gradient scale smooth. You already know the data range, so there are two basic methods you can try:
- min-max method: normalize your state into the 0-1 range with (x - min) / (max - min)
- discretization method: divide the whole range into several discrete bins (for example, 0-40 can be divided into 8 bins of width 5). If a value falls into bin 3 (i.e. 10-15), it is transformed into the one-hot vector [0,0,1,0,0,0,0,0]. This method is effective when fine-grained state information is not necessary.
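Both methods can be sketched in a few lines of NumPy. This is only an illustration, not DI-engine code: the function names `min_max_normalize` and `one_hot_bin` are hypothetical, and clipping out-of-range values to the known bounds is an assumption added here.

```python
import numpy as np


def min_max_normalize(x, low, high):
    """Scale x from the known range [low, high] into [0, 1].

    Clipping values outside the range is an assumption of this sketch.
    """
    x = np.asarray(x, dtype=np.float32)
    low = np.asarray(low, dtype=np.float32)
    high = np.asarray(high, dtype=np.float32)
    return np.clip((x - low) / (high - low), 0.0, 1.0)


def one_hot_bin(value, low, high, num_bins):
    """Map a scalar in [low, high] to a one-hot vector over equal-width bins."""
    width = (high - low) / num_bins
    idx = int((value - low) // width)
    # Clamp so that value == high (or out-of-range input) lands in a valid bin.
    idx = min(max(idx, 0), num_bins - 1)
    one_hot = np.zeros(num_bins, dtype=np.float32)
    one_hot[idx] = 1.0
    return one_hot
```

With the range from the example above, `one_hot_bin(12.0, 0.0, 40.0, 8)` sets index 2 (the third bin, 10-15), and `min_max_normalize([0, 20, 40], 0, 40)` gives `[0.0, 0.5, 1.0]`. The per-dimension `low` and `high` can be arrays, so state vectors with different ranges are normalized in one call.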
from di-engine.
Okay, thank you for your advice.