Comments (6)
Hi @cynthiatliu, sorry but I don't totally understand your setup. Do you mean that the action is either a discrete choice (1 out of N), or a two-dimensional continuous number?
You will probably need to implement a new action type, similar to how they are defined right now:
- https://github.com/rllab/rllab/blob/master/rllab/spaces/discrete.py
- https://github.com/rllab/rllab/blob/master/rllab/spaces/box.py
- https://github.com/rllab/rllab/blob/master/rllab/spaces/product.py
Your scenario is more like a union type than a product type. Since you need to be able to convert to and from a vector representation (see flatten and unflatten), one possible scheme is to use the first entry in the vector to denote whether the action is a discrete choice or a 2D vector. The subsequent entries then encode the actual choice.
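As a rough illustration of the scheme described above, here is a minimal sketch in plain NumPy. The class name UnionAction, the ("discrete", i) / ("continuous", vec) tuple format, and the one-hot payload are all assumptions made for this example; this is not an existing rllab class, and a real implementation would subclass rllab's Space and also handle batches.

```python
import numpy as np


class UnionAction:
    """Hypothetical union space: one of n discrete choices OR a 2D vector."""

    def __init__(self, n):
        self.n = n  # number of discrete choices

    @property
    def flat_dim(self):
        # One type flag, plus room for the larger of the two payloads.
        return 1 + max(self.n, 2)

    def flatten(self, action):
        kind, value = action
        flat = np.zeros(self.flat_dim)
        if kind == "discrete":
            flat[0] = 0.0           # flag 0 -> discrete choice
            flat[1 + value] = 1.0   # one-hot encoding of the choice
        elif kind == "continuous":
            flat[0] = 1.0           # flag 1 -> 2D continuous vector
            flat[1:3] = value
        else:
            raise ValueError("unknown action kind: %r" % (kind,))
        return flat

    def unflatten(self, flat):
        if flat[0] < 0.5:
            return ("discrete", int(np.argmax(flat[1:1 + self.n])))
        return ("continuous", np.array(flat[1:3]))


space = UnionAction(n=4)
flat = space.flatten(("discrete", 2))
restored = space.unflatten(flat)
```

The payload slots are shared between the two cases here, so the flag in the first entry is the only thing that tells unflatten how to read the rest of the vector.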
from rllab.
Yes, that's what I mean. And thanks! I'll work on it.
from rllab.
Hi,
For this action space, how should the method "new_tensor_variables" be overridden? Thanks,
from rllab.
It depends on your representation. If it's either a discrete choice or a 1D vector, you can use the same code as the Box type: https://github.com/rllab/rllab/blob/master/rllab/spaces/box.py#L71
from rllab.
It's either a discrete or a 2D vector.
Additionally, how should the actor distinguish between the types of actions? Part of the actor's action is to choose what type of action, as you know--does that create an extra dimension?
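The thread does not record how this was resolved, so the following is only one possible reading of the question: yes, the actor can emit one extra output that decides the action type, alongside the outputs for each type. The sketch below assumes a hypothetical policy head that produces a vector of length 1 + n + 2 (a type logit, n choice logits, and a 2D mean); the function name sample_union_action, the output layout, and the fixed std are all illustrative assumptions, not rllab API.

```python
import numpy as np


def sample_union_action(outputs, n, rng, std=0.1):
    """Sample an action from an assumed layout: [type logit | n logits | 2D mean]."""
    type_logit = outputs[0]
    choice_logits = outputs[1:1 + n]
    mean = outputs[1 + n:1 + n + 2]

    # The extra dimension: probability of picking the continuous branch.
    p_continuous = 1.0 / (1.0 + np.exp(-type_logit))
    if rng.random() < p_continuous:
        # Gaussian sample around the 2D mean.
        return ("continuous", mean + std * rng.standard_normal(2))

    # Softmax over the discrete choices (shifted for numerical stability).
    probs = np.exp(choice_logits - np.max(choice_logits))
    probs = probs / probs.sum()
    return ("discrete", int(rng.choice(n, p=probs)))


rng = np.random.default_rng(0)
outputs = np.array([50.0, 0.0, 0.0, 0.0, 0.0, 0.3, -0.7])
action = sample_union_action(outputs, n=4, rng=rng, std=0.0)
```

With this layout the type choice is sampled first and only the matching branch is read, so the unused outputs have no effect on the sampled action.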
from rllab.
Solved the problems
from rllab.