Comments (6)
What you propose is a good solution and the way I would do it by default.
I would rather them not be aware of the other players score
What do you mean by this exactly? If you want the value prediction from the network to be made independently from the other player score, you can just remove this information in the GI.vectorize_state
function.
Another question: are future available actions dependent on both players' score? If not, you can remove both scores from the state and just use rewards to model score. If this info is relevant, there is no choice but to include it in the state.
from alphazero.jl.
Hm ok. Let me think on that.
Another question: are future available actions dependent on both players' score? If not, you can remove both scores from the state and just use rewards to model score. If this info is relevant, there is no choice but to include it in the state.
As for this kinda?
Each can make one of two moves lets say: Jump, wait, land
So each should only be able to make the moves Jump and wait, but then if they've jumped they should only be able to wait or land.
I also need to retain what the value of the map was when they jumped in order to calculate their score when they landed.
So how best would you handle the actions and the reward portion of it? Would you perhaps go through the previous moves and use that to determine if they had already jumped as well as use that to add up the values needed to determine which performed better?
from alphazero.jl.
By definition of a state, which future scenarios are possible should only depend on the current state of your game and never on the history of previous actions. (Technically, this is called the Markovian property of states.)
Therefore, in your case, the state should contain some info about whether or not the agent is currently "in the air" and waiting to land.
from alphazero.jl.
Hm ok.
Can you elaborate on a few things for me?
For Vectorize state
-Can you explain how this is used by alphazero?
-What would you do if you needed to vectorize a large amount of data?
from alphazero.jl.
vectorize_state
turns a state into a format that can be fed into a neural network (typically a tensor of floating point numbers).
What would you do if you needed to vectorize a large amount of data?
I do not understand your question. If your states are large and yield big tensors, then it is what it is. Either you can find more economical representations or you have to live with your system being memory-hungry and maybe stressing your hardware.
from alphazero.jl.
I am closing this issue for lack of activity but do not hesitate to reopen it if you need more help.
from alphazero.jl.
Related Issues (20)
- Training Stuck at 'Network Only against MinMax (depth 6)' after Modifying TicTacToe Board Size to 4x4 HOT 4
- [Feedback Requested] Wishlist for the next major release of AlphaZero.jl HOT 4
- Maximum number of iterations HOT 1
- Success stories HOT 2
- How to prevent GPU OOM? HOT 9
- What next after training an agent? HOT 2
- How to play a one-player game automatically and get the move sequence? HOT 1
- I can't install it to run training HOT 1
- LoadError: UndefVarError: #flatten not defined
- Any way to start when playing against AlphaZero? HOT 2
- AZ much worse than generic solution for simple game HOT 2
- When should I stop learning? HOT 2
- How many iterations are required?
- Julia 1.9?
- How important is loss during learning
- How to restart a training session that has completed HOT 3
- Not invoking alternate implementation of select_move HOT 1
- Success report and request for help HOT 3
- got Out of GPU memory when learning HOT 6
- Non-trivial games HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from alphazero.jl.