Comments (11)
I found in previous commits that output used to be, but later due to log_likelihood_cost refactor got deleted.
output = tf.nn.softmax(tf.reshape(h_conv_final, [-1, go.N ** 2]) + b_conv_final)
from mugo.
Hm. Sorry about that - work on this repo is continuing at https://github.com/tensorflow/minigo. I'll update the README.md
from mugo.
from mugo.
@CezCz yeah, thanks for your kind answer,
actually, I fixed the line 88 with
" log_likelihood_cost = tf.reduce_sum(tf.nn.softmax_cross_entropy_with_logits(logits=logits, labels=y))" and it could work,
but I could not understand the output of mcts, why it often choose the bigger value even if it is negative ?
I am confused about the result , I would appreciate if you can tell me the reason @CezCz
from mugo.
from mugo.
just like the first picture , the location of white is R4, and I get the value is -7.5, just as the second image,
the another position is Q3,
and its value is 8.5,
and why do the white chose the R4 rather than Q3, the latter value is greater than the former value , I am just very confused about this, perhaps I do not understand the code ,or maybe this is a silly question , but I want strongly to know the reason and I am very grateful to you @CezCz , you are a very kind person and thank you very much !
from mugo.
from mugo.
Hi , @CezCz
So the next move is chosen just because the algorithm chooses the most visited move , and the value network backpropagated the visit count and the winner predicted, the positive value represents the current player wins this game , the next move is selected is not related to the value of value network, just related to the visit count , right ?
from mugo.
@JoeyQWu
The move that is chosen to be played in the actual game yes. Not to confuse with move chosen within selection phase - this one is chosen based on some sophisticated heuristic with exploration taken into consideration.
You may want to read:
https://jeffbradberry.com/posts/2015/09/intro-to-monte-carlo-tree-search/ - nice mcts introduction with examples
http://www.baeldung.com/java-monte-carlo-tree-search - simple monte carlo tree search implementation
https://deepmind.com/documents/119/agz_unformatted_nature.pdf - page 25-27 MCTS implementation within alphago zero (don't be confused about temperature parameter and parent visit count, these are just another parameters to promote exploration during training, but the core is visit count)
from mugo.
@CezCz
okay, I will read more to understand , thank you very much, you are so nice , very grateful to you for your help!
from mugo.
I also wrote http://www.moderndescartes.com/essays/deep_dive_mcts/ recently
from mugo.
Related Issues (20)
- After a long period of training, what will the training data accuracy incline? HOT 2
- About how to play MuGo HOT 8
- Group ID overflowed HOT 2
- error message "Key RL_glocal_step not found in checkpoint" HOT 2
- tensorflow.python.framework.errors_impl.NotFoundError: Key RL_global_step not found in checkpoint HOT 2
- SelfPlay Illegal Move HOT 4
- Error when start training HOT 8
- Error when prepossessing Chinese SGF HOT 1
- same game again and again HOT 1
- I want to let it train in itself.
- Error while prerunning it HOT 1
- IllegalMove HOT 1
- RuntimeError: dictionary changed size during iteration HOT 3
- NameError: name 'read_file' is not defined HOT 1
- About how to fix argument problem. HOT 1
- AttributeError: 'PolicyNetwork' object has no attribute 'output' HOT 3
- could you help me to load MuGo in gtp gui like Sabaki? HOT 1
- max rollout depth exceeded! import copy rasied it ?
- SelfPlay Illegal Move
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mugo.