First working implementation of ANN which is trained using GameResults.

The PassFilter simply outputs BlackWins and WhiteWins (Range 0 - 1 but not presently clamped).
In principle, this type of feedforward ANN can be used to decide whether a PASS will result in blackwins or whitewins at any stage.
The goal is for the network to learn that passing while losing when valid moves exist is bad, but passing while winning is relatively harmless later in the game.
This commit is contained in:
2012-11-17 18:40:31 -05:00
parent d9d6ecda80
commit aca8320600
37 changed files with 1040 additions and 544 deletions

View File

@@ -0,0 +1,9 @@
Cumulative results for 3 games (BLACK=ROOT_PAR, WHITE=UCT)
1. W+26.5
2. B+1.5
3. W+16.5
Cumulative results for 3 games (BLACK=UCT, WHITE=ROOT_PAR)
1. W+4.5
2. B+25.5
3. B+2.5
Elapsed Time: 301.403 seconds.