Photo: Storyblocks.com |
The architecture that beat humans at the notoriously CPU-impervious game Go, AlphaGo by Google Deep Mind, was converted to allow the machine to tackle other "closed-rules" games. Successively, the program was given the rules of chess, and a huge battery of Google's GPUs to train itself on the game. Within four hours, the alien emerged. And it is indeed a new class of player.
The AlphaZero neural network uses reinforcement learning to teach itself things from scratch. It does not rely on previous knowledge - which in the case of chess is surprising, as the mass of knowledge on the game accumulated in centuries of experimentation is hard to shrug off. Combined with a powerful search algorithm, the neural network is at present unbeatable. This was demonstrated in a 100-game match against the strongest chess program around, Stockfish 8.
What impressed me when I saw a few games from that match, which was concluded with 25 wins and 75 draws, no losses from Alpha zero, is that the machine can display an evolved treatment of openings, is keen to sacrifice material for positional gains, and has no prejudices. Indeed, while most chess machines around have pre-defined weights that discourage certain kinds of positions -say, putting your king in the center of the board when there's lots of pieces around potentially capable of threatening it is a no-no strategy, punished with negative weights that prevent chess engines from entertaining the thought- alpha zero knows no borders. Look at this position, e.g.:
It transpires that something has gone wrong for black - while its position is solid, it is left with a white-squared bishop that has no future, blocked as it is from its own central pawns. White, instead, has gotten rid of its own potentially similarly fated darksquared bishop, and enjoys more space. So what is the next move that white does here?
Read more...
Source: Science 2.0