Tag AlphaGo

Posts: 3

AlphaGo Zero learns on its own

The news dropped that Google's new version of AlphaGo, called AlphaGo Zero, was able to learn completely on its own. No training set of human games was used. Instead, it built its own training set by playing games against itself.

Earlier versions of AlphaGo were taught to play the game using two methods. In the first, called supervised learning, researchers fed the program 100,000 top amateur Go games and taught it to imitate what it saw. In the second, called reinforcement learning, they had the program play itself and learn from the results.

AlphaGo Zero skipped the first step. The program began as a blank slate, knowing only the rules of Go, and played games against itself. At first, it placed stones randomly on the board. Over time it got better at evaluating board positions and identifying advantageous moves. It also learned many of the canonical elements of Go strategy and discovered new strategies all its own. via Quanta Magazine
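To get a feel for what "blank slate plus self-play" means, here is a toy sketch in Python. To be clear, this is not how AlphaGo Zero works: the real system pairs a deep neural network with tree search, while this is just a lookup table learning a tiny game of Nim (one pile, take 1 to 3 stones, whoever takes the last stone wins). The game choice, the function names, and all the parameter values are my own assumptions for illustration.

```python
import random
from collections import defaultdict


def legal_moves(stones):
    """Toy Nim (an assumed stand-in for Go): remove 1, 2, or 3 stones per turn."""
    return [m for m in (1, 2, 3) if m <= stones]


def self_play_train(max_stones=10, episodes=50_000, epsilon=0.2, lr=0.02, seed=0):
    """Learn move values purely from games the program plays against itself.

    Hypothetical helper: a tabular value estimate, not a neural network.
    """
    rng = random.Random(seed)
    # (stones, move) -> estimated result for the player making that move.
    # Everything starts at 0.0: the "blank slate".
    q = defaultdict(float)
    for _ in range(episodes):
        stones = rng.randint(1, max_stones)  # random start so every pile size is seen
        history = []
        while stones > 0:
            moves = legal_moves(stones)
            if rng.random() < epsilon:
                move = rng.choice(moves)  # explore: try a random move
            else:
                move = max(moves, key=lambda m: q[(stones, m)])  # use what was learned
            history.append((stones, move))
            stones -= move
        # The player who took the last stone won (+1); the other side lost (-1).
        # Walk the game backwards, flipping the sign each turn.
        outcome = 1.0
        for key in reversed(history):
            q[key] += lr * (outcome - q[key])
            outcome = -outcome
    return q


def best_move(q, stones):
    """Pick the move with the highest learned value."""
    return max(legal_moves(stones), key=lambda m: q[(stones, m)])


if __name__ == "__main__":
    q = self_play_train()
    for stones in (5, 6, 7, 9, 10):
        print(stones, "stones -> take", best_move(q, stones))
```

Nobody tells the program the known trick for this game, which is to always leave your opponent a multiple of 4 stones. With these settings it should find that on its own just from wins and losses, which is the same idea on a much, much smaller scale.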

Imagine taking this deep learning technology and running it on the quantum computer Google is developing. Amazing times we are living in.


Where Computers Defeat Us and Where They Don't

After AlphaGo won 4 to 1 against Lee Sedol, the world is abuzz with excitement and fear of AI. We're still a long way from the "rise of the machines," but we are getting closer.

AlphaGo does use simulations and traditional search algorithms to help it decide on some moves, but its real breakthrough is its ability to overcome Polanyi’s Paradox. It did this by figuring out winning strategies for itself, both by example and from experience. The examples came from huge libraries of Go matches between top players amassed over the game’s 2,500-year history. To understand the strategies that led to victory in these games, the system made use of an approach known as deep learning, which has demonstrated remarkable abilities to tease out patterns and understand what’s important in large pools of information. via NY Times

Polanyi's Paradox says that we humans know more than we can tell: much of what we know is tacit knowledge that we can't spell out as rules. For the longest time AI couldn't work that way. It could only do what someone had written down for it, relying on hard rules and brute-force computation. AlphaGo changed all that.

For the first time ever, AI is breaking through the paradox, which is cool and scary at the same time. Once AI can figure out how to reprogram itself or make better machines than we can design, we'll be in trouble.


AlphaGo vs Lee Sedol

There are lots of great videos on YouTube of the match between AlphaGo and Lee Sedol. I consider these historic because this level of AI seems almost humanlike in the way it thinks about the game. Wikipedia has a great summary of the first 4 games. After those four, the score is AlphaGo 3 wins, Lee 1 win.

Game 1

Game 2

Game 3

Game 4

Game 5
