Speaker
Ouraman Hajizadeh
Description
Exploratory study of training a Gomoku-Agent (generalization of tic tac
toe) using pure Deep Reinforcement Learning. Different training
approaches and neural network architectures are studied. The performance
of the resulting agents is compared to tree search based competitors of
the Gomocup.