Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN Outspeeds Data Augmentation by Smaller Sample - Application in Gomoku Reinforcement Learning
Suen, C-H. & Alonso, E. ORCID: 0000-0002-3306-695X (2023). Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN Outspeeds Data Augmentation by Smaller Sample - Application in Gomoku Reinforcement Learning. In: Proceedings of the AISB Convention 2023. AISB Convention 2023, 13-14 April 2023, Swansea, UK.
Abstract
To replace data augmentation, this paper proposed a method called SLAP to intensify experience to speed up machine learning and reduce the sample size. SLAP is a model-independent protocol/function to produce the same output given different transformation variants. SLAP improved the convergence speed of convolutional neural network learning by 83% in the experiments with Gomoku game states, with only one eighth of the sample size compared with data augmentation. In reinforcement learning for Gomoku, using AlphaGo Zero/AlphaZero algorithm with data augmentation as baseline, SLAP reduced the number of training samples by a factor of 8 and achieved similar winning rate against the same evaluator, but it was not yet evident that it could speed up reinforcement learning. The benefits should at least apply to domains that are invariant to symmetry or certain transformations. As future work, SLAP may aid more explainable learning and transfer learning for domains that are not invariant to symmetry, as a small step towards artificial general intelligence.
Publication Type: | Conference or Workshop Item (Paper) |
---|---|
Subjects: | G Geography. Anthropology. Recreation > GV Recreation Leisure Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Departments: | School of Science & Technology > Computer Science |
Download (416kB) | Preview
Export
Downloads
Downloads per month over past year