Abstract:
This paper introduces some perceptron algorithms and their variations, gives various pseudo-codes, pionts out advantage among algorithms. It gives mistake bound’s theorems of perceptrons algorithm in linearly separable and unlinearly separable situation. It studies their mistake bounds and works out their bounds. It shows their various applications in the online optimization, reinforcement learning and bandit algorithm, and discusses the open problems.
Key words:
perceptron,
mistake bound,
bandit algorithm,
reinforcement learning
摘要: 介绍感知器学习算法及其变种,给出各种感知器算法的伪代码,指出各种算法的优点。给出感知器算法在线性可分和线性不可分情况下的误差界定理,讨论各种感知器学习算法的误差界理论,给出各种算法的误差界。介绍感知器学习算法在在线优化场景、强化学习场景和赌博机算法中的应用,并对未解决的问题进行讨论。
关键词:
感知器,
错误界,
赌博机算法,
强化学习
CLC Number:
LIU Jian-wei; SHEN Fang-lin; LUO Xiong-lin. Reserch on Perceptron Learning Algorithm[J]. Computer Engineering, 2010, 36(7): 190-192.
刘建伟;申芳林;罗雄麟. 感知器学习算法研究[J]. 计算机工程, 2010, 36(7): 190-192.