Algorithm of TD(λ) Based on Factored Representation


Computer Engineering, 2009, Vol. 35, Issue 13: 190-192. doi: 10.3969/j.issn.1000-3428.2009.13.066

• Artificial Intelligence and Recognition Technology •


DAI Shuai, YIN Chang-ming, ZHANG Xin

(School of Computer & Communication Engineering, Changsha University of Science & Technology, Changsha 410076)

Received:1900-01-01 Revised:1900-01-01 Online:2009-07-05 Published:2009-07-05


Abstract

This paper proposes a TD(λ) algorithm based on a factored representation. The main idea is to represent states in factored form, to use Dynamic Bayesian Networks (DBNs) to represent the state transition probabilities (the conditional probability distributions) of a Markov Decision Process (MDP), and to represent the state value function in TD(λ) with a decision tree. Together these reduce the cost of searching the state space and the computational complexity. The algorithm is therefore suited to MDPs with large state spaces. Experiments demonstrate that the representation is effective.

Key words: factored representation, Dynamic Bayesian Networks (DBNs), decision tree, TD(λ) algorithm
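The combination described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the three binary state factors, the DBN structure and probabilities, the reward function, and the hand-built decision tree are all hypothetical, chosen only to show how a DBN samples the next state factor by factor and how TD(λ) can update one value per tree leaf instead of one value per state.

```python
import random

# Hypothetical toy DBN over three binary state factors (x0, x1, x2).
# Each entry: factor index -> (parent indices, P(next value = 1 | parent values)).
DBN = {
    0: ((0,), {(0,): 0.2, (1,): 0.9}),
    1: ((0, 1), {(0, 0): 0.1, (0, 1): 0.1, (1, 0): 0.3, (1, 1): 0.8}),
    2: ((), {(): 0.5}),  # x2 has no parents and influences nothing
}


def step(state, rng):
    """Sample the next state one factor at a time from the DBN."""
    nxt = []
    for i in range(len(state)):
        parents, cpt = DBN[i]
        p_one = cpt[tuple(state[j] for j in parents)]
        nxt.append(1 if rng.random() < p_one else 0)
    return tuple(nxt)


def reward(state):
    """Assumed reward: 1 when both x0 and x1 are set, else 0."""
    return 1.0 if state[0] == 1 and state[1] == 1 else 0.0


def leaf(state):
    """Hand-built decision tree mapping a factored state to a leaf index.

    The 8 raw states collapse to 3 leaves because x2 never matters and
    x1 only matters when x0 == 1.
    """
    if state[0] == 0:
        return 0
    return 1 if state[1] == 0 else 2


def td_lambda(steps=100000, alpha=0.01, gamma=0.9, lam=0.8, seed=0):
    """TD(lambda) with accumulating traces, one value per tree leaf."""
    rng = random.Random(seed)
    values = [0.0, 0.0, 0.0]
    traces = [0.0, 0.0, 0.0]
    state = (0, 0, 0)
    for _ in range(steps):
        nxt = step(state, rng)
        # TD error computed on leaf values rather than raw states.
        delta = reward(state) + gamma * values[leaf(nxt)] - values[leaf(state)]
        traces = [gamma * lam * e for e in traces]
        traces[leaf(state)] += 1.0
        values = [v + alpha * delta * e for v, e in zip(values, traces)]
        state = nxt
    return values


if __name__ == "__main__":
    print(td_lambda())
```

In this toy problem the tree stores three values instead of eight, which is the kind of saving the abstract attributes to the factored representation. The paper may well construct its tree differently, for example by learning the splits rather than fixing them by hand.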


CLC Number: TP18

DAI Shuai; YIN Chang-ming; ZHANG Xin. Algorithm of TD(λ) Based on Factored Representation [J]. Computer Engineering, 2009, 35(13): 190-192.



URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2009.13.066

http://www.ecice06.com/EN/Y2009/V35/I13/190

