文献详情 - Gdtheory理论粤军网|广东智库信息化平台

文献详细_{Journal detailed}

基于多步回溯Q学习的自动发电控制指令动态优化分配算法
Multi-step backtrack Q-learning based dynamic optimal algorithm for auto generation control order dispatch

下载全文在线阅读

收藏

作　　者： ; ; ; ; ;

出　　处： 《控制理论与应用》 2011年第1期58-64,共7页

摘　　要： 单步Q学习在火电占优、机组时延较大的自动发电控制(AGC)功率指令动态优化分配中的应用表现出收敛速度慢等不足而影响最优策略的获取.具有多步预见能力的多步回溯Q学习(Q(λ))显式利用资格迹进行高效回溯操作,能够有效解决火电机组大时滞环节带来的延时回报问题,算法平均收敛时间较Q学习缩短50%以上.算法奖励函数引入调节费用一项,形成多目标动态最优控制.两区域模型及南方电网模型仿真研究分析显示,Q(λ)算法在随机、大负荷扰动的复杂系统环境中有效提高系统控制性能标准(CPS)控制品质和适应性,并且在保证CPS合格率的前提下,使AGC调节费用下降超过5%. This paper presents the application of multi-step backtrack Q （λ） learning-based methodology on CPS order dynamic dispatch problem. The proposed Q（λ） learning can effectively solve the long time-delay assessment for the action strategy of one step Q-learning in the thermal dominated power system. AGC production cost is formulated as Markov decision process（MDP） reward function by means of linear weighted aggregative approach in the CPS order multi- objective dynamic optimal dispatch. Simulation of institute of electrical and electronics engineers（IEEE） two-area LFC model shows that the convergence time of the Q（λ） algorithm is reduced by more than 50% comparing with Q-learning. The statistical experiments of Q（λ） in the China Southern Power grid show that the proposed method can effectively enhance the robustness and dynamic performance of AGC systems in CPS assessment and save more than 5% of AGC oroduction cost while the CPS compliances are ensured.

关键词： 学习自动发电控制控制性能标准随机最优调节费用

领　　域： [电气工程]

基于多步回溯Q学习的自动发电控制指令动态优化分配算法
Multi-step backtrack Q-learning based dynamic optimal algorithm for auto generation control order dispatch

参考文献更多+

二级参考文献更多+

引证文献更多+

二级引证文献更多+

同被引文献更多+

耦合作品文献更多+

相关文献更多+

相关作者

相关机构对象

相关领域作者

基于多步回溯Q学习的自动发电控制指令动态优化分配算法 Multi-step backtrack Q-learning based dynamic optimal algorithm for auto generation control order dispatch

参考文献 更多+

二级参考文献 更多+

引证文献 更多+

二级引证文献 更多+

同被引文献 更多+

耦合作品文献 更多+

相关文献 更多+

相关作者

相关机构对象

相关领域作者

基于多步回溯Q学习的自动发电控制指令动态优化分配算法
Multi-step backtrack Q-learning based dynamic optimal algorithm for auto generation control order dispatch

参考文献更多+

二级参考文献更多+

引证文献更多+

二级引证文献更多+

同被引文献更多+

耦合作品文献更多+

相关文献更多+