|
个人信息Personal Information
教授 博士生导师 硕士生导师
任职 : Intelligence & Robotics 副主编、山东省自动化学会常务理事、威海市机电与自动化学会副理事长
性别:男
毕业院校:山东大学
学历:研究生(博士)毕业
学位:工学博士学位
在职信息:在职
所在单位:低空科学与工程学院
入职时间:2001-07-01
学科:控制理论与控制工程
办公地点:知行楼北楼605B
电子邮箱:
Goal-Conditioned Reinforcement Learning With Adaptive Intrinsic Curiosity and Universal Value Network Fitting for Robotic Manipulation
点击次数:
所属单位:低空科学与工程学院
论文名称:Goal-Conditioned Reinforcement Learning With Adaptive Intrinsic Curiosity and Universal Value Network Fitting for Robotic Manipulation
发表刊物:IEEE Transactions on Industrial Informatics
关键字:Adaptive intrinsic curiosity (AIC), hindsight experience replay (HER), robotic manipulation, universal value network fitting (UVNF)
摘要:Hindsight experience replay (HER) has greatly
increased the possibility of using deep reinforcement learning
(DRL) for robotic manipulation with sparse rewards.
However, there are still concerns about low learning efficiency
and poor performance due to its insufficient exploration
ability and bias against the initial goal introduced
by HER. In this article, to solve this problem, a multigoal
robotic manipulation DRL method based on adaptive intrinsic
curiosity and universal value network fitting (AICUVNF)
is proposed to further improve the exploration ability
and learning performance. Specifically, this method utilizes
an improved curiosity mechanism to construct a joint intrinsic
reward and adaptively adjust the proportion, which
can enhance exploration ability and avoid excessive pursuit
of novel states. In addition, a universal value network fitting
approach is proposed to incorporate the initial goal into
the value function fitting process, which employs the value
of the initial goal to eliminate the bias of HER in the algorithm
update. Combined with the off-policy soft actor-critic
method, AIC-UVNF is verified on multigoal robotic manipulation
tasks. The results show that the proposed method
achieves better convergence efficiency and learning performance.
第一作者:Zihao Sun
通讯作者:Xianfeng Yuan,Yong Song*
全部作者:Qiangyang Xu,Bao Pang,Rui Song,Yibin Li
论文类型:应用研究
论文编号:1747462659626520577
学科门类:工学
卷号:21
期号:3
页面范围:1-15
字数:10
是否译文:否
发表时间:2025-03
收录刊物:SCI、SCI
发布时间:2024-12-27
