个人信息
参与实验室科研项目
人机混合智能系统双层智能测试评估技术研究
复杂环境下非完全信息博弈决策的智能基础模型研究
研究课题
基于指导水平的人在环深度强化学习方法研究
学术成果
共撰写/参与撰写专利 1 项,录用/发表论文 1 篇,投出待录用论文0篇。 联培学生可能有其他不在此展示的论文/专利。
patent
-
眼科病人信息录入软件V1.0
康宇,
田霞,
董凯,
鲁理,
夏睿钰,
赵云波,
刘斌琨,
and 李晓蒙
2023
[pdf]
Conference Articles
-
Uncertainty-Based Dynamic Weighted Experience Replay for Human-in-the-Loop Deep Reinforcement Learning
Xia Tian,
Yu Kang,
Yun-Bo Zhao ,
Ya-Qing Zhou,
and Peng-Fei Li
In
2024
[Abs]
[pdf]
Human-in-the-loop reinforcement learning (HIRL) enhances sampling efficiency in deep reinforcement learning by incorporating human expertise and experience into the training process. However, HIRL methods still heavily depend on expert guidance, which is a key factor limiting their further development and largescale application. In this paper, an uncertainty-based dynamic weighted experience replay approach (UDWER) is proposed to solve the above problem. Our approach enables the algorithm to detect decision uncertainty, triggering human intervention only when uncertainty exceeds a threshold. This reduces the need for continuous human supervision. Additionally, we design a dynamic experience replay mechanism that prioritizes machine self-exploration and humanguided samples with different weights based on decision uncertainty. We also provide a theoretical derivation and related discussion. Experiments in the Lunar Lander environment demonstrate improved sampling efficiency and reduced reliance on human guidance.
博客文章