Skip to yearly menu bar Skip to main content


Poster

ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation

Nantian He · Shaohui Li · Zhi Li · Yu LIU · You He


Abstract:

The diffusion model has demonstrated impressive performance in offline reinforcement learning. However, non-deterministic sampling in diffusion models can lead to unstable performance. Furthermore, the lack of confidence measurements makes it difficult to evaluate the reliability and trustworthiness of the sampled actions. To address these issues, we present ReDiffuser, which utilizes confidence estimation to ensure reliable decision-making. We achieve this by learning a confidence function based on Random Network Distillation. The confidence function measures the reliability of sampled actions and contributes to quantitative recognition of reliable actions. Additionally, we integrate the confidence function into task-specific sampling procedures to realize adaptive-horizon planning and value-embedded planning. Experiments show that the proposed ReDiffuser achieves state-of-the-art performance on standard offline RL datasets.

Live content is unavailable. Log in and register to view live content