STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

Editor: Nweon (映维) | Category: XR | November 7, 2023

Note: We do not have the ability to review papers.

PubDate: Oct 2023

Teams: Beijing Institute of Technology; Tsinghua University

Writers: Weipu Zhang, Gang Wang, Jian Sun, Yetian Yuan, Gao Huang

PDF: STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

Abstract

Recently, model-based reinforcement learning algorithms have demonstrated remarkable efficacy in visual input environments. These approaches begin by constructing a parameterized simulation world model of the real environment through self-supervised learning. By leveraging the imagination of the world model, the agent’s policy is enhanced without the constraints of sampling from the real environment. The performance of these algorithms heavily relies on the sequence modeling and generation capabilities of the world model. However, constructing a perfectly accurate model of a complex unknown environment is nearly impossible. Discrepancies between the model and reality may cause the agent to pursue virtual goals, resulting in subpar performance in the real environment. Introducing random noise into model-based reinforcement learning has been proven beneficial. In this work, we introduce Stochastic Transformer-based wORld Model (STORM), an efficient world model architecture that combines the strong sequence modeling and generation capabilities of Transformers with the stochastic nature of variational autoencoders. STORM achieves a mean human performance of 126.7% on the Atari 100k benchmark, setting a new record among state-of-the-art methods that do not employ lookahead search techniques. Moreover, training an agent with 1.85 hours of real-time interaction experience on a single NVIDIA GeForce RTX 3090 graphics card requires only 4.3 hours, showcasing improved efficiency compared to previous methodologies.
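
The headline numbers in the abstract can be sanity-checked with a little arithmetic. The sketch below rests on assumptions that this page does not state: the usual Atari 100k conventions (100,000 agent steps, a frame skip of 4, 60 frames per second of emulated play) and the standard human-normalized score, (agent - random) / (human - random), as the meaning of "mean human performance". The function names and the per-game scores are hypothetical placeholders, not results from the paper.

```python
def interaction_hours(agent_steps=100_000, frame_skip=4, fps=60):
    """Real-time play equivalent of the interaction budget, in hours.

    Defaults are assumed benchmark conventions, not values from this page.
    """
    frames = agent_steps * frame_skip
    return frames / fps / 3600


def human_normalized_score(agent, random, human):
    """Score as a percentage of the gap between random and human play."""
    return 100.0 * (agent - random) / (human - random)


def mean_human_normalized(scores):
    """Average the per-game normalized scores.

    `scores` maps a game name to an (agent, random, human) tuple.
    """
    per_game = [human_normalized_score(a, r, h) for a, r, h in scores.values()]
    return sum(per_game) / len(per_game)


# Hypothetical scores for two made-up games, for illustration only.
example_scores = {
    "game_a": (30.0, 10.0, 20.0),  # agent beats the human reference
    "game_b": (15.0, 5.0, 25.0),   # agent is halfway between random and human
}

hours = interaction_hours()
print(f"interaction budget: {hours:.2f} h")
print(f"training time / interaction time: {4.3 / hours:.2f}")
print(f"mean human-normalized score: {mean_human_normalized(example_scores):.1f}%")
```

Under these assumptions the interaction budget works out to about 1.85 hours, which matches the figure in the abstract, and the reported 4.3 hours of training is a little over twice that. A mean above 100% indicates that, averaged across games, the agent exceeds the human reference, although individual games can still fall well below it.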

Article link: https://paper.nweon.com/14902
