1 FLAIR , University of Oxford 2 WhiRL , University of Oxford 3 DARK , University College London
Existing benchmarks for open-ended learning are either too slow or too simple. Craftax is both fast and complicated. We hope that this will allow researchers without access to industrial compute to investigate learning in an open-ended environment with an ease that was not previously possible.
Progress in reinforcement learning (RL) algorithms is driven in large part by the development and adoption of suitable benchmarks. In the effort towards increasingly general agents, there has arisen a community focused on benchmarks that exhibit more open-ended dynamics , in the form of procedural world generation, skill acquisition and reuse, long term dependencies and continual learning. This has motivated the development of environments like MALMO (Minecraft), The NetHack Learning Environment , MiniHack