marathon-envs

Marathon Man

This repository explores the application of deep reinforcement learning for physics-based animation. It contains a set of high-dimensional continuous control benchmarks using Unity’s native physics simulator, PhysX. The environments can be trained using Unity ML-Agents or any OpenAI Gym compatible algorithm. This project may be useful for:

Video Game researchers interested in apply bleeding-edge robotics research into the domain of locomotion and AI for video games.
Academic researchers looking to leverage the strengths of Unity and ML-Agents along with the body of existing research and benchmarks provided by projects such as the DeepMind Control Suite, or OpenAI Mujoco environments.

The Unity project has two parts. Both can be find in UnitySDK > Assets:

In folder MarathonEnvs there are several benchmarks of physics-based animation, implemented on the basis of different papers in the field. More details on the environments can be found here and instructions on how to train them here

MarathonEnvs

In folder MarathonController there are resources to take a skinned character, with a typical controller like mecanim or motion matching, and generate from it a training environment. Further details can be found here

Example-current-status,

There are also instructions to export the outcome of the training here

1. Getting started

Check the installation instructions
Make sure you can train an existing environment.
If you want to adapt it to your own characters, or explore the creation of novel controllers, we recommend you to start with the instructions in the here
You can also follow the 2021 SIGGRAPH course on physics-based character animation based on this project.

If you have further questions, feel free to join our Discord server

2. Contributors

v4.0 was created by:

Joe Booth (@Sohojoe), Twitter - @iAmVidyaGamer
Joan Llobera, at the Artanim Foundation
Valérie Juillard, a colleague from the Artanim Foundation has provided some of the animations.

v3.0 was created by:

Joe Booth (@Sohojoe), Twitter - @iAmVidyaGamer
Vladimir Ivanov (@vivanov879)

Note: This project is the result of contributions from members of the Unity community (see below) who actively maintain the repository. As such, the contents of this repository are not officially supported by Unity Technologies.

3. Open issues

Currently, our main challenge is that results still look like if they came with this department of silly walks effect, (something that obviously does not appear in the demos of the papers). It is annoying, and we absolutely need to solve it if we want to have something that can be used in practice.

Weird Walks

4. Publications

SIGGRAPH 2021 Course based on the benchmarks in this repository (coming soon)
Technical Paper (v3.0, 2020): Realistic Physics Based Character Controller
AAAI 2019 Workshop on Games and Simulations for Artificial Intelligence: Marathon Environments: Multi-Agent Continuous Control Benchmarks in a Modern Video Game Engine
An early version of this work was presented March 19th, 2018 at the AI Summit - Game Developer Conference 2018
Legacy Tutorial: Getting Started With MarathonEnvs.This is a legacy tutorial from an older version of MarathonEnvs.

5. Licensing

All the project is under Apache License Version 2.0, January 2004 http://www.apache.org/licenses/LICENSE-2.0 , with the single exception of the motion data for the quadruped is adapted under the available under the terms of the Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license, as stated in their README,

6. References

DReCon: data-driven responsive control of physics-based characters Insperation for ControllerMarathonMan environment.
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills Insperation for Style Transfer environments.
OpenAI.Gym Mujoco implementation. Good reference for enviroment setup, reward functions and termination functions.
PyBullet pybullet_envs - a bit harder than MuJoCo gym environments but with an open source simulator. Pre-trained environments in stable-baselines zoo.
DeepMind Control Suite - Set of continuous control tasks.
DeepMind paper Emergence of Locomotion Behaviours in Rich Environments and video- see page 13 b.2 for detail of reward functions
MuJoCo homepage.
A good primer on the differences between physics engines is ‘Physics simulation engines have traditional made tradeoffs between performance’ and it’s accompanying video.
MuJoCo Unity Plugin MuJoCo’s Unity plugin which uses socket to comunicate between MuJoCo (for running the physics simulation and control) and Unity (for rendering).

Document last updated: 11.05.2021