MuJoCo Hopper agent trained with Deep Deterministic Policy Gradient (DDPG)

Python 3.5
PyTorch 0.4.1
CUDA (the version depends on your GPU)
Simply run with python hopper.py
Training Return
The red line is the average return accumulated over the course of training.
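
As a rough illustration of how such a curve could be computed, the sketch below takes a list of per-episode returns and produces the cumulative mean after each episode. This assumes the red line is a plain running mean over all episodes so far; the function name `cumulative_average` and the sample values are hypothetical and are not taken from hopper.py.

```python
def cumulative_average(returns):
    """Return the mean of all returns seen so far, after each episode."""
    averages = []
    total = 0.0
    for episode, episode_return in enumerate(returns, start=1):
        total += episode_return
        averages.append(total / episode)
    return averages


# Hypothetical per-episode returns, for illustration only.
example_returns = [10.0, 20.0, 60.0]
print(cumulative_average(example_returns))  # [10.0, 15.0, 30.0]
```

A running mean like this smooths out the episode-to-episode noise typical of DDPG training, at the cost of reacting slowly to late improvements.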
