First, I trained your PPO.
python examples/ppo_gym.py --env-name Ant-v2 --save-model-interval 100
After 500 episodes, I generated expert trajectories.
python gail/save_expert_traj.py --model-path assets/learned_models/Ant-v2_ppo.p
Finally, I ran GAIL.
python gail/gail_gym.py --env-name Ant-v2 --expert-traj-path assets/expert_traj/Ant-v2_expert_traj.p
I also implemented GAIL and VAIL, but they failed to train on Ant too (Hopper worked well, though).
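In case the expert data itself is the problem, here is a minimal sketch of a sanity check that could be run on the saved trajectories. It assumes (I have not confirmed this against the repo) that the trajectory data is a 2D array with one row per step, laid out as state followed by action. For Ant-v2 that would mean 111 observation dimensions plus 8 action dimensions per row. `summarize_expert_traj` is a hypothetical helper, not part of the repo, and the demo data below is made up.

```python
import math


def summarize_expert_traj(traj, state_dim):
    """Basic sanity statistics for rows laid out as [state..., action...].

    `traj` is any 2D sequence (list of lists or a NumPy array).
    The row layout is an assumption, not something confirmed from the repo.
    """
    rows = [[float(v) for v in row] for row in traj]
    if not rows:
        raise ValueError("trajectory is empty")
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("rows have inconsistent lengths")
    if width <= state_dim:
        raise ValueError("row width must exceed state_dim")

    states = [v for row in rows for v in row[:state_dim]]
    actions = [v for row in rows for v in row[state_dim:]]
    return {
        "num_steps": len(rows),
        "action_dim": width - state_dim,
        # NaN or inf in expert data would break discriminator training.
        "all_finite": all(math.isfinite(v) for v in states + actions),
        # Very large state values suggest a normalization mismatch.
        "state_abs_max": max(abs(v) for v in states),
        "action_abs_max": max(abs(v) for v in actions),
    }


if __name__ == "__main__":
    # Made-up stand-in data with Ant-v2 dimensions (111 obs + 8 actions).
    fake_traj = [[0.1] * 111 + [0.5] * 8 for _ in range(4)]
    print(summarize_expert_traj(fake_traj, state_dim=111))
```

If the real file reported a different action dimension, non-finite values, or much larger state magnitudes than the Hopper expert file, that would point at the expert data rather than at GAIL itself.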
Any ideas?