dim2r May 15 2021 at 08:28

RL — Trust Region Policy Optimization (TRPO) Explained. (Часть 1)

6 min

4.4K

Machine learning *

Recovery Mode

Translation

+1

Comments

There are no comments yet, you can be the first one!

Sign up to leave a comment.