VPG

Pre-release
@cpnota released this 31 May 21:47
· 228 commits to master since this release
021f0a0

This release contains two small changes:

  1. Renamed REINFORCE to VPG to stay consistent with other libraries. VPG can now also average gradients over multiple episodes, which drastically improves performance in some cases.
  2. Tweaked A2C to align better with other implementations. In particular, a new, more accurate n-step buffer was added. There are also some small changes to ensure that feature gradients are computed correctly.
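
For readers unfamiliar with the quantity an n-step buffer produces, the sketch below computes n-step returns for a single trajectory in plain Python. It is only an illustration of the general technique, not the buffer added in this release: the function name `n_step_returns` and all of its parameters are hypothetical, and the release's actual implementation may differ.

```python
def n_step_returns(rewards, values, dones, bootstrap_value, n, gamma):
    """Illustrative n-step returns (hypothetical helper, not the library's API).

    rewards[t]      : reward received after acting in state s_t
    values[t]       : critic estimate V(s_t)
    dones[t]        : True if the episode ended after step t
    bootstrap_value : critic estimate for the state following the last step
    """
    T = len(rewards)
    # next_values[k] is V(s_{k+1}), the value of the state reached after step k.
    next_values = list(values[1:]) + [bootstrap_value]
    returns = []
    for t in range(T):
        g, discount = 0.0, 1.0
        last = min(t + n, T)  # sum at most n rewards, never past the trajectory
        for k in range(t, last):
            g += discount * rewards[k]
            discount *= gamma
            if dones[k]:
                # Episode ended: no bootstrapping past a terminal state.
                break
        else:
            # No terminal state was hit: bootstrap from the critic's estimate
            # of the state reached after the last summed reward.
            g += discount * next_values[last - 1]
        returns.append(g)
    return returns


rewards = [1.0, 1.0, 1.0]
values = [0.5, 0.5, 0.5]
dones = [False, False, True]
print(n_step_returns(rewards, values, dones, bootstrap_value=10.0, n=2, gamma=0.5))
```

In this sketch the bootstrap term is skipped whenever a terminal step falls inside the n-step window, which is one common source of inaccuracy when it is handled incorrectly.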