Better league convergence criterion #37

vwxyzjn · 2022-01-18T03:13:55Z

The current league evaluation is a binary search with a maximum number of iteration n. However, a better convergence criterion is to test if the agent's sigma has reached a certain threshold. This PR will continue to run the league evaluation until the sigma has gone down to 1.4.

Additionally, if the binary search early converges to an index, we would expand the the search area just a bit to continue the search until the sigma has gone down to 1.4.

Better league convergence criteria

8e21bd9

vwxyzjn merged commit e26dd98 into master Jan 19, 2022

vwxyzjn deleted the league-improvement branch January 19, 2022 15:32

vwxyzjn mentioned this pull request Jan 22, 2022

Binary search bug with the new_league.py #41

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better league convergence criterion #37

Better league convergence criterion #37

vwxyzjn commented Jan 18, 2022

Better league convergence criterion #37

Better league convergence criterion #37

Conversation

vwxyzjn commented Jan 18, 2022