Add Trueskill Evaluation for New AIs #21

vwxyzjn · 2021-10-30T22:57:15Z

This PR allows the users to evaluate on newer AIs.

Bootstrap trueskills for the initial league

poetry install
cd experiments
poetry run python new_league.py --evals agent_sota.pt randomBiasedAI workerRushAI lightRushAI

=======================
             name         mu     sigma  trueskill
0   agent_sota.pt  39.300444  2.273100  32.481145
1    workerRushAI  31.810887  2.335282  24.805039
2     lightRushAI  25.776148  2.388974  18.609226
3  randomBiasedAI  11.527615  3.700987   0.424655

Add trueskills evals for additional agents

poetry run python new_league.py --evals coacAI

             name         mu     sigma  trueskill
0   agent_sota.pt  39.300444  2.273100  32.481145
1          coacAI  36.132655  1.769652  30.823699
2    workerRushAI  31.810887  2.335282  24.805039
3     lightRushAI  25.776148  2.388974  18.609226
4  randomBiasedAI  11.527615  3.700987   0.424655

Notice how the initial league's (i.e. reference agents) trueskill rating is always fixed, which corresponds to OpenAI Five and Alphastar's methods.

Evaluate Trueskill during training

Finally, there is support for evaluating Trueskill during training. This is achieved by using subprocess to call the league evaluation.

cd experiments
rm -rf dataset.db
rm -rf dataset.csv
poetry run python new_league.py --evals randomBiasedAI workerRushAI lightRushAI coacAI
poetry run python ppo_gridnet.py --prod-mode

vwxyzjn · 2021-11-03T19:41:49Z

Synced with @kered9 and @xluox. Merging now.

vwxyzjn added 7 commits October 30, 2021 17:50

use sql lite

8b126c0

refacor

5fb86a4

refactor

b7e2bfb

make it work

d2d4f20

test

dd05289

refactor and support for eval during training

d9f6c19

quick fix

9e06155

vwxyzjn merged commit b54ac3d into master Nov 3, 2021

vwxyzjn deleted the new-league branch November 3, 2021 19:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Trueskill Evaluation for New AIs #21

Add Trueskill Evaluation for New AIs #21

vwxyzjn commented Oct 30, 2021 •

edited

Loading

vwxyzjn commented Nov 3, 2021

Add Trueskill Evaluation for New AIs #21

Add Trueskill Evaluation for New AIs #21

Conversation

vwxyzjn commented Oct 30, 2021 • edited Loading

Bootstrap trueskills for the initial league

Add trueskills evals for additional agents

Evaluate Trueskill during training

vwxyzjn commented Nov 3, 2021

vwxyzjn commented Oct 30, 2021 •

edited

Loading