
Introduce smaller and more efficient NN architecture #56

Merged: 4 commits merged into master from smallnn on Feb 2, 2022

Conversation

@vwxyzjn (Collaborator) commented Jan 31, 2022

This PR introduces a smaller and more efficient NN architecture. Namely, it replaces the existing

    def __init__(self, envs, mapsize=16 * 16):
        super(Agent, self).__init__()
        self.mapsize = mapsize
        h, w, c = envs.observation_space.shape
        # Four conv + pool stages; each MaxPool2d (stride 2) halves the spatial
        # size, so a 16x16 map shrinks 16 -> 8 -> 4 -> 2 -> 1 with 256 channels.
        self.encoder = nn.Sequential(
            Transpose((0, 3, 1, 2)),  # HWC -> CHW for Conv2d
            layer_init(nn.Conv2d(c, 32, kernel_size=3, padding=1)),
            nn.MaxPool2d(3, stride=2, padding=1),
            nn.ReLU(),
            layer_init(nn.Conv2d(32, 64, kernel_size=3, padding=1)),
            nn.MaxPool2d(3, stride=2, padding=1),
            nn.ReLU(),
            layer_init(nn.Conv2d(64, 128, kernel_size=3, padding=1)),
            nn.MaxPool2d(3, stride=2, padding=1),
            nn.ReLU(),
            layer_init(nn.Conv2d(128, 256, kernel_size=3, padding=1)),
            nn.MaxPool2d(3, stride=2, padding=1),
        )

        # Four transposed-conv stages upsample 1x1 back to 16x16, yielding
        # 78 action logits per map cell.
        self.actor = nn.Sequential(
            layer_init(nn.ConvTranspose2d(256, 128, 3, stride=2, padding=1, output_padding=1)),
            nn.ReLU(),
            layer_init(nn.ConvTranspose2d(128, 64, 3, stride=2, padding=1, output_padding=1)),
            nn.ReLU(),
            layer_init(nn.ConvTranspose2d(64, 32, 3, stride=2, padding=1, output_padding=1)),
            nn.ReLU(),
            layer_init(nn.ConvTranspose2d(32, 78, 3, stride=2, padding=1, output_padding=1)),
            Transpose((0, 2, 3, 1)),  # CHW -> HWC
        )
        # Flattened encoder output: 256 channels * 1 * 1 = 256 features.
        self.critic = nn.Sequential(
            nn.Flatten(),
            layer_init(nn.Linear(256, 128)),
            nn.ReLU(),
            layer_init(nn.Linear(128, 1), std=1),
        )
        self.register_buffer("mask_value", torch.tensor(-1e8))

with the following

    def __init__(self, envs, mapsize=16 * 16):
        super(Agent, self).__init__()
        self.mapsize = mapsize
        h, w, c = envs.observation_space.shape
        # Only two conv + pool stages: 16 -> 8 -> 4, leaving a 64x4x4 feature map.
        self.encoder = nn.Sequential(
            Transpose((0, 3, 1, 2)),  # HWC -> CHW for Conv2d
            layer_init(nn.Conv2d(c, 32, kernel_size=3, padding=1)),
            nn.MaxPool2d(3, stride=2, padding=1),
            nn.ReLU(),
            layer_init(nn.Conv2d(32, 64, kernel_size=3, padding=1)),
            nn.MaxPool2d(3, stride=2, padding=1),
            nn.ReLU(),
        )

        # Two transposed-conv stages upsample 4x4 back to 16x16.
        self.actor = nn.Sequential(
            layer_init(nn.ConvTranspose2d(64, 32, 3, stride=2, padding=1, output_padding=1)),
            nn.ReLU(),
            layer_init(nn.ConvTranspose2d(32, 78, 3, stride=2, padding=1, output_padding=1)),
            Transpose((0, 2, 3, 1)),  # CHW -> HWC
        )
        # Flattened encoder output: 64 channels * 4 * 4 = 1024 features.
        self.critic = nn.Sequential(
            nn.Flatten(),
            layer_init(nn.Linear(64 * 4 * 4, 128)),
            nn.ReLU(),
            layer_init(nn.Linear(128, 1), std=1),
        )
        self.register_buffer("mask_value", torch.tensor(-1e8))
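
Both snippets rely on two small helpers, Transpose and layer_init, defined elsewhere in the training script. A minimal sketch of what they look like, following the usual CleanRL conventions:

    import numpy as np
    import torch
    import torch.nn as nn

    class Transpose(nn.Module):
        """Permutes tensor dimensions, e.g. NHWC <-> NCHW."""
        def __init__(self, permutation):
            super().__init__()
            self.permutation = permutation

        def forward(self, x):
            return x.permute(self.permutation)

    def layer_init(layer, std=np.sqrt(2), bias_const=0.0):
        # Orthogonal weight init with the given gain; constant bias init.
        torch.nn.init.orthogonal_(layer.weight, std)
        torch.nn.init.constant_(layer.bias, bias_const)
        return layer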

A preliminary experiment shows the smaller architecture can also produce a SOTA model while taking only about 16 hours to train (50M steps) and about 36 hours in total for all evaluations to finish.


@cpuheater

In contrast, the previous SOTA model (using the larger architecture) attains a slightly higher TrueSkill but takes about 109 hours.


Given this evidence, this PR makes the smaller model the default in the code base to save compute.
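
For a rough sense of the size difference, a quick sketch that counts encoder parameters. The channel count c = 27 is an assumption (the number of observation planes in gym-microrts), and count_params is a hypothetical helper:

    import torch.nn as nn

    def count_params(m: nn.Module) -> int:
        return sum(p.numel() for p in m.parameters())

    c = 27  # assumed number of observation planes

    large = nn.Sequential(
        nn.Conv2d(c, 32, 3, padding=1), nn.MaxPool2d(3, 2, 1), nn.ReLU(),
        nn.Conv2d(32, 64, 3, padding=1), nn.MaxPool2d(3, 2, 1), nn.ReLU(),
        nn.Conv2d(64, 128, 3, padding=1), nn.MaxPool2d(3, 2, 1), nn.ReLU(),
        nn.Conv2d(128, 256, 3, padding=1), nn.MaxPool2d(3, 2, 1),
    )
    small = nn.Sequential(
        nn.Conv2d(c, 32, 3, padding=1), nn.MaxPool2d(3, 2, 1), nn.ReLU(),
        nn.Conv2d(32, 64, 3, padding=1), nn.MaxPool2d(3, 2, 1), nn.ReLU(),
    )
    # With c=27: ~395k vs ~26k parameters; the small encoder drops the
    # two largest conv layers.
    print(count_params(large), count_params(small))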

@vwxyzjn vwxyzjn requested a review from kachayev January 31, 2022 20:02
@vwxyzjn vwxyzjn mentioned this pull request Jan 31, 2022
@kachayev (Contributor) left a comment

This makes a lot of sense to me; it is exactly the architecture I'm using for almost all of my experiments.

It would be interesting to see whether we can speed up (or stabilize) learning either by adding a reconstruction loss for the encoder or by forcing encoder outputs for subsequent steps to stay close to the current one (e.g., with InfoNCE). But that is a completely separate topic.
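
For illustration only (not part of this PR), a minimal sketch of such an InfoNCE-style objective, treating consecutive encoder outputs as positive pairs; info_nce is a hypothetical helper:

    import torch
    import torch.nn.functional as F

    def info_nce(z_t, z_next, temperature=0.1):
        # z_t, z_next: (B, D) encoder outputs for steps t and t+1.
        # Row i of z_t should match row i of z_next against all other rows.
        z_t = F.normalize(z_t, dim=1)
        z_next = F.normalize(z_next, dim=1)
        logits = z_t @ z_next.T / temperature  # (B, B) cosine similarities
        labels = torch.arange(z_t.shape[0], device=z_t.device)
        return F.cross_entropy(logits, labels)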

@vwxyzjn (Collaborator, Author) commented Feb 2, 2022

Awesome. Merging the PR now.

@vwxyzjn vwxyzjn merged commit 5e7be25 into master Feb 2, 2022
@vwxyzjn vwxyzjn deleted the smallnn branch February 2, 2022 04:07