This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Fix bug with lazy data loading, un-implement __len__ on AllennlpLazyDataset #4328

Merged
epwalsh merged 6 commits from the lazy-data-loading branch into allenai:master on Jun 5, 2020

Conversation

epwalsh (Member) commented on Jun 5, 2020

This fixes a bug where using a lazy dataset reader produced a PyTorch warning on every single batch during training. The fix makes AllennlpLazyDataset consistent with torch.utils.data.IterableDataset by no longer implementing __len__() (previously, len() would just return 1).

[Screenshot: the PyTorch warning emitted on every batch]
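
For context, here is a minimal standalone sketch of how a hard-coded __len__ on an IterableDataset triggers the per-batch warning (this is not part of the PR; TinyLazyDataset and the numbers are made up). Once len() has been queried on the DataLoader, PyTorch remembers the reported dataset length and warns whenever it has already yielded more data than that length accounts for:

from torch.utils.data import DataLoader, IterableDataset


class TinyLazyDataset(IterableDataset):
    def __iter__(self):
        # Pretend to lazily stream 100 instances.
        yield from range(100)

    def __len__(self):
        # Mimics the old AllennlpLazyDataset behavior of always reporting 1.
        return 1


loader = DataLoader(TinyLazyDataset(), batch_size=10)
print(len(loader))  # a trainer-style len() query records the bogus length
for batch in loader:
    pass  # PyTorch warns on each batch beyond the reported length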

To reproduce the bug, run allennlp train on an experiment with a lazy dataset reader, for example with this Jsonnet config:

{
  "dataset_reader": {
    "type": "squad",
    "lazy": true,
    "token_indexers": {
      "tokens": {
        "type": "single_id",
        "lowercase_tokens": true
      },
      "token_characters": {
        "type": "characters",
        "character_tokenizer": {
          "byte_encoding": "utf-8",
          "start_tokens": [259],
          "end_tokens": [260]
        },
        "min_padding_length": 5
      }
    }
  },
  "train_data_path": "https://allennlp.s3.amazonaws.com/datasets/squad/squad-train-v1.1.json",
  "validation_data_path": "https://allennlp.s3.amazonaws.com/datasets/squad/squad-dev-v1.1.json",
  "model": {
    "type": "bidaf",
    "text_field_embedder": {
      "token_embedders": {
        "tokens": {
          "type": "embedding",
          "pretrained_file": "https://allennlp.s3.amazonaws.com/datasets/glove/glove.6B.100d.txt.gz",
          "embedding_dim": 100,
          "trainable": false
        },
        "token_characters": {
          "type": "character_encoding",
          "embedding": {
            "num_embeddings": 262,
            "embedding_dim": 16
          },
          "encoder": {
            "type": "cnn",
            "embedding_dim": 16,
            "num_filters": 100,
            "ngram_filter_sizes": [5]
          },
          "dropout": 0.2
        }
      }
    },
    "num_highway_layers": 2,
    "phrase_layer": {
      "type": "lstm",
      "bidirectional": true,
      "input_size": 200,
      "hidden_size": 100,
      "num_layers": 1
    },
    "matrix_attention": {
      "type": "linear",
      "combination": "x,y,x*y",
      "tensor_1_dim": 200,
      "tensor_2_dim": 200
    },
    "modeling_layer": {
      "type": "lstm",
      "bidirectional": true,
      "input_size": 800,
      "hidden_size": 100,
      "num_layers": 2,
      "dropout": 0.2,
    },
    "span_end_encoder": {
      "type": "lstm",
      "bidirectional": true,
      "input_size": 1400,
      "hidden_size": 100,
      "num_layers": 1,
    },
    "dropout": 0.2,
  },
  "data_loader": {
    "batch_size": 40,
    "num_workers": 1,
  },
  "trainer": {
    "num_epochs": 20,
    "grad_norm": 5.0,
    // "patience": 10,
    "validation_metric": "+em",
    "cuda_device": 0,
    "learning_rate_scheduler": {
      "type": "reduce_on_plateau",
      "factor": 0.5,
      "mode": "max",
      "patience": 2,
    },
    "optimizer": {
      "type": "adam",
      "betas": [0.9, 0.9],
    },
  },
}
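
For reference, with the config above saved as, say, bidaf_lazy.jsonnet, training would be started with something like allennlp train bidaf_lazy.jsonnet -s /tmp/bidaf_lazy (the config filename and serialization directory here are just illustrative).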

@epwalsh epwalsh requested a review from matt-gardner June 5, 2020 20:02
@@ -490,7 +490,7 @@ def run(self) -> Dict[str, Any]:
         return self.trainer.train()

     def finish(self, metrics: Dict[str, Any]):
-        if self.evaluation_data_loader and self.evaluate_on_test:
+        if self.evaluation_data_loader is not None and self.evaluate_on_test:
epwalsh (Member, Author) commented:

Need this so len() is not called.

@@ -361,7 +361,7 @@ def __init__(
         self.optimizer = optimizer

         if patience is None:  # no early stopping
-            if validation_data_loader:
+            if validation_data_loader is not None:
epwalsh (Member, Author) commented:

Need this so len() is not called.
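
Both checks above were switched to explicit "is not None" comparisons for the same reason: Python's truth test falls back to __len__ when __bool__ is not defined, and a data loader wrapping a lazy dataset can no longer answer len(). A rough illustration (not from the PR), using a plain torch DataLoader and a made-up LengthlessDataset as stand-ins for AllenNLP's own classes:

from torch.utils.data import DataLoader, IterableDataset


class LengthlessDataset(IterableDataset):
    # No __len__, just like AllennlpLazyDataset after this PR.
    def __iter__(self):
        yield from range(3)


loader = DataLoader(LengthlessDataset(), batch_size=2)

try:
    if loader:  # bool(loader) falls back to loader.__len__(), which needs len(dataset)
        pass
except TypeError as err:
    print("truthiness check fails:", err)

if loader is not None:  # identity check never touches __len__
    print("explicit None check is safe")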

epwalsh (Member, Author) commented on Jun 5, 2020

By the way, I tracked the __len__ implementation back to #3700 (comment). I think the rationale for adding it on lazy datasets is now outdated.

epwalsh (Member, Author) commented on Jun 5, 2020

I think this should close #4035

@epwalsh epwalsh linked an issue Jun 5, 2020 that may be closed by this pull request
@epwalsh epwalsh requested a review from matt-gardner June 5, 2020 21:04
@epwalsh epwalsh merged commit 902d36a into allenai:master Jun 5, 2020
@epwalsh epwalsh deleted the lazy-data-loading branch June 5, 2020 21:47
@@ -138,7 +138,9 @@ def get_values(self):
                 self.batch_num_total_epoch_end[-1] / (len(self.batch_num_total_epoch_end) - 1)
             )
         else:
-            actual_num_steps_per_epoch = max(self.num_steps_per_epoch, self.last_batch_num_total)
+            actual_num_steps_per_epoch = max(
+                self.num_steps_per_epoch or 1, self.last_batch_num_total
+            )
A reviewer (Member) commented:

Why does it default to 1 here? Isn't that a problem when it happens?

epwalsh (Member, Author) replied:


This gives the same behavior as before. I'm not sure whether that's an issue.
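
As a tiny illustration (not from the PR; the values below are made up), this is what the or-1 guard does when num_steps_per_epoch is unavailable, presumably because a lazy loader can no longer report a length:

num_steps_per_epoch = None  # unknown when the data loader is lazy
last_batch_num_total = 137  # running count of batches seen so far
actual_num_steps_per_epoch = max(num_steps_per_epoch or 1, last_batch_num_total)
print(actual_num_steps_per_epoch)  # 137: falls back to the observed batch count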

Successfully merging this pull request may close these issues.

Cases where the length of the IterableDataset is not overridden properly