Fix bug in preconditioned KISS-GP / Hadamard Multitask GPs #2090
Conversation
Previously in PyTorch, calling `requires_grad_(True)` on an int/long matrix was a no-op. At some point in the PyTorch releases, this call started throwing an error (since only floating point tensors can have gradients). Our implementation of pivoted Cholesky previously called `matrix_arg.requires_grad_(True)` on all LazyTensor/LinearOperator arguments, without first checking whether `matrix_arg` has a floating point dtype. KISS-GP makes use of InterpolatedLazyTensor (to become InterpolatedLinearOperator), which contains integer matrices (the index matrices for interpolation). This therefore produced an error when KISS-GP was used in conjunction with the pivoted Cholesky preconditioner. A similar bug exists for preconditioned Hadamard Multitask GPs. (The reason this bug went undetected is that our tests for KISS-GP models and multitask models all use small datasets (N < 100), while preconditioners are not used until N > 2000 or so.) [Fixes #2056]
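To illustrate the failure mode, a minimal sketch (the tensor name is made up, and the error text is paraphrased from recent PyTorch versions):

```python
import torch

index_matrix = torch.tensor([[0, 1], [2, 3]])  # integer index matrix, as used for KISS-GP interpolation

# On older PyTorch versions this was a no-op; on recent versions it raises a RuntimeError
# ("only Tensors of floating point dtype can require gradients"):
# index_matrix.requires_grad_(True)

# Guarding on the dtype avoids the error:
if index_matrix.is_floating_point():
    index_matrix.requires_grad_(True)
```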
@@ -58,8 +58,8 @@
     "metadata": {},
     "outputs": [],
     "source": [
-    "train_x1 = torch.rand(50)\n",
-    "train_x2 = torch.rand(50)\n",
+    "train_x1 = torch.rand(2000)\n",
By how much does this increase runtime? Should we be concerned here if this is run as part of the unit tests?
This change (plus the ones in the other two examples) adds at most one second to the test suite - I just checked.
@@ -98,7 +98,11 @@ def backward(ctx, grad_output, _):
        with torch.enable_grad():
            # Create a new set of matrix args that we can backpropagate through
-           matrix_args = [matrix_arg.detach().requires_grad_(True) for matrix_arg in _matrix_args]
+           matrix_args = []
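The rest of this hunk is truncated above; a sketch of what the guarded loop presumably looks like, based on the PR description rather than the verbatim diff:

```python
matrix_args = []
for matrix_arg in _matrix_args:
    if torch.is_floating_point(matrix_arg):
        # Floating point tensors can carry gradients: detach and re-enable autograd
        matrix_args.append(matrix_arg.detach().requires_grad_(True))
    else:
        # Integer tensors (e.g. interpolation index matrices) cannot require gradients;
        # pass them through unchanged
        matrix_args.append(matrix_arg)
```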
I would say we could also catch the error, so that we don't have to adjust the code here if there are any future changes. But then raising errors is unfortunately quite slow in PyTorch...
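For reference, the catch-the-error alternative mentioned here would look roughly like this (a hypothetical sketch, not code from the PR):

```python
try:
    matrix_args.append(matrix_arg.detach().requires_grad_(True))
except RuntimeError:
    # requires_grad_ fails for non-floating-point tensors; fall back to the original tensor
    matrix_args.append(matrix_arg)
```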