From 8c809917c2d952fd61b2e0614a0483f5b9735757 Mon Sep 17 00:00:00 2001 From: Vivian Wu Date: Tue, 9 Apr 2024 18:43:41 +0000 Subject: [PATCH] Add public user guide for serving Gemma on GKE with JetStream --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index a48eee6c..5659a35a 100644 --- a/README.md +++ b/README.md @@ -8,6 +8,7 @@ JetStream is a fast library for LLM inference and serving on TPUs (and GPUs in f - [Online Inference with MaxText on v5e Cloud TPU VM](#jetstream-maxtext-inference-on-v5e-cloud-tpu-vm-user-guide) - [JetStream Standalone Local Setup](#jetstream-standalone-local-setup) +- [Serve Gemma using TPUs on GKE with JetStream](https://cloud.google.com/kubernetes-engine/docs/tutorials/serve-gemma-tpu-jetstream) ---