A basic training example using GGML #707
bssrdf
started this conversation in
Show and tell
Replies: 1 comment 1 reply
-
Nice job! Thank you for sharing If you have some feedback of what could be improved - please let us know. The training capabilities in |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, I just want to share what I have been working on recently. This is an example of training a MNIST VAE. The goal is to use only
ggml
pipeline and its implementation of ADAM optimizer.There aren't many training examples using
ggml
.The only one I found is baby-llama. But I think its way of doing opmization is not quite right.Found another training example inllama.cpp
which shows a proper way of using Adam.Some of the mods I have to add
Below are some samples from the VAE trained on MNIST after each epoch (total 10 epochs).
| data:image/s3,"s3://crabby-images/f8048/f80480f3e54fe69742024f6cc5914a4105fed650" alt="mnist-sample-epoch_2"
| data:image/s3,"s3://crabby-images/eb00a/eb00a9b18697c58365cd54626aa7c14bad7d4510" alt="mnist-sample-epoch_4"
| data:image/s3,"s3://crabby-images/0a3d1/0a3d1faa9ea354e306e76e22709f481f3ac85654" alt="mnist-sample-epoch_6"
| data:image/s3,"s3://crabby-images/f4ffa/f4ffa9755561a1ea9cc3ed4107c6f2ae40b3c677" alt="mnist-sample-epoch_8"
| data:image/s3,"s3://crabby-images/d18a5/d18a5e6a42d69330589dc1c670514d8d0d7f6e5d" alt="mnist-sample-epoch_10"
Beta Was this translation helpful? Give feedback.
All reactions