Skip to content

A simple, minimal implementation of a GPT model using PyTorch

License

Notifications You must be signed in to change notification settings

mandal13/toyGPT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

toyGPT

A minimal implementation of a GPT model using PyTorch, designed for educational purposes. toyGPT helps learners understand the nuts and bolts of Generative Pre-trained Transformer (GPT) models by providing a clean, easy-to-follow implementation.


Features

  • Simple Implementation: A clear and minimal codebase to focus on the core ideas.
  • Educational Focus: Ideal for learners exploring GPT model architecture and functionality.
  • PyTorch-Based: Built using PyTorch, a popular framework for deep learning.

Key Concepts Covered

  • Tokenization and Embedding
  • Self-Attention Mechanism
  • Transformer Architecture
  • Positional Encoding
  • Training and Fine-Tuning

About

A simple, minimal implementation of a GPT model using PyTorch

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages