This project focuses on building and training a convolutional neural network (CNN) to classify images from the CIFAR-10 dataset. The CIFAR-10 dataset consists of 60,000 32x32 color images in 10 classes, with 6,000 images per class. The classes are: airplane, automobile, bird, cat, deer, dog, frog, horse, ship, and truck.
The project is implemented using PyTorch and includes the following steps:
- Loading and preprocessing the CIFAR-10 dataset.
- Building a CNN model.
- Training the model with and without data augmentation.
- Hyperparameter tuning using Optuna.
- Evaluating the model on the test set.
You can veiw and run this code on Kagle through this link
While the current model achieves 76% accuracy on CIFAR-10, there are several avenues for further development:
- Model Architecture: Experiment with deeper networks like ResNet or DenseNet.
- Data Augmentation: Add morre techniques like random cropping, and rotation .
- Hyperparameter Tuning: Run the optuna study for more trials on the augmented dataset instead of the original dataset.