Computer Vision project focused on detecting smoke and fire in wild environments. The Google Vision Transformer was fine-tuned on a custom dataset.
Updated Jun 5, 2023 - HTML
Presentation on the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale".
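The "16x16 words" in the title refers to cutting an image into fixed-size patches that the transformer treats as tokens. The sketch below is not taken from the presentation; it only illustrates the patch arithmetic in plain Python. The function names `vit_patch_shape` and `split_into_patches` are hypothetical, and the single-channel nested-list image is an assumption made to keep the example dependency-free.

```python
def vit_patch_shape(height, width, channels=3, patch=16):
    """Return (num_patches, flattened_patch_dim) for a ViT-style patch split."""
    if height % patch or width % patch:
        raise ValueError("image sides must be divisible by the patch size")
    num_patches = (height // patch) * (width // patch)
    patch_dim = patch * patch * channels
    return num_patches, patch_dim


def split_into_patches(image, patch):
    """Split a single-channel image (list of rows) into flattened patches.

    Patches are returned in row-major order: left to right, then top to bottom.
    """
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image sides must be divisible by the patch size")
    patches = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            patches.append([
                image[row][col]
                for row in range(top, top + patch)
                for col in range(left, left + patch)
            ])
    return patches


# A 224x224 RGB image with 16x16 patches yields 196 tokens of 768 values each.
print(vit_patch_shape(224, 224))
```

In an actual Vision Transformer each flattened patch is then passed through a learned linear projection and combined with a position embedding before entering the encoder.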
FoodVision: A project that scans food items and returns nutritional details, suggested recipes tailored to the detected food, and nearby culinary inspirations.
This is a demo website that provides a YouTube video.
This project is an image classification model built using PyTorch and pretrained networks from torchvision. It allows training, validation, testing, and making predictions on a dataset of images, leveraging a feedforward classifier with a frozen pretrained network.