Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
-
Updated
May 7, 2023 - Python
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
Image to LaTeX (Seq2seq + Attention with Beam Search) - Tensorflow
Pytorch implemention of Deep CNN Encoder + LSTM Decoder with Attention for Image to Latex
Image captioning ready-to-go inference: show and tell model compatible with Tensorflow r1.9
Repo for Implementing Research Papers & Projects related to Machine Learning
ImageCaptioning improved with an attention mechanism. Also a PyQt5 application
Some interesting applications of RNN, e.g. char rnn (pomes generation), seq2seq (machine translation), image captioning (NIC)
An implementation of the paper "Context-aware Captions from Context-agnostic Supervision"
AI Poet who looks at the images and writes poems Web service.
Here are all my code files of Advanced AI/ML architectures built from scratch using Pytorch.
VisionVerse is a versatile tool that integrates image captioning, pre-trained image classification, and text-to-image generation.
ImgCap is an image captioning model designed to automatically generate descriptive captions for images. It has two versions CNN + LSTM model and CNN + LSTM + Attention mechanism model.
Pre-Trained CNN Architecture for Indonesian Image Captioning using Transformer.
BLIP-ImageCaption
Add a description, image, and links to the imagecaptioning topic page so that developers can more easily learn about it.
To associate your repository with the imagecaptioning topic, visit your repo's landing page and select "manage topics."