This project builds a Retrieval-Augmented Generation (RAG) powered chatbot for summarizing and interacting with articles. The system accepts articles as PDFs or URLs, extracts the text, splits it into chunks, generates embeddings, and stores them in a vector database. The chatbot then answers user queries by retrieving relevant chunks and passing them to a Large Language Model (LLM) for context-aware responses.
The project is deployed on Streamlit Cloud, allowing users to interact with the chatbot via an easy-to-use interface.
- PDF/URL Upload: Upload a PDF file or provide a URL containing an article.
- Text Extraction: The app extracts text content from the uploaded document or webpage.
- Chunking & Embeddings: The extracted text is split into smaller chunks, and embeddings are generated for efficient search.
- Vector Database: The embeddings are stored in a vector database for fast retrieval.
- LLM Integration: Queries are answered by Groq-hosted LLMs (accessed through LangChain's ChatGroq integration) to generate contextually relevant answers.
- Conversation History: Users can continue the chat, as the app stores conversation history and context (see the sketch after this list).
- Changeable LLM Models: Users can switch between different LLM models for generating responses (see the model table and selector sketch below).
- Deployable on Streamlit: The app is deployed on Streamlit Cloud for easy access and interaction.
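For the conversation-history feature, a minimal Streamlit pattern might look like the sketch below. This is an illustration under assumed names (the `messages` key and widget labels are guesses), not the app's actual code:

```python
# Hedged sketch: per-session chat history via st.session_state.
# The key "messages" and the widget labels are assumptions.
import streamlit as st

if "messages" not in st.session_state:
    st.session_state.messages = []               # list of (role, text) pairs

for role, text in st.session_state.messages:     # replay earlier turns on rerun
    st.chat_message(role).write(text)

if prompt := st.chat_input("Ask about the article"):
    st.chat_message("user").write(prompt)
    st.session_state.messages.append(("user", prompt))
    reply = "..."                                # the RAG answer would go here
    st.chat_message("assistant").write(reply)
    st.session_state.messages.append(("assistant", reply))
```

The models available for switching, together with their Groq rate limits, are listed below: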
Model | Requests per Minute | Requests per Day | Tokens per Minute | Tokens per Day | Advantages | Disadvantages |
---|---|---|---|---|---|---|
deepseek-r1-distill-llama-70b | 30 | 1,000 | 6,000 | Unlimited | Unlimited token capacity, low latency | Limited daily requests |
gemma2-9b-it | 30 | 14,400 | 15,000 | 500,000 | High throughput, suitable for large-scale inference | Limited versatility |
llama-3.1-8b-instant | 30 | 14,400 | 20,000 | 500,000 | High-speed processing, great for real-time applications | Less accurate for complex reasoning |
llama-3.2-11b-vision-preview | 30 | 7,000 | 7,000 | 500,000 | Specialized for visual input tasks | Lower token capacity |
llama-3.2-1b-preview | 30 | 7,000 | 7,000 | 500,000 | Lightweight and efficient for small queries | Limited versatility |
llama-3.2-3b-preview | 30 | 7,000 | 7,000 | 500,000 | Balanced performance and scalability | Moderate token capacity |
llama-3.2-90b-vision-preview | 15 | 3,500 | 7,000 | 250,000 | Complex visual and text reasoning | Low throughput |
llama-3.3-70b-specdec | 30 | 1,000 | 6,000 | 100,000 | Precision-focused for decision-making | Limited token capacity |
llama-3.3-70b-versatile | 30 | 1,000 | 6,000 | 100,000 | Versatile for high-accuracy scenarios | Low throughput |
llama-guard-3-8b | 30 | 14,400 | 15,000 | 500,000 | Designed for safeguarding and content moderation | Not optimized for creative tasks |
llama3-70b-8192 | 30 | 14,400 | 6,000 | 500,000 | Long-context, ideal for extended conversations | Moderate speed and accuracy |
llama3-8b-8192 | 30 | 14,400 | 20,000 | 500,000 | High-speed inference with long-context support | Slightly less accurate for reasoning |
mixtral-8x7b-32768 | 30 | 14,400 | 5,000 | 500,000 | Sparse mixture-of-experts model with a long 32K-token context | Lower token throughput |
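Switching between these models can be as simple as feeding the chosen ID into LangChain's `ChatGroq` client. The sidebar widget below is an illustrative pattern, not necessarily how `app.py` wires it:

```python
# Hedged sketch: build the LLM from a sidebar selection.
# The model IDs come from the table above; everything else is illustrative.
import streamlit as st
from langchain_groq import ChatGroq

MODELS = ["llama-3.1-8b-instant", "llama3-70b-8192", "mixtral-8x7b-32768"]

choice = st.sidebar.selectbox("LLM model", MODELS)
llm = ChatGroq(model_name=choice)  # reads GROQ_API_KEY from the environment
```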
- User Input: Users upload a PDF file or provide a URL.
- Text Extraction: The app extracts text from the uploaded PDF or the fetched webpage.
- Chunking & Embeddings: The extracted text is split into smaller chunks, and an embedding is created for each chunk with an embedding model (the ChatGroq models generate answers; they do not produce embeddings).
- Vector Store: Embeddings are stored in a vector database (e.g., ChromaDB, FAISS, Pinecone, or DocArrayInMemorySearch from LangChain).
- Querying: Users can ask questions through the chatbot interface.
- Response Generation: The system retrieves relevant chunks from the vector database and feeds them into the selected LLM to generate the response.
- Conversation History: The chatbot maintains conversation history, enabling users to continue their chat without losing context.
- Model Switching: Users have the flexibility to switch between different LLM models as per their preferences.
- Final Answer: The LLM generates an answer grounded in the retrieved context and returns it to the user (a minimal end-to-end sketch follows this list).
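As a rough illustration of the steps above, here is a minimal LangChain sketch, not the app's actual code. The loader, splitter, embedding model (all-MiniLM-L6-v2 here is an assumption), and vector store can all be swapped for the alternatives listed above, and package paths may differ across LangChain versions:

```python
# Minimal RAG pipeline sketch (LangChain + Groq). Package paths follow
# recent LangChain releases and may differ in older versions.
import os
from langchain_community.document_loaders import WebBaseLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_huggingface import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS
from langchain_groq import ChatGroq

# 1-2. Load the article and extract its text.
docs = WebBaseLoader("https://example.com/article").load()

# 3. Split into overlapping chunks so retrieval stays precise.
chunks = RecursiveCharacterTextSplitter(
    chunk_size=1000, chunk_overlap=200
).split_documents(docs)

# 4. Embed each chunk and index it (embedding model is an assumption).
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
store = FAISS.from_documents(chunks, embeddings)
retriever = store.as_retriever(search_kwargs={"k": 4})

# 5-6. Retrieve relevant chunks and let the LLM answer from them.
llm = ChatGroq(model_name="llama-3.1-8b-instant",
               groq_api_key=os.environ["GROQ_API_KEY"])
question = "Summarize the article in three sentences."
context = "\n\n".join(d.page_content for d in retriever.invoke(question))
answer = llm.invoke(f"Answer using only this context:\n\n{context}\n\nQuestion: {question}")
print(answer.content)
```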
To use the chatbot locally or contribute to this repository, follow the steps below.
Before starting, ensure you have the following:
- Python 3.12
- pip for installing packages
- A `.env` file for your API keys (see below)
- A Groq API key (used by the ChatGroq LLM integration)
First, clone the repository to your local machine.
git clone /~https://github.com/RobinMillford/Multi-Model-RAG-Powered-Article-Chatbot.git
cd Multi-Model-RAG-Powered-Article-Chatbot
Create a virtual environment and install the required dependencies.
```bash
python -m venv venv
source venv/bin/activate  # On Windows, use 'venv\Scripts\activate'
pip install -r requirements.txt
```
You need to set your Groq API key. Create a `.env` file in the root directory and add the following:

```
GROQ_API_KEY=your_chatgroq_api_key
```

Replace `your_chatgroq_api_key` with your actual API key from Groq.
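To verify the key is picked up at runtime, a minimal check (assuming the app loads the file with python-dotenv, which is a guess about the implementation) looks like this:

```python
# Hedged sketch: load .env from the project root and confirm the key exists.
import os
from dotenv import load_dotenv

load_dotenv()  # reads the .env file in the current working directory
assert os.getenv("GROQ_API_KEY"), "GROQ_API_KEY not found; check your .env file"
```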
Once the dependencies are installed and the environment variables are set up, you can start the application locally.
```bash
streamlit run app.py
```
I deployed the app on Streamlit Cloud:
- Visit Streamlit Deployed Demo
Once the app is running (locally or on Streamlit Cloud), follow these steps:
- Upload a PDF file or provide a URL of an article.
- The system will extract text from the document or URL.
- The chatbot interface will appear, allowing you to ask questions about the content of the document.
- The chatbot will generate a response based on the context of the document.
I welcome contributions! To contribute, follow these steps:
- Fork the repository on GitHub.
- Clone your forked repository:
git clone /~https://github.com/RobinMillford/Multi-Model-RAG-Powered-Article-Chatbot.git
- Create a new branch:
```bash
git checkout -b feature-name
```
- Make your changes and commit them:
```bash
git add .
git commit -m "Description of the changes"
```
- Push your changes to your forked repository:
```bash
git push origin feature-name
```
- Open a Pull Request from your branch to the `main` branch of the original repository.
This project is licensed under the AGPL-3.0 license - see the LICENSE file for details.