RAG App with Chain of Thought (CoT) and Multi-Query Retrieval for WordPress Blogs

This project is a Retrieval-Augmented Generation (RAG) application that enhances response accuracy and depth by implementing a Chain of Thought (CoT) approach. By generating multiple alternative queries, the system retrieves a more comprehensive set of documents to answer complex user questions with improved relevance and context.

Overview

This RAG app uses LangChain to answer user questions with responses grounded in retrieved documents. Following a Chain of Thought (CoT) approach, the app generates multiple interpretations of each question and uses each one to retrieve related documents from a vector database. The combined results give the model broader context than a single query would, which leads to more complete and better-informed answers.

Features

- Chain of Thought (CoT) Multi-Query Generation: Generates multiple alternative versions of the user's question to retrieve a broader set of relevant documents.
- Retrieval-Augmented Generation (RAG): Provides accurate answers by retrieving related documents and combining them with the generative model's response.
- Vector Database Integration: Uses Chroma as a vector store to manage document embeddings.
- Web Scraping: Extracts and processes data from web pages for use in the vector database.

Installation

Clone the repository:

git clone https://github.com/Mohansharma13/rag-for-wordpress.git

Navigate to the project directory:
```
cd rag-for-wordpress
```
Install dependencies:
```
pip install -r requirements.txt
```
Set up your environment variables:
- Create a .env file in the root directory.
- Add your GOOGLE_API_KEY and any other required API keys.
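
As a rough illustration of the step above, the sketch below reads `KEY=value` pairs from a `.env` file into the process environment using only the standard library. The helper name `load_env_file` is hypothetical and not taken from this repository; the project may well rely on a package such as python-dotenv instead. `GOOGLE_API_KEY` is the only variable this README names.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Read KEY=value lines from a .env file into os.environ.

    Hypothetical helper for illustration; returns the keys found in the file.
    """
    found = []
    env_path = Path(path)
    if not env_path.exists():
        return found
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and anything that is not KEY=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        key = key.strip()
        value = value.strip().strip('"').strip("'")
        # Keep values that are already set in the shell.
        os.environ.setdefault(key, value)
        found.append(key)
    return found
```

Calling `load_env_file()` at startup and then checking `os.environ.get("GOOGLE_API_KEY")` gives an early, readable failure when the key is missing, instead of an authentication error on the first model call.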

Usage

Run the Application: Launch the app using Streamlit:
```
streamlit run app.py
```
Interact with the RAG System: Enter a question. The system generates multiple interpretations of it, retrieves the most relevant documents for each, and answers using the combined context.

How It Works

Data Retrieval and Vectorization:
- Data is retrieved from a specified URL using a custom web scraper.
- The raw content is then cleaned of HTML tags and irrelevant shortcodes.
- The cleaned text is converted into document embeddings using sentence-transformers/all-MiniLM-L6-v2, then stored in a Chroma vector database.
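
The cleaning step described above might look roughly like the following. This is a minimal standard-library sketch, not the code in `web_to_text.py` or `scrap_data.py`; the function name `clean_post_html` and the regular expressions are assumptions made for illustration.

```python
import html
import re

# Matches WordPress-style shortcodes such as [gallery ids="1,2"] or [/caption].
SHORTCODE_RE = re.compile(r"\[/?[A-Za-z][\w-]*(?:\s[^\]]*)?\]")
# Script and style elements are removed together with their contents.
SCRIPT_STYLE_RE = re.compile(r"<(script|style)\b.*?</\1\s*>", re.IGNORECASE | re.DOTALL)
# Any remaining tag, opening or closing.
TAG_RE = re.compile(r"<[^>]+>")


def clean_post_html(raw_html):
    """Return plain text with tags, shortcodes, and extra whitespace removed."""
    text = SCRIPT_STYLE_RE.sub(" ", raw_html)
    text = TAG_RE.sub(" ", text)
    text = html.unescape(text)          # turn entities such as &amp; into characters
    text = SHORTCODE_RE.sub(" ", text)
    return re.sub(r"\s+", " ", text).strip()
```

Regular expressions keep the sketch short, but they are blunt: bracketed prose such as `[see below]` would also be stripped, and malformed markup can slip through. A real HTML parser is the more robust choice for production scraping.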
Chain of Thought (CoT) Multi-Query Generation:
- For each user query, the app generates multiple alternative queries through a prompt template in LangChain. This multi-query retriever enhances accuracy by capturing various angles of the initial question.
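
To make the query-generation step concrete, here is a small sketch that builds a prompt asking for alternative phrasings and parses the model's reply into a list of queries. The template wording and the names `build_multi_query_prompt`, `parse_queries`, and `generate_queries` are hypothetical, and the model call is represented by any callable that maps a prompt string to text. The project's actual LangChain prompt is not shown in this README.

```python
import re

# Illustrative template; the wording is an assumption, not the project's prompt.
MULTI_QUERY_TEMPLATE = (
    "You are an AI assistant. Write {n} alternative versions of the user "
    "question below so that different relevant documents can be retrieved "
    "from a vector database. Put each version on its own line.\n"
    "Question: {question}"
)


def build_multi_query_prompt(question, n=3):
    return MULTI_QUERY_TEMPLATE.format(n=n, question=question)


def parse_queries(llm_output, original_question):
    """Split the model output into queries, keeping the original question first."""
    queries = [original_question.strip()]
    for line in llm_output.splitlines():
        # Drop list markers such as "1.", "2)", "-", or "*" that models often add.
        candidate = re.sub(r"^\s*(?:\d+[.)]|[-*])\s*", "", line).strip()
        if candidate and candidate not in queries:
            queries.append(candidate)
    return queries


def generate_queries(question, generate, n=3):
    """`generate` is any callable that maps a prompt string to the model's reply."""
    return parse_queries(generate(build_multi_query_prompt(question, n)), question)
```

Keeping the original question in the list is a design choice in this sketch: if the model returns nothing usable, retrieval still runs with the user's own wording.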
Multi-Query Retrieval:
- The generated queries are used to retrieve documents from the vector database.
- By combining the results of these queries, the system assembles a richer context for answering the question.
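
Combining the results can be sketched as a de-duplicated union over the hits for every query. In the snippet below, `search(query, k)` is a placeholder for the vector-store lookup and is assumed to return a list of text passages, best match first; the function name `retrieve_unique` is likewise hypothetical.

```python
def retrieve_unique(queries, search, k=4):
    """Run every query and merge the hits, dropping duplicate passages.

    `search(query, k)` stands in for the vector-store lookup (an assumption).
    """
    seen = set()
    merged = []
    for query in queries:
        for passage in search(query, k):
            # Compare passages ignoring case and whitespace differences.
            key = " ".join(passage.split()).lower()
            if key not in seen:
                seen.add(key)
                merged.append(passage)
    return merged
```

Because different phrasings often retrieve the same chunk, removing duplicates keeps the context passed to the model compact while still covering every distinct passage.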
Response Generation:
- The retrieved documents are passed to a generative AI model (e.g., ChatGoogleGenerativeAI) to produce a comprehensive response.
- If the context is insufficient, the model relies on its training knowledge to generate a complete answer.
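
The final step can be pictured as assembling one prompt from the retrieved passages and the question. The sketch below is an assumption about how that might be done: the prompt wording, the `max_chars` budget, and the name `build_answer_prompt` are illustrative and do not come from the repository. The resulting string would then be sent to the generative model.

```python
def build_answer_prompt(question, passages, max_chars=6000):
    """Assemble the final prompt from retrieved passages and the question.

    The wording and the `max_chars` budget are illustrative assumptions.
    """
    context_parts = []
    used = 0
    for i, passage in enumerate(passages, start=1):
        block = f"[{i}] {passage.strip()}"
        if used + len(block) > max_chars:
            break  # stop before the context exceeds the budget
        context_parts.append(block)
        used += len(block)
    context = "\n\n".join(context_parts) if context_parts else "(no relevant documents found)"
    return (
        "Answer the question using the context below. If the context is not "
        "sufficient, rely on your general knowledge to give a complete answer.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )
```

Numbering the passages makes it easier to trace which retrieved text an answer draws on, and the explicit fallback instruction mirrors the behavior described in the last bullet above.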

Conclusion

The CoT-based multi-query strategy helps with complex and open-ended questions: retrieving with several phrasings of the same question gives the model broader context than a single query, so responses can be both more accurate and better grounded in the source content. This approach shows how retrieval-augmented generation can be applied to a real-world content source such as a WordPress blog.

Contributing

Contributions are welcome! Please fork the repository and submit a pull request with any enhancements or bug fixes.

License

This project is licensed under the MIT License.
