✨ LLM based QA chatbot builder

Paper Link

https://www.sciencedirect.com/science/article/pii/S235271102400400X

🏁 An end-to-end solution to develop a fully open-source application based on open-source models and libraries.

🎯 What Is LLM based QA chatbot builder?

Developing an LLM-based QA chatbot involves several stages: (a) collecting and preprocessing data; (b) fine-tuning, testing, and running inference with the LLM; and (c) developing the chat interface. To speed up this process, we present the LLM QA builder, a web application that brings all of these steps together and simplifies building an LLM QA chatbot for both technical and non-technical users. The system can fine-tune Zephyr, Mistral, Llama-3, Phi, Flan-T5, or a user-provided model to retrieve information relevant to an organization, and the fine-tuned LLMs can be further improved with retrieval-augmented generation (RAG). An automatic RAG data scraper based on web crawling is included. The system also has a human evaluation component for assessing model quality.
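
To make the RAG step more concrete, here is a minimal, dependency-free sketch of retrieval followed by prompt construction. It is not the project's implementation: the builder lists ColBERT and bge-large-en-v1.5 as its embedding models, whereas this sketch swaps in a simple bag-of-words cosine similarity so that it runs with the standard library alone. The function names, the prompt layout, and the example passages are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, passages, k=2):
    """Return the k passages most similar to the question."""
    q_vec = Counter(tokenize(question))
    scored = [(cosine(q_vec, Counter(tokenize(p))), p) for p in passages]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [p for _, p in scored[:k]]


def build_prompt(question, passages, k=2):
    """Prepend the retrieved passages to the question as context."""
    context = "\n".join(f"- {p}" for p in retrieve(question, passages, k))
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"


# Hypothetical organization-specific passages (illustration only).
PASSAGES = [
    "The library is open from 9 am to 5 pm on weekdays.",
    "Admission forms are available at the registrar office.",
    "The cafeteria serves lunch between noon and 2 pm.",
]

if __name__ == "__main__":
    print(build_prompt("When is the library open?", PASSAGES, k=1))
```

In a real deployment the prompt produced by `build_prompt` would be passed to the fine-tuned LLM, and the similarity function would be replaced by a learned embedding model.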

🎯 Features

🦾 Model Support	Implemented	Description
Mistral	✅	Fine-tuning model powered by Mistral
Zephyr	✅	Fine-tuning model powered by HuggingFace
Llama-3	✅	Fine-tuning model powered by Meta
Microsoft Phi-3	✅	Fine-tuning model powered by Microsoft
Flan-T5	✅	Fine-tuning model powered by Google
ColBERT	✅	Embedding model
bge-large-en-v1.5	✅	Embedding model

🔀 Here is the diagram of the software architecture.

⭐ Key Features

1️⃣ Data collection: Collect data from users or Excel files, and automatically build RAG data using a web crawler.

2️⃣ Finetune: Fine-tune state-of-the-art models (e.g., Mistral, Llama, Zephyr, Phi-3) and lightweight models (e.g., Flan-T5).

3️⃣ Testing data generation: Generate test data using the fine-tuned models.

4️⃣ Human evaluation: Evaluate model performance based on user ratings.

5️⃣ Inference: Perform inference using the fine-tuned models.

6️⃣ Deployment: Deploy the fine-tuned models.
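
As a rough illustration of the data collection feature (1️⃣), the sketch below turns a crawled HTML page into plain text and splits it into overlapping chunks suitable for a RAG index. It is an assumption-laden example rather than the project's crawler: fetching pages is omitted, and the class name, function names, chunk size, and overlap are all hypothetical choices.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Return the visible text of an HTML document as one string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def chunk_words(text, size=50, overlap=10):
    """Split text into chunks of up to `size` words.

    Consecutive chunks share `overlap` words, so a sentence cut at a
    chunk boundary still appears intact in at least one chunk.
    """
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks


if __name__ == "__main__":
    page = "<html><body><h1>Admissions</h1><p>Forms are at the office.</p></body></html>"
    for chunk in chunk_words(html_to_text(page), size=4, overlap=1):
        print(chunk)
```

Overlapping chunks trade a little index size for better recall at chunk boundaries; the right size depends on the embedding model's input limit.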

🎯 Getting started

🚀 Installation

git clone https://github.com/shahidul034/LLM-based-QA-chatbot-builder

conda create -n llm python=3.10
conda activate llm
pip install torch torchvision torchaudio jupyter langchainhub sentence-transformers faiss-gpu docx2txt langchain bitsandbytes transformers peft accelerate pynvml trl datasets packaging ninja wandb "colbert-ai[torch,faiss-gpu]" RAGatouille
pip install -U flash-attn --no-build-isolation

or

pip install -r requirements.txt

Run

cd src
python full_UI.py
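
Before launching the UI, it can help to confirm that the dependencies from the installation step are importable. The following is a minimal sketch and not part of the repository: the `missing_modules` helper is hypothetical, and the module list is an assumption that maps the pip package names to their usual import names (for example, `faiss-gpu` to `faiss` and `colbert-ai` to `colbert`).

```python
import importlib.util

# Import names assumed to correspond to the packages in the install command.
REQUIRED_MODULES = [
    "torch", "transformers", "peft", "accelerate", "trl", "datasets",
    "langchain", "sentence_transformers", "bitsandbytes", "faiss",
    "colbert", "ragatouille", "docx2txt", "pynvml", "wandb",
]


def missing_modules(names):
    """Return the names that cannot be located in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules were found.")
```

The check only locates each module without importing it, so it stays fast even for heavy packages; it does not verify versions or GPU support.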

🎯 Contributing

Contributions are always welcome!

🎯 License

MIT

ElsevierSoftwareX/SOFTX-D-24-00446
