A comprehensive Next.js application for running and exploring `.gguf` open-source LLM models locally.
LLM Studio is designed to empower users to harness the capabilities of large language models (LLMs) on their own machines, providing a seamless, private, and offline experience. It is the ideal solution for anyone looking to dive deeper into open-source LLMs while maintaining control over their data and processes. Explore the power of large language models, all from the comfort of your own machine.
LLM Studio operates with a focus on simplicity and efficiency. Here’s how it works:
- **Chat History Management:**
  - Each chat session's history is saved as a JSON file in the `history` folder.
  - The filename corresponds to the date when the chat was first initiated, in `YYYY-MM-DD` format (see the example folder layout after this list).
- **Model Storage:**
  - `.gguf` models should be placed in the `models` folder.
  - By default, the smallest model in the folder is selected as the default model.
- **Optimized History Loading:**
  - All chat histories are loaded into the browser for quick access.
  - Since they are stored as JSON files, the data size remains minimal.
- **Search Functionality:**
  - The search field allows users to quickly navigate through their entire chat history.
- **Model Switching:**
  - Users can switch between different models available in the `models` folder as needed.
- **Incognito Mode:**
  - Activate incognito mode to prevent saving chat history from that point onward.
- **Copy and Delete Options:**
  - Easily copy any chat message by clicking the copy icon.
  - Delete chat sessions using the delete icon for better organization.
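For illustration only (the file names and dates below are hypothetical), the on-disk layout described above might look like this:

```bash
# Hypothetical example: chat histories are date-named JSON files, models are .gguf files
ls history/
# 2024-05-18.json  2024-06-02.json

ls models/
# llama-3-8b-instruct.Q4_K_M.gguf  mistral-7b-v0.3.f16.gguf
```

With a layout like this, whichever `.gguf` file is smallest on disk would be selected as the default model.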
To build `llama.cpp` for the model conversion steps below, you will need:

- **CMake**: Verify your installation with `cmake --version`.
- **C++ Compiler**: Install `gcc` (Linux), `clang` (macOS), or Visual Studio (Windows).
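If you are unsure whether these tools are already available, a quick way to check (commands vary by platform; the Windows check assumes a Visual Studio Developer Command Prompt) is:

```bash
cmake --version      # CMake
gcc --version        # Linux (GCC)
clang --version      # macOS (Apple Clang)
# On Windows, open a "Developer Command Prompt for VS" and run: cl
```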
Follow these steps to convert a Hugging Face model into the `.gguf` format for use in LLM Studio:
Clone the `llama.cpp` repository, which contains the necessary tools for the conversion:

```bash
git clone /~https://github.com/ggerganov/llama.cpp
cd llama.cpp/
```
Use CMake to configure the project:

```bash
cmake -S . -B build
```
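Note that `cmake -S . -B build` only generates the build files. Compiling llama.cpp's native tools is not strictly required for the Python conversion script below, but if you do want the compiled binaries, the usual follow-up is:

```bash
# Compile the configured project (optional for the conversion step itself)
cmake --build build
```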
Install the necessary Python dependencies:

```bash
pip install -r requirements.txt
```
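Depending on your setup, you may prefer to install these packages inside a virtual environment to keep them isolated from your system Python, for example:

```bash
# Optional: create and activate an isolated environment before installing
python3 -m venv .venv
source .venv/bin/activate    # on Windows: .venv\Scripts\activate
pip install -r requirements.txt
```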
Run the conversion script to convert the Hugging Face model to `.gguf` format with FP16 precision:

```bash
python convert_hf_to_gguf.py --outtype f16 models/<INSERT_YOUR_FOLDER_NAME_HERE>
```

Replace `<INSERT_YOUR_FOLDER_NAME_HERE>` with the path to your model folder.
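As a concrete illustration (the folder and output names here are hypothetical), a full run might look like the following; the script typically writes the converted file into the model folder, and you can also set an explicit output path with `--outfile`:

```bash
# Example only: "my-hf-model" is a placeholder for your downloaded Hugging Face model folder
python convert_hf_to_gguf.py --outtype f16 --outfile models/my-hf-model-f16.gguf models/my-hf-model

# Copy the resulting .gguf file into LLM Studio's models folder (path is a placeholder)
cp models/my-hf-model-f16.gguf /path/to/llm-studio/models/
```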
You may encounter errors like this when running on a GPU:
```
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 CUDA devices:
  Device 0: NVIDIA GeForce RTX <model-name>, compute capability 7.5, VMM: yes
<path-to>\node-llama-cpp\node-llama-cpp\llama\llama.cpp\ggml\src\ggml-cuda\ggml-cuda.cu:70: CUDA error
```
This error typically occurs when your GPU cannot handle the model, for example because it runs out of memory. Try a smaller or more heavily quantized model to resolve the issue.
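If you want to check whether available GPU memory is the limiting factor, the standard NVIDIA driver tools can report it (this assumes an NVIDIA GPU with `nvidia-smi` installed):

```bash
# Show the GPU name plus total and currently used VRAM
nvidia-smi --query-gpu=name,memory.total,memory.used --format=csv
```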
We welcome contributions! If you'd like to improve the project, please fork the repository, make your changes, and submit a pull request. For any issues or feature requests, feel free to open an issue on GitHub.
This project is licensed under the Apache License Version 2.0.