SGLang is a fast serving framework for large language models and vision language models.
-
Updated
Jan 18, 2025 - Python
SGLang is a fast serving framework for large language models and vision language models.
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
This project is a web-based PDF question-answering chatbot powered by LangChain, and Deepseek v3's Language Learning Models (LLMs). Users can upload PDFs, ask questions related to the uploaded documents, and receive accurate responses. (Frontend still in progress. Use the website link to test the app with a rough frontend)
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A project to compare language models like DeepSeek-V3 and Llama3.1 for tasks such as text generation. Includes a FastAPI-based REST API for real-time inference, Docker support for deployment, and a flexible framework for evaluation and experimentation.
MiniMax-01 is a simple implementation of the MiniMax algorithm, a widely used strategy for decision-making in two-player turn-based games like Tic-Tac-Toe. The algorithm aims to minimize the maximum possible loss for the player, making it a popular choice for developing AI opponents in various game scenarios.
Add a description, image, and links to the deepseek-v3 topic page so that developers can more easily learn about it.
To associate your repository with the deepseek-v3 topic, visit your repo's landing page and select "manage topics."