🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
-
Updated
Jan 17, 2025 - Python
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.
JChunk is a lightweight and flexible library designed to provide multiple strategies for text chunking within Spring Boot applications
An exploration of text splitting and chunking in JavaScript
LangChain is a framework, which is very helpful and easy to build applications based on available Large Language Models.
Text splitting example using Tiktoken
This is an experiment in learning langchain, pinecone and stuff, don't mind
Matching strings between lists based on length
Add a description, image, and links to the text-splitting topic page so that developers can more easily learn about it.
To associate your repository with the text-splitting topic, visit your repo's landing page and select "manage topics."