Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feat] Switch to using Chonkie.Tokenizer for Chunkers, Refineries #178

Merged
merged 7 commits into from
Feb 17, 2025

Conversation

bhavnicksm
Copy link
Collaborator

This pull request includes various updates to improve code readability, consistency, and functionality across multiple files. The changes primarily focus on formatting adjustments, code refactoring, and configuration updates.

Code Formatting and Consistency:

  • Updated string delimiters from single quotes to double quotes for consistency in cookbook/tutorials/Chonkie_RecursiveChunker_for_PDF_and_Markdown.ipynb and other files. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12]

Code Refactoring:

  • Refactored the BaseChef class and its methods in src/chonkie/chef/base.py to improve readability and ensure proper handling of optional parameters. [1] [2] [3] [4] [5]

Configuration Updates:

  • Updated pyproject.toml to improve package management and linting configurations, including adding new packages and modifying linting rules. [1] [2]

New Additions:

  • Added a new module chonkie.chef in pyproject.toml to include the new chef package.

Minor Adjustments:

  • Added missing newlines and adjusted indentation for better readability in various files. [1] [2] [3]

These changes collectively enhance the maintainability and readability of the codebase, ensuring a more consistent and clean structure.

@bhavnicksm bhavnicksm merged commit 932f484 into main Feb 17, 2025
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant