rgarfield11 / text_chunker_ex

A library for semantically coherent text chunking
MIT License
0 stars 0 forks source link

Engineer - Text Chunker - Implement Splitting Strategies #2

Open rgarfield11 opened 9 months ago

rgarfield11 commented 9 months ago

Background

While the Chunker library currently employs a Recursive Split strategy for text segmentation, suitable for maintaining semantic integrity with customizable overlap, there is a clear need for a less complex, basic splitting strategy. This foundational approach would offer a straightforward method to split text into chunks based purely on size, without any overlap or concern for semantic boundaries, thereby serving use cases that require simple, direct fragmentation of text.

Acceptance Criteria

Scenario: Implementing Basic Text-Splitting Strategy

Given I am a developer looking to include basic text-splitting logic