OpenPecha / toolkit-v2

OpenPecha toolkit version 2
MIT License

Feat/pecha plaintext parser #32

Closed by tenzin3 2 months ago

tenzin3 commented 2 months ago

The Pecha Parser is designed with the following key principles in mind:

The parser operates with the following logic:

  1. Input Text: Accepts the text to be processed.
  2. Segmenter: Segments the text using one of the following methods: Space Segmenter, New Line Segmenter, or Regex Segmenter.
  3. Annotation Name: Assigns a name to the segmented text. The annotation name must be a member of a predefined enum.
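
The three steps above can be sketched roughly as follows. This is only an illustration, not the code in this PR: `AnnotationName` and its members, the three segmenter functions, and `parse` are all hypothetical names, and the regex segmenter is assumed to treat each match of the pattern as one segment (the description does not say whether the pattern matches segments or the separators between them).

```python
import re
from enum import Enum
from typing import Callable, Dict, List


class AnnotationName(Enum):
    # Hypothetical members; the toolkit defines its own predefined list.
    SEGMENT = "segment"
    CHAPTER = "chapter"


def space_segmenter(text: str) -> List[str]:
    # Split on any run of whitespace.
    return text.split()


def newline_segmenter(text: str) -> List[str]:
    # One segment per non-empty line.
    return [line for line in text.splitlines() if line.strip()]


def regex_segmenter(pattern: str) -> Callable[[str], List[str]]:
    # Assumption: every match of `pattern` is one segment.
    compiled = re.compile(pattern)

    def segment(text: str) -> List[str]:
        return [m.group(0).strip() for m in compiled.finditer(text)]

    return segment


def parse(
    text: str,
    segmenter: Callable[[str], List[str]],
    annotation_name: AnnotationName,
) -> Dict[str, List[str]]:
    # Reject names that are not members of the enum.
    if not isinstance(annotation_name, AnnotationName):
        raise ValueError(f"Unknown annotation name: {annotation_name!r}")
    return {annotation_name.value: segmenter(text)}
```

For example, `parse("a b  c", space_segmenter, AnnotationName.SEGMENT)` groups the three space-separated tokens under the `"segment"` key, while passing a plain string such as `"segment"` as the annotation name raises `ValueError`.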