This originates from the NER application (https://github.com/nicolay-r/ARElight/issues/118).
The snippet below shows that we apply the text processing pipeline to each sentence separately (text_parser.run).
To improve document processing performance, we need to switch from processing a single sentence to processing a list of sentences, i.e. to support batching.
https://github.com/nicolay-r/AREkit/blob/4c577cb52eb4aabd547c80f939bdf05edb908634/arekit/common/docs/parser.py#L19-L25
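To make the proposed switch concrete, here is a minimal sketch of running pipeline steps over lists of sentences instead of one sentence at a time. It is not the AREkit implementation: `iter_batches`, `run_batched`, the `BatchStep` type, and the `batch_size` parameter are hypothetical names introduced only for illustration, and each step is assumed to accept a list of sentences and return one result per sentence.

```python
from typing import Callable, Iterable, Iterator, List

# Hypothetical stand-in for a pipeline step (not the AREkit API):
# it receives a batch of sentences and returns one result per sentence.
BatchStep = Callable[[List[str]], List[str]]


def iter_batches(sentences: Iterable[str], batch_size: int) -> Iterator[List[str]]:
    """Group sentences into lists of at most `batch_size` items."""
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    batch: List[str] = []
    for sentence in sentences:
        batch.append(sentence)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Emit the last, possibly shorter, batch.
        yield batch


def run_batched(sentences: Iterable[str], steps: List[BatchStep],
                batch_size: int = 2) -> List[str]:
    """Apply every step to whole batches and return results in input order."""
    results: List[str] = []
    for batch in iter_batches(sentences, batch_size):
        data = batch
        for step in steps:
            data = step(data)
            # Assumption: a step keeps a one-to-one mapping with its input.
            if len(data) != len(batch):
                raise ValueError("a step must return one result per sentence")
        results.extend(data)
    return results
```

With this shape, a step such as NER can process a whole batch in a single model call, while the caller still receives one result per sentence in the original order.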
[x] :x: These parameters could be removed: https://github.com/nicolay-r/AREkit/blob/4c577cb52eb4aabd547c80f939bdf05edb908634/arekit/common/docs/parser.py#L31-L32
The following is actually required and is tied to the related parameter in the context: https://github.com/nicolay-r/AREkit/blob/4c577cb52eb4aabd547c80f939bdf05edb908634/arekit/contrib/source/brat/entities/parser.py#L10