At the moment the TextDatasetFromDataFrame class collects tag/entity sequences from the start of the document until it fills the block. This is not ideal; we would rather have the block of text surrounding the target terms:
if there is one term, just center around the term
if there are more terms, center around the int(N/2) term
At the moment the TextDatasetFromDataFrame class collects tag/entity sequences from the start of the document until it fills the block. This is not ideal; we would rather have the block of text surrounding the target terms: