nicolay-r / ARElight

Granular Viewer of Sentiments Between Entities in Massively Large Documents and Collections of Texts, powered by AREkit
https://link.springer.com/chapter/10.1007/978-3-031-56069-9_23
MIT License
37 stars 2 forks source link

`SampleRowProvider` -- Enable Crop Mode #125

Closed nicolay-r closed 1 year ago

nicolay-r commented 1 year ago

Reason: In some cases we may end up dealing with the very long sentences and texts without punkt signs. Once saved into samples, these sentences may quickly occupy the whole disk space :floppy_disk:

To enable this option, there is a need to switch onto cropping modes here: https://github.com/nicolay-r/ARElight/blob/75c339508215aa8383dbcf65eca11bea2e2fd981/arelight/samplers/bert.py#L50-L51