DataSeer / dataseer-ml

DataSeer machine-learning service
Apache License 2.0
25 stars 2 forks source link

Use pySBD for sentence segmentation #4

Closed kermitt2 closed 1 year ago

kermitt2 commented 4 years ago

because it is much more robust on scientific literature than other general purpose sentence segmenter.

1) via the existing JEP integration.

2) Or as alternative run the original ruby implementation with JRuby, https://github.com/diasks2/pragmatic_segmenter.

kermitt2 commented 1 year ago

Done