The Korean language if occasionally written without spaces separating the words. This PR aims to handle words that are in the dictionary, but not all entirely extracted due to being "stuck together" in the input text.
Coverage decreased (-0.3%) to 98.966% when pulling 6bb27adcddb3f8269b2c115b60dc50c10ef6708c on jwnz:master into 50c45f1f4a394572381249681046f57e2bf5a591 on vi3k6i5:master.
Coverage decreased (-0.3%) to 98.966% when pulling 6bb27adcddb3f8269b2c115b60dc50c10ef6708c on jwnz:master into 50c45f1f4a394572381249681046f57e2bf5a591 on vi3k6i5:master.
The Korean language if occasionally written without spaces separating the words. This
PR
aims to handle words that are in the dictionary, but not all entirely extracted due to being "stuck together" in the input text.Expected output:
['한국', '전력', '공사']
Real output:
['한국', '공사']