Open maharajbrahma opened 2 years ago
After tokenizing extra space after Dari mark. The space should not exists after Dari marker.
tokenize("मख’जाथाव राव गोनोखोआरि आरो रोंगौसाफोरजों दाजानाय मोनसे हारिमायारि बोसोनगिरि आफादजों।") "मख'जाथाव राव गोनोखोआरि आरो रोंगौसाफोरजों दाजानाय मोनसे हारिमायारि बोसोनगिरि आफादजों । "
Another problem that occurs when more than one punctuation marks are one after another.
For e.g. । followed by ' and another such sequences.
Both the issues have been fixed in the commit e36ffa0
Nice
After tokenizing extra space after Dari mark. The space should not exists after Dari marker.