jaiminpan / pg_jieba

PostgreSQL full-text search extension for Chinese
BSD 3-Clause "New" or "Revised" License

Space character (" ") on stop-word list acts like a separator instead of a character #47

Open YiFanChen99 opened 4 years ago

YiFanChen99 commented 4 years ago

The space character (" ") is already on the default stop-word list, but SELECT ts_lexize('jieba_stem', ' '); returns {" "} instead of {}.

Furthermore, after adding "apple watch" to the stop-word list, ts_lexize gives these results: apple watch => {"apple watch"}, apple => {}, watch => {watch}

It seems that the space character splits each stop-word line, so only the first word on the line ("apple") is registered as a stop word.
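
To make the suspected behavior concrete, here is a minimal Python sketch. It is only a model of the hypothesis, not pg_jieba's actual loader (which is not shown in this issue); the function names `load_first_token`, `load_whole_line`, and `lexize` are hypothetical. It assumes the stop-word file has one entry per line and compares a loader that keeps only the first whitespace-delimited word with one that keeps the whole line.

```python
def load_first_token(lines):
    """Hypothetical loader: keep only the first whitespace-delimited word per line."""
    stop = set()
    for line in lines:
        tokens = line.split()  # splits on any whitespace; a line of spaces yields []
        if tokens:
            stop.add(tokens[0])
    return stop


def load_whole_line(lines):
    """Hypothetical loader: keep the whole line, stripping only the line ending."""
    stop = set()
    for line in lines:
        entry = line.rstrip("\r\n")
        if entry:
            stop.add(entry)
    return stop


def lexize(stop, token):
    """Mimic the shape of ts_lexize output: [] for a stop word, [token] otherwise."""
    return [] if token in stop else [token]


# Stop-word file contents assumed from the report: a single space, and "apple watch".
STOP_LINES = [" \n", "apple watch\n"]

first = load_first_token(STOP_LINES)
whole = load_whole_line(STOP_LINES)

for token in [" ", "apple watch", "apple", "watch"]:
    print(repr(token), lexize(first, token), lexize(whole, token))
```

Under these assumptions, the first-token loader gives the same four results as reported above, while the whole-line loader treats both " " and "apple watch" as stop words and leaves "apple" and "watch" alone.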