jprante / elasticsearch-langdetect

A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector
Apache License 2.0

Index custom analyzed data #58

Open doublex opened 7 years ago

doublex commented 7 years ago

Proposal: a "language_analyzer" setting that indexes the analyzed terms, not the ISO code. E.g.:

"fields": {
  "language": {
    "type": "langdetect",
    "languages": [ "af", "ar", "bg", "bn", "cs", "da", "de", "el", "en", "es", "et", "fa", "fi", "fr", "gu", "he", "hi", "hr", "hu", "id", "it", "ja", "kn", "ko", "lt", "lv", "mk", "ml", "mr", "ne", "nl", "no", "pa", "pl", "pt", "ro", "ru", "sk", "sl", "so", "sq", "sv", "sw", "ta", "te", "th", "tl", "tr", "uk", "ur", "vi", "zh-cn", "zh-tw" ],
    "language_analyzer": {
      "ar": "arabic",
      "bg": "bulgarian",
      "cs": "czech",
      "da": "danish",
      "de": "german",
      "el": "greek",
      "en": "english",
      "es": "spanish",
      ...
    }
  }
}
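
To make the intent of the mapping above concrete, here is a minimal sketch of the lookup such a setting might perform: take the detected language code and pick the analyzer to run on the text. This is only an illustration under assumptions, not the plugin's behavior. The names `LANGUAGE_ANALYZER`, `DEFAULT_ANALYZER`, and `analyzer_for` are hypothetical, and falling back to the `standard` analyzer for unmapped languages is an assumption, since the request does not say what should happen in that case.

```python
# Hypothetical sketch of the requested "language_analyzer" lookup.
# None of these names exist in the plugin; they are for illustration only.

# Entries copied from the example mapping in the request (the list there is elided).
LANGUAGE_ANALYZER = {
    "ar": "arabic",
    "bg": "bulgarian",
    "cs": "czech",
    "da": "danish",
    "de": "german",
    "el": "greek",
    "en": "english",
    "es": "spanish",
}

# Assumption: languages without a mapped analyzer fall back to "standard".
DEFAULT_ANALYZER = "standard"


def analyzer_for(language_code: str) -> str:
    """Return the analyzer name to use for a detected language code."""
    return LANGUAGE_ANALYZER.get(language_code, DEFAULT_ANALYZER)
```

With this, a document detected as "de" would be analyzed with the `german` analyzer, while a detected code that has no entry would use the assumed fallback.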