Skip to content

Latest commit

 

History

History
35 lines (34 loc) · 872 Bytes

File metadata and controls

35 lines (34 loc) · 872 Bytes

A fork of elasticsearch-analysis-morphology that adds two new token filters:

  • russian_word_form
  • english_word_form For example those analyzer leave only nouns in nominative case:
PUT test
{
   "settings": {
      "index": {
         "analysis": {
            "analyzer": {
               "noun": {
                  "type": "custom",
                  "char_filter": [],
                  "tokenizer": "standard",
                  "filter": [
                     "noun_filter"
                  ]
               }
            },
            "filter": {
               "noun_filter": {
                  "type": "russian_word_form",
                  "grammemas": [
                     "С",
                     "им"
                  ]
               }
            }
         }
      }
   }
}