Kamapcuc/elasticsearch-analysis-morphology

 
 


A fork of elasticsearch-analysis-morphology that adds two new token filters:

  • russian_word_form
  • english_word_form

For example, the following analyzer keeps only nouns in the nominative case:
PUT test
{
   "settings": {
      "index": {
         "analysis": {
            "analyzer": {
               "noun": {
                  "type": "custom",
                  "char_filter": [],
                  "tokenizer": "standard",
                  "filter": [
                     "noun_filter"
                  ]
               }
            },
            "filter": {
               "noun_filter": {
                  "type": "russian_word_form",
                  "grammemas": [
                     "С",
                     "им"
                  ]
               }
            }
         }
      }
   }
}
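
To check what the analyzer keeps, one option is Elasticsearch's standard `_analyze` API, run against the index created above. This is a minimal sketch: it assumes the plugin is installed and the `test` index exists with the settings shown, and the sample sentence is only an illustration. Given the description above, the grammemes `С` and `им` appear to select nouns and the nominative case respectively.

```
GET test/_analyze
{
   "analyzer": "noun",
   "text": "мама мыла раму"
}
```

The response lists the tokens that passed through `noun_filter`, so tokens whose word form does not match the configured grammemes should be absent from it.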

About

Morphological Analysis Plugin for ElasticSearch
