Use of "WordNet" for semantic indexing and information retrieval
This figure represents the way of filling the database with the words of the texts.
This figure shows the tokenization step of the entered text.
In the following figure we present the cleaned tokens, eliminating Function words (using the empty English word list).
In this phase we eliminated stop words, numbers, special characters, and we converted all words into lowercase.
In this phase we calculates the frequency of each term existing in the document.
This phase shows each word with its lemma using WordNet.
This phase shows each word with its first synset using WordNet.
The last phase of our application is the semantic search in the database.
Linkedin: aghezzafmohamed
Gmail: [email protected]