richlo01/searchEngine

Search Engine

A search engine over the ICS repository that ranks results and returns them in under 200 ms. We improved query results by scoring documents against the query tokens with cosine similarity over term frequency-inverse document frequency (TF-IDF) weights. Before the search engine runs, a scraper we developed builds an index that is sorted alphabetically. At this stage we preprocess words and check for duplicate documents using SimHash. Because this work is done beforehand, results are both accurate and fast to return.
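
The two techniques named above can be sketched in a few lines of Python. This is a minimal illustration, not the code in this repository: the function names (`tf_idf_vectors`, `cosine`, `rank`, `simhash`, `hamming`), the log-scaled term frequency, the MD5-based token hash, and the 64-bit fingerprint size are all assumptions chosen for the example.

```python
import hashlib
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Build one sparse {term: weight} vector per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    idf = {term: math.log(n / count) for term, count in df.items()}
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Log-scaled term frequency times inverse document frequency.
        vectors.append({t: (1 + math.log(c)) * idf[t] for t, c in tf.items()})
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query_tokens, docs):
    """Return indices of matching documents, best match first."""
    vectors, idf = tf_idf_vectors(docs)
    # Query terms that never occur in the corpus carry no weight.
    q_tf = Counter(t for t in query_tokens if t in idf)
    q_vec = {t: (1 + math.log(c)) * idf[t] for t, c in q_tf.items()}
    scores = [(cosine(q_vec, v), i) for i, v in enumerate(vectors)]
    ordered = sorted(scores, key=lambda s: (-s[0], s[1]))
    return [i for score, i in ordered if score > 0]


def simhash(tokens, bits=64):
    """Fingerprint a token list; near-duplicate texts differ in few bits."""
    counts = [0] * bits
    for token, weight in Counter(tokens).items():
        digest = hashlib.md5(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        for i in range(bits):
            # Each token votes +weight or -weight on every bit position.
            counts[i] += weight if (h >> i) & 1 else -weight
    return sum(1 << i for i in range(bits) if counts[i] > 0)


def hamming(a, b):
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")
```

In a sketch like this, the TF-IDF vectors and SimHash fingerprints would be computed once at indexing time, so that a query only has to build its own vector and compare it against stored ones. Two pages would be treated as duplicates when the Hamming distance between their fingerprints falls below a chosen threshold; the threshold value is a tuning decision not stated here.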


An example image is as follows:
