Skip to content

Tanvi141/WikiSearch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WikiSearch

Search engine on English Wikipedia

Indexer

To run: index.sh <path_to_wiki_dump> <path_to_invertedindex_output> <invertedindex_stat.txt> invertedindex_stat.txt contains two numbers:

  1. Total number of tokens (​ after converting to lowercase​ ) encountered in the dump
  2. Total number of tokens in the inverted index

Related code files are in the directory indexer

Searcher

To run: search.sh <path to directory containing the index> "<query string>"

Related code files are in the directory searcher

About

Search engine on english wikipedia

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published