Skip to content

yaldashbz/web-search-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web Search Engine

Information Retrieval Course Project

Sharif University of Technology

Methods

This project is search on the web pages with five methods and query expansion:

  • boolean
  • TF-IDF
  • A transformer-based model
  • fasttext

After the search, you can apply link analysis on the results.

  • pagerank
  • hits

Also classification and clustring are available.

you can use different embeddings for different methods.


How to Start

  1. Clone the repo.
git clone https://github.com/IR1401-Spring-Final-Projects/Web1401-10_38.git
  1. Install the requirements.
pip install -r requirements.txt
  1. Get the available models from this link and extract them in the root of the project (next to manage.py)

  2. Run the server

python manage.py runserver
  1. search, cluser or classify!

The note book for working with services are available here.

About

Information Retrieval Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages