This project is search on the web pages with five methods and query expansion:
- boolean
- TF-IDF
- A transformer-based model
- fasttext
After the search, you can apply link analysis on the results.
- pagerank
- hits
Also classification and clustring are available.
you can use different embeddings for different methods.
- Clone the repo.
git clone https://github.com/IR1401-Spring-Final-Projects/Web1401-10_38.git
- Install the requirements.
pip install -r requirements.txt
-
Get the available models from this link and extract them in the root of the project (next to manage.py)
-
Run the server
python manage.py runserver
- search, cluser or classify!
The note book for working with services are available here.