Final Year Project Group RAYW4: A Knowledge Graph-based Recommendation Website for Computer Science Learners

Web crawling (`crawling` subfolder)

The crawling code follows the official Selenium WebDriver documentation: https://www.selenium.dev/documentation/webdriver/

Crawling_final.ipynb: Scrapes the HTML source code of articles from websites using Selenium
html2txt.ipynb: Extracts the text and headings from the HTML source code into .txt files
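
The text and heading extraction step can be sketched as below. This is a minimal illustration using only the Python standard library, not the code in html2txt.ipynb; the names `ArticleTextExtractor` and `html_to_text` are hypothetical, and the choice to treat `h1` to `h6` as headings and to skip `script` and `style` content is an assumption.

```python
from html.parser import HTMLParser

# Assumption: headings are the h1-h6 tags; script/style content is not article text.
HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}
SKIP_TAGS = {"script", "style"}


class ArticleTextExtractor(HTMLParser):
    """Collects heading strings and body text from an HTML document."""

    def __init__(self):
        super().__init__()
        self.headings = []      # one entry per heading element
        self.text_parts = []    # non-heading text fragments, in document order
        self._in_heading = False
        self._heading_buf = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag in HEADING_TAGS:
            self._in_heading = True
            self._heading_buf = []

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag in HEADING_TAGS and self._in_heading:
            # Join the fragments of the heading and collapse whitespace.
            heading = " ".join("".join(self._heading_buf).split())
            if heading:
                self.headings.append(heading)
            self._in_heading = False

    def handle_data(self, data):
        if self._skip_depth:
            return
        if self._in_heading:
            self._heading_buf.append(data)
        else:
            cleaned = " ".join(data.split())
            if cleaned:
                self.text_parts.append(cleaned)


def html_to_text(html):
    """Return (headings, body_text) for one HTML page."""
    parser = ArticleTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.headings, "\n".join(parser.text_parts)
```

Calling `html_to_text(page_source)` on the HTML saved by the crawler returns the list of headings and the body text, which can then be written to a .txt file.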

Entity extraction (`entity_extraction` subfolder)

Follow the instructions at https://github.com/stanfordnlp/stanza to install the stanza library with pip, then run the code. This subfolder includes two .py files:

corenlp.py: Extracts OpenIE triples and their wiki entities
corenlp_wiki.py: Tokenizes the article and extracts the tokens' wiki entities

Knowledge graph embeddings (`kg` subfolder)

This subfolder contains two schemes for obtaining knowledge graph embeddings, with reference to https://github.com/thunlp/Fast-TransX

wikidata_server.py: Server for requesting OpenKE WikiData KG entity embeddings
kg_preprocess.py: Preprocesses the data stream for training
Fast-TransX: THU C++ implementation for KG embedding training
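
As an illustration of the kind of preprocessing that training requires, the sketch below maps string triples to integer ids. It is not the code in kg_preprocess.py. It assumes the id-file layout described in the Fast-TransX README (a count on the first line, then one `name<TAB>id` pair per line for entities and relations, and `head tail relation` ids per line for training triples); the function names and the sample triples are hypothetical.

```python
def build_id_maps(triples):
    """Assign consecutive integer ids to entities and relations.

    triples: iterable of (head, relation, tail) strings.
    Returns (entity2id, relation2id, rows), where each row is
    (head_id, tail_id, relation_id).
    """
    entity2id, relation2id, rows = {}, {}, []
    for head, relation, tail in triples:
        # setdefault keeps the existing id, or assigns the next free one.
        h = entity2id.setdefault(head, len(entity2id))
        t = entity2id.setdefault(tail, len(entity2id))
        r = relation2id.setdefault(relation, len(relation2id))
        rows.append((h, t, r))
    return entity2id, relation2id, rows


def format_vocab(mapping):
    """Render an entity or relation map: count line, then 'name<TAB>id' lines."""
    lines = [str(len(mapping))]
    lines += [f"{name}\t{idx}" for name, idx in mapping.items()]
    return "\n".join(lines) + "\n"


def format_triples(rows):
    """Render training triples: count line, then 'head tail relation' id lines."""
    lines = [str(len(rows))]
    lines += [f"{h} {t} {r}" for h, t, r in rows]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical triples, for illustration only.
    sample = [
        ("binary search", "is a", "search algorithm"),
        ("search algorithm", "is a", "algorithm"),
    ]
    entities, relations, rows = build_id_maps(sample)
    print(format_vocab(entities), format_vocab(relations), format_triples(rows), sep="")
```

The three rendered strings would be written to the entity, relation, and training files that the trainer reads; check the file names and column order against the Fast-TransX version in use before training.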

Recommendation model - DKN (`dkn` subfolder)

data_loader.py: Data loader for the TensorFlow version
DKN.py: TensorFlow implementation of the DKN model
train.py: Training function for the TensorFlow implementation
main.py: Implementation of the training flow using Microsoft Recommenders

Backend Flask server (`flask_server` subfolder)

See the README in the subfolder for instructions on running the code.

Wix website (`wix` subfolder)

We wrote the frontend in the Wix online editor. This code is a copy from the editor, with each page saved separately in its own .js file.

home.js: HOME page
search.js: SEARCH page
collection.js: COLLECTION page
interests.js: INTERESTS page
signup.js: REGISTER page
login.js: LOGIN page
