Unsupervised Technique for Glossary and Definition Extraction

Code Files

  1. GPT2-DefinitionModel.ipynb - GPT-2 model for definition generation.
  2. Data_Generator.ipynb - Data scraper for Goodreads and GradeSaver.
  3. Definition_Extraction.ipynb - WordNet model for definition generation.
  4. Glossary_Extraction.ipynb - Chinking-strategy pipeline for selecting glossary terms.
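
For readers unfamiliar with chinking, here is a minimal sketch of the general idea. It is not the code from Glossary_Extraction.ipynb. It assumes the input is already POS-tagged with Penn Treebank tags, and the `CHINK_TAGS` set, the `chink_terms` helper, and the sample sentence are all hypothetical, chosen only for illustration. The whole sentence starts as one chunk, and every token whose tag is in the chink set is removed, which splits the chunk into candidate glossary terms.

```python
# Assumed chink set: tags for determiners, prepositions, verbs, conjunctions,
# pronouns, adverbs, and punctuation. The notebook may use a different set.
CHINK_TAGS = {
    "DT", "IN", "CC", "TO", "MD", "PRP", "RB",
    "VB", "VBD", "VBG", "VBN", "VBP", "VBZ",
    ".", ",",
}


def chink_terms(tagged_tokens, chink_tags=CHINK_TAGS):
    """Treat the sentence as one chunk and split it at every chink token."""
    terms, current = [], []
    for word, tag in tagged_tokens:
        if tag in chink_tags:
            # A chink ends the current candidate term, if there is one.
            if current:
                terms.append(" ".join(current))
                current = []
        else:
            current.append(word)
    if current:
        terms.append(" ".join(current))
    return terms


# Hand-tagged example sentence (hypothetical data).
sentence = [
    ("The", "DT"), ("neural", "JJ"), ("network", "NN"), ("learns", "VBZ"),
    ("a", "DT"), ("language", "NN"), ("model", "NN"), ("from", "IN"),
    ("raw", "JJ"), ("text", "NN"), (".", "."),
]
candidates = chink_terms(sentence)
print(candidates)  # ['neural network', 'language model', 'raw text']
```

In practice the tags would come from a POS tagger rather than being written by hand, and the candidate terms would usually be filtered further (for example by frequency) before being added to a glossary.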

For more details on the project and its results, see the project presentation or read my blog.