This repository has been archived by the owner on Feb 27, 2022. It is now read-only.
We are proud to announce the release of 1.0.0! ππ
What's new in this release (since v0.8.0)
- Introducing parallel processing to parse document files (-j)
- Stabilized search results (fixed wrong argument of infer_vector)
- Updating the algorithm for extracting headings
- Clearer warning message when pdf or docx file is encrypted
- Removing bugs and performance glitches
- Utilize optimized options of Janome tokenizer (-l ja)
Doc2Vec model files:
- en https://github.com/tos-kamiya/d2vg/releases/download/v0.8.0/enwiki-m700-c380-d100.tar.bz2
- ja https://github.com/tos-kamiya/d2vg/releases/download/v1.0.0-rc.2/jawiki-janome-m100-c400-d100.tar.bz2
- ko https://github.com/tos-kamiya/d2vg/releases/download/v0.8.0/kowiki-m50-c400-d100.tar.bz2
- zh https://github.com/tos-kamiya/d2vg/releases/download/v0.8.0/zhwiki-m100-c400-d100.tar.bz2
Full Changelog: v0.8.0...v1.0.0