doc2vec_java

A Java implementation of doc2vec (Le and Mikolov, ICML 2014), built on https://github.com/NLPchina/Word2VEC_java

Demo

src/test/Doc2VecTest.java

Requirements

Java 7 or above (this project was developed with Java 8). The input file should follow the format of file/amazon_docs.txt: one document per line.
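
As a rough illustration of the one-document-per-line input format, the sketch below reads such a file into a list of token lists. It is not part of this project's API: the class name `DocsReader`, the method `readDocs`, and the whitespace tokenization are all assumptions made for the example, and the project's own loader may tokenize differently.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not a class from this repository.
public class DocsReader {

    /**
     * Reads one document per line and splits each line on whitespace.
     * Blank lines are skipped. Whitespace tokenization is an assumption.
     */
    public static List<List<String>> readDocs(Reader source) throws IOException {
        List<List<String>> docs = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String trimmed = line.trim();
                if (trimmed.isEmpty()) {
                    continue; // an empty line carries no document
                }
                docs.add(Arrays.asList(trimmed.split("\\s+")));
            }
        }
        return docs;
    }

    public static void main(String[] args) throws IOException {
        // Use the file given on the command line, or a tiny in-memory sample.
        Reader source = args.length > 0
                ? Files.newBufferedReader(Paths.get(args[0]), StandardCharsets.UTF_8)
                : new StringReader("great book on history\nfast shipping\n");
        List<List<String>> docs = readDocs(source);
        for (int i = 0; i < docs.size(); i++) {
            System.out.println("doc " + i + ": " + docs.get(i).size() + " tokens");
        }
    }
}
```

Running it with a path such as file/amazon_docs.txt as the only argument prints the token count of each document, which is a quick way to check that a corpus has one document on every line.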

References

  1. Le, Quoc V., and Tomas Mikolov. "Distributed representations of sentences and documents." In International Conference on Machine Learning (ICML), 2014.
  2. Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. "Distributed representations of words and phrases and their compositionality." In Advances in Neural Information Processing Systems, pp. 3111-3119, 2013.