Adding code to get sentence representations #5

ducdauge · 2017-03-04T16:46:33Z

I modified 2 files and added a new one.

desent.py has a new function, embedding, to get representations given a trained model and an input file with one sentence per line. Also, I fixed a minor problems with numpy.round(), which yielded a float rather than an integer.
build_dictionary.py now uses codecs tho handle input and output files. Otherwise, it considers only ansi-encoded characters.
sentence_representation.py is a wrapper for convenience's sake. It invokes the relevant functions in desent.py and allows to modify the configuration easily.

The function embedding (and its sub-routines) allow to get sentence representations given a trained model and an input file with one sentence per line. A minor fix concerns numpy.round(), whose output is cast into an integer.

A wrapper file to invoke the embedding function in desent.py

ducdauge added 3 commits March 4, 2017 16:28

Handling utf-8 encoding

ba48308

Create sentence_representation.py

fc9d10c

A wrapper file to invoke the embedding function in desent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding code to get sentence representations #5

Adding code to get sentence representations #5

ducdauge commented Mar 4, 2017

Adding code to get sentence representations #5

Are you sure you want to change the base?

Adding code to get sentence representations #5

Conversation

ducdauge commented Mar 4, 2017