Works for paragraphs? #4

timpal0l · 2018-12-17T17:15:14Z

(not an issue - just a question)

Do you know if USE does the tokenization, and splits sentences. Or should the user make the tokenisation?

"hello world. this is some text. hello"
or
[['hello', 'world',], ['this', 'is', 'some', 'text'], ['hello']]

The text was updated successfully, but these errors were encountered:

choran · 2019-01-17T19:29:21Z

Sorry only getting to this now!
That is an interesting point, I looked into the ELMO module as well and it does have tokenisation as part of it so that may be something to use if you are interested in that. I have not played around with it as much as the sentence embedding but it does look like it could be really useful as well
Cheers

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Works for paragraphs? #4

Works for paragraphs? #4

timpal0l commented Dec 17, 2018

choran commented Jan 17, 2019

Works for paragraphs? #4

Works for paragraphs? #4

Comments

timpal0l commented Dec 17, 2018

choran commented Jan 17, 2019