Doesn't understand context #29

Open

hwsamuel opened this issue Nov 27, 2020 · 1 comment
Comments

@hwsamuel

The library seems to work more like a dictionary lookup for swear words. For example, it correctly tags "fucking idiot" as negative, but it also tags "fucking awesome!" as negative. Maybe the training set's features were unigrams?

@menkotoglou

From my point of view, that happens because of the learning algorithm the library uses. Since each word is tokenized on its own, "fucking" gets a huge probability of being profane, because it is treated as profane in any context. For example, you cannot say "fucking awesome!" in a professional environment. If you placed "fucking awesome!" in clean_data.csv, you would label it as 1 (profane), not 0 (not profane).
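
To illustrate the point, here is a minimal sketch of why a unigram bag-of-words model cannot use context. The toy training data and the Naive Bayes classifier below are only stand-ins for whatever the library actually trains on, not its real pipeline:

```python
# Minimal sketch: unigram features only learn per-word evidence.
# The toy data and model here are illustrative assumptions, not the
# library's actual training set or algorithm.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

texts = [
    "fucking idiot",            # profane
    "you are a fucking moron",  # profane
    "what a fucking mess",      # profane
    "awesome work",             # clean
    "this is great",            # clean
    "really nice job",          # clean
]
labels = [1, 1, 1, 0, 0, 0]

# Unigram features: each word is counted on its own, so the model
# never sees the surrounding words.
vectorizer = CountVectorizer(ngram_range=(1, 1))
X = vectorizer.fit_transform(texts)
clf = MultinomialNB().fit(X, labels)

# "fucking" occurs only in the profane examples, so its weight dominates
# any sentence containing it, even "fucking awesome!".
tests = ["fucking idiot", "fucking awesome!"]
print(clf.predict(vectorizer.transform(tests)))  # [1 1]
```

With unigrams, "fucking awesome!" gets essentially the same score contribution from "fucking" as "fucking idiot" does, which matches the behaviour reported above. Adding bigram features or context-aware embeddings would be needed for the model to separate the two.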
