Bug: Bad words with accented characters not getting detected #3

CookedApps · 2022-01-13T08:37:12Z

Hey,
I think I found a possible bug: Defining a bad word in a filter list with accented characters, will not filter the word if you write it exactly the same, but only when you normalize the characters first.

Example:

Define the filter with a custom bad word: const filter = new Filter({list: ["wörd"]});
Filtering the bad word will result in a false negative: filter.isUnclean("wörd") = false
Filtering with normalized characters will result in a false positive: filter.isUnclean("word") = true

Expected behavior:

filter.isUnclean("wörd") = true
filter.isUnclean("word") = false
And when defining a bad word without accents:
- const filter = new Filter({list: ["word"]});
- filter.isUnclean("word") = true
- filter.isUnclean("wörd") = true

The text was updated successfully, but these errors were encountered:

3chospirits · 2022-07-26T01:59:23Z

This filter is designed for only English. There are very little characters with accents that need to be censored out. In that case, using the non accented version of the filter would make things a lot easier. It's expected that when you load in the words it's already normalized.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug: Bad words with accented characters not getting detected #3

Bug: Bad words with accented characters not getting detected #3

CookedApps commented Jan 13, 2022 •

edited

Loading

3chospirits commented Jul 26, 2022

Bug: Bad words with accented characters not getting detected #3

Bug: Bad words with accented characters not getting detected #3

Comments

CookedApps commented Jan 13, 2022 • edited Loading

Example:

Expected behavior:

3chospirits commented Jul 26, 2022

CookedApps commented Jan 13, 2022 •

edited

Loading