Bug on trigger with Cyrillic language #22

Closed
pepelazz opened this issue Feb 12, 2017 · 1 comment · Fixed by #48

pepelazz commented Feb 12, 2017

trigger

[*] beer [*]

works fine

but

[*] пиво [*]

doesn't work

Any idea how to solve this?

@kirsle kirsle added the unicode label Apr 1, 2017
Member

kirsle commented Apr 1, 2017

Unfortunately, this is a common problem across multiple implementations of RiveScript (e.g. aichaos/rivescript-js#147, aichaos/rivescript-python#37); with optionals in particular, the word-boundary regexp sequence \b doesn't work well with certain Unicode characters.
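To illustrate the `\b` behaviour described above, here is a minimal sketch in Go. It does not reproduce RiveScript's internal trigger-to-regexp conversion; the helper names `asciiBoundaryMatch` and `unicodeBoundaryMatch` are hypothetical and exist only for this example. It relies on the fact that Go's `regexp` package defines `\b` as an ASCII word boundary, so Cyrillic letters never count as word characters. The second helper shows one possible Unicode-aware substitute for `\b`, offered as an assumption and not as the fix adopted by the library.

```go
package main

import (
	"fmt"
	"regexp"
)

// asciiBoundaryMatch wraps word in \b...\b. In Go's regexp package, \b is an
// ASCII word boundary, so non-ASCII letters are never treated as word characters.
func asciiBoundaryMatch(word, message string) bool {
	re := regexp.MustCompile(`\b` + regexp.QuoteMeta(word) + `\b`)
	return re.MatchString(message)
}

// unicodeBoundaryMatch is a hypothetical alternative (not RiveScript's code):
// it requires the word to be preceded and followed by either the edge of the
// text or a character that is not a Unicode letter, digit, or underscore.
func unicodeBoundaryMatch(word, message string) bool {
	const notWord = `[^\p{L}\p{N}_]`
	re := regexp.MustCompile(`(?:^|` + notWord + `)` + regexp.QuoteMeta(word) + `(?:$|` + notWord + `)`)
	return re.MatchString(message)
}

func main() {
	fmt.Println(asciiBoundaryMatch("beer", "i like beer a lot"))    // true
	fmt.Println(asciiBoundaryMatch("пиво", "я люблю пиво очень"))   // false
	fmt.Println(unicodeBoundaryMatch("пиво", "я люблю пиво очень")) // true
}
```

The Unicode-aware pattern consumes the neighbouring characters instead of asserting a zero-width boundary, which is one reason it cannot simply be dropped in wherever `\b` is used today.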

I've determined that the best way to fix this is to allow authors to use literal regular expressions to work around the particular edge cases when they come up. I haven't been able to find a way to fix RiveScript's "simplified regexps" to support these edge cases without breaking the normal cases. However, none of the implementations have added the ~Regexp support yet.

I'll update and close this ticket when the Go version gains this feature.
