You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello @lsmith77, the issue comes from the tokenizer defaults. It so happens that π¨ is one of the icons that are looked for during tokenization.
Given the root of the issue, I don't think we'll be able to much about it in the near future... However, you could customize your tokenizer to avoid splitting the emoji.
Currently
π¨π½βπ©π½βπ§π½
is handled as multiple tokens. Note this likely relate to carpedm20/emoji#204Ideally it would be handled as a single token.
The text was updated successfully, but these errors were encountered: