Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add UTF-8 capabilities #3

Open
1 of 8 tasks
javierguerragiraldez opened this issue Apr 29, 2018 · 0 comments
Open
1 of 8 tasks

Add UTF-8 capabilities #3

javierguerragiraldez opened this issue Apr 29, 2018 · 0 comments

Comments

@javierguerragiraldez
Copy link
Member

Lua 5.3 already includes some:

  • '\u{XX...}' embeds the UTF-8 encoding in string literals.
  • %U in lua_pushfstring
  • utf8 library (for codepoint handling, no Unicode semantics)

surprisingly, it seems it doesn't include

  • %U in string.format

Other things that could be managed by a separate / optional library:

  • conversion between different encodings. (windows still uses some mixture of UCS2 and UTF16)
  • collation
  • normalization, case folding
  • text boundaries

The most obvious objection about including these capabilities with the language is the need of big tables. I think it would be valuable to evaluate what can the basic language do to make a binding as transparent as possible, without a hard dependency.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant