Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Text facet sort by name should use case & diacritic insensitive collation #482

Open
GoogleCodeExporter opened this issue Jan 26, 2016 · 1 comment

Comments

@GoogleCodeExporter
Copy link

Currently lowercase characters sort after all upper case characters so 'T' and 
't' are in wildly different spots and international characters collate at the 
very end so that 'Österreichische' is miles from the 'O's.

We should fold both case and diacritics to their base forms.

Original issue reported on code.google.com by tfmorris on 12 Nov 2011 at 7:51

@GoogleCodeExporter
Copy link
Author

r2371 makes the sorting order case insensitive, but Javascript doesn't appear 
to have a built-in diacritic folding method, so that'll be a little more work.

After I committed the "fix" I discovered that this may actually be a 
browser-specific bug/difference, but it doesn't appear that there's been much 
progress in fixing it, so we probably should assume that the current state is 
going to exist for a while.
http://code.google.com/p/v8/issues/detail?id=459

There's a code snippet here that can be used to scrub diacritics: 
http://lehelk.com/2011/05/06/script-to-remove-diacritics/

Original comment by tfmorris on 12 Nov 2011 at 8:33

  • Changed state: Started

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant