You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently all links where the hostname/domain contains puny code triggers the warning/confirmation dialog.
The Problem
While this is good for English region it is bad for other regions that use a different font/script/alphabet.
For them there are many false positives with perfectly valid normal urls.
Other languages with their own scripts: Hebrew, Chinese, Khmer and so on.
emoji urls (though only some registrars and top level domains allow them)
To a lesser degree:
Germans because of the Umlaute, but as they are rarely used it's mostly fine
I don't know how big the problem really is, as internationalised urls are still relatively new and before you could only use ascii, many websites and companies still stick to ascii domains.
Currently all links where the hostname/domain contains puny code triggers the warning/confirmation dialog.
The Problem
While this is good for English region it is bad for other regions that use a different font/script/alphabet.
For them there are many false positives with perfectly valid normal urls.
Non-exhaustive list of Examples:
Update: https://en.wikipedia.org/wiki/.рф - is used much apparently
Proposed solution
For each language we support specify a list of allowed unicode ranges.
for each detected puny code link check if it fits into the allowed ranges for any language, if no warn the user.
for example:
Alternatives Considered
Testcases
Anyways the first step is to collect test cases.
The text was updated successfully, but these errors were encountered: