Skip to content

Commit 45949f8

Browse files
authored
Add confusables for various dandas and double dandas (#1224)
See unicode-org/properties#468
1 parent cdf83cb commit 45949f8

File tree

4 files changed

+118
-44
lines changed

4 files changed

+118
-44
lines changed

unicodetools/data/security/dev/confusables.txt

Lines changed: 28 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusables.txt
2-
# Date: 2025-10-11, 02:30:37 GMT
2+
# Date: 2025-10-17, 00:06:13 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -573,14 +573,8 @@ A78F ; 00B7 ; MA # ( ꞏ → · ) LATIN LETTER SINOLOGICAL DOT → MIDDLE DOT #
573573

574574
18C2 ; 00B7 18C0 ; MA # ( ᣂ → ·ᣀ ) CANADIAN SYLLABICS SHWOY → MIDDLE DOT, CANADIAN SYLLABICS SHOY # →ᐧᣀ→
575575

576-
A830 ; 0964 ; MA #* ( ꠰ → । ) NORTH INDIC FRACTION ONE QUARTER → DEVANAGARI DANDA #
577-
578-
0965 ; 0964 0964 ; MA #* ( ॥ → ।। ) DEVANAGARI DOUBLE DANDA → DEVANAGARI DANDA, DEVANAGARI DANDA #
579-
580576
1C3C ; 1C3B 1C3B ; MA #* ( ᰼ → ᰻᰻ ) LEPCHA PUNCTUATION NYET THYOOM TA-ROL → LEPCHA PUNCTUATION TA-ROL, LEPCHA PUNCTUATION TA-ROL #
581577

582-
104B ; 104A 104A ; MA #* ( ။ → ၊၊ ) MYANMAR SIGN SECTION → MYANMAR SIGN LITTLE SECTION, MYANMAR SIGN LITTLE SECTION #
583-
584578
1AA9 ; 1AA8 1AA8 ; MA #* ( ᪩ → ᪨᪨ ) TAI THAM SIGN KAANKUU → TAI THAM SIGN KAAN, TAI THAM SIGN KAAN #
585579

586580
1AAB ; 1AAA 1AA8 ; MA #* ( ᪫ → ᪪᪨ ) TAI THAM SIGN SATKAANKUU → TAI THAM SIGN SATKAAN, TAI THAM SIGN KAAN #
@@ -589,12 +583,6 @@ A830 ; 0964 ; MA #* ( ꠰ → । ) NORTH INDIC FRACTION ONE QUARTER → DEVANAG
589583

590584
10A57 ; 10A56 10A56 ; MA #* ( ‎𐩗‎ → ‎𐩖𐩖‎ ) KHAROSHTHI PUNCTUATION DOUBLE DANDA → KHAROSHTHI PUNCTUATION DANDA, KHAROSHTHI PUNCTUATION DANDA #
591585

592-
1144C ; 1144B 1144B ; MA #* ( 𑑌 → 𑑋𑑋 ) NEWA DOUBLE DANDA → NEWA DANDA, NEWA DANDA #
593-
594-
11642 ; 11641 11641 ; MA #* ( 𑙂 → 𑙁𑙁 ) MODI DOUBLE DANDA → MODI DANDA, MODI DANDA #
595-
596-
11C42 ; 11C41 11C41 ; MA #* ( 𑱂 → 𑱁𑱁 ) BHAIKSUKI DOUBLE DANDA → BHAIKSUKI DANDA, BHAIKSUKI DANDA #
597-
598586
1C7F ; 1C7E 1C7E ; MA #* ( ᱿ → ᱾᱾ ) OL CHIKI PUNCTUATION DOUBLE MUCAAD → OL CHIKI PUNCTUATION MUCAAD, OL CHIKI PUNCTUATION MUCAAD #
599587

600588
055D ; 0027 ; MA #* ( ՝ → ' ) ARMENIAN COMMA → APOSTROPHE # →ˋ→→`→→‘→
@@ -2615,7 +2603,20 @@ A740 ; 004B 0335 ; MA # ( Ꝁ → K̵ ) LATIN CAPITAL LETTER K WITH STROKE → L
26152603

26162604
0198 ; 004B 0027 ; MA # ( Ƙ → K' ) LATIN CAPITAL LETTER K WITH HOOK → LATIN CAPITAL LETTER K, APOSTROPHE # →Kʽ→
26172605

2606+
0964 ; 006C ; MA #* ( । → l ) DEVANAGARI DANDA → LATIN SMALL LETTER L # →|→
2607+
A8CE ; 006C ; MA #* ( ꣎ → l ) SAURASHTRA DANDA → LATIN SMALL LETTER L # →|→
2608+
104A ; 006C ; MA #* ( ၊ → l ) MYANMAR SIGN LITTLE SECTION → LATIN SMALL LETTER L # →|→
2609+
AA5D ; 006C ; MA #* ( ꩝ → l ) CHAM PUNCTUATION DANDA → LATIN SMALL LETTER L # →|→
2610+
11047 ; 006C ; MA #* ( 𑁇 → l ) BRAHMI DANDA → LATIN SMALL LETTER L # →|→
2611+
110C0 ; 006C ; MA #* ( 𑃀 → l ) KAITHI DANDA → LATIN SMALL LETTER L # →|→
2612+
11141 ; 006C ; MA #* ( 𑅁 → l ) CHAKMA DANDA → LATIN SMALL LETTER L # →|→
2613+
111C5 ; 006C ; MA #* ( 𑇅 → l ) SHARADA DANDA → LATIN SMALL LETTER L # →|→
2614+
113D4 ; 006C ; MA #* ( 𑏔 → l ) TULU-TIGALARI DANDA → LATIN SMALL LETTER L # →|→
2615+
1144B ; 006C ; MA #* ( 𑑋 → l ) NEWA DANDA → LATIN SMALL LETTER L # →|→
2616+
11641 ; 006C ; MA #* ( 𑙁 → l ) MODI DANDA → LATIN SMALL LETTER L # →|→
2617+
11C41 ; 006C ; MA #* ( 𑱁 → l ) BHAIKSUKI DANDA → LATIN SMALL LETTER L # →|→
26182618
05C0 ; 006C ; MA #* ( ‎׀‎ → l ) HEBREW PUNCTUATION PASEQ → LATIN SMALL LETTER L # →|→
2619+
115C5 ; 006C ; MA #* ( 𑗅 → l ) SIDDHAM SEPARATOR BAR → LATIN SMALL LETTER L # →|→
26192620
007C ; 006C ; MA #* ( | → l ) VERTICAL LINE → LATIN SMALL LETTER L #
26202621
2223 ; 006C ; MA #* ( ∣ → l ) DIVIDES → LATIN SMALL LETTER L # →ǀ→
26212622
23FD ; 006C ; MA #* ( ⏽ → l ) POWER ON SYMBOL → LATIN SMALL LETTER L # →I→
@@ -2633,6 +2634,7 @@ FFE8 ; 006C ; MA #* ( │ → l ) HALFWIDTH FORMS LIGHT VERTICAL → LATIN SMALL
26332634
1D7ED ; 006C ; MA # ( 𝟭 → l ) MATHEMATICAL SANS-SERIF BOLD DIGIT ONE → LATIN SMALL LETTER L # →1→
26342635
1D7F7 ; 006C ; MA # ( 𝟷 → l ) MATHEMATICAL MONOSPACE DIGIT ONE → LATIN SMALL LETTER L # →1→
26352636
1FBF1 ; 006C ; MA # ( 🯱 → l ) SEGMENTED DIGIT ONE → LATIN SMALL LETTER L # →1→
2637+
A830 ; 006C ; MA #* ( ꠰ → l ) NORTH INDIC FRACTION ONE QUARTER → LATIN SMALL LETTER L # →।→→|→
26362638
0049 ; 006C ; MA # ( I → l ) LATIN CAPITAL LETTER I → LATIN SMALL LETTER L #
26372639
FF29 ; 006C ; MA # ( I → l ) FULLWIDTH LATIN CAPITAL LETTER I → LATIN SMALL LETTER L # →Ӏ→
26382640
2160 ; 006C ; MA # ( Ⅰ → l ) ROMAN NUMERAL ONE → LATIN SMALL LETTER L # →Ӏ→
@@ -2694,6 +2696,7 @@ A4F2 ; 006C ; MA # ( ꓲ → l ) LISU LETTER I → LATIN SMALL LETTER L # →I
26942696
16F28 ; 006C ; MA # ( 𖼨 → l ) MIAO LETTER GHA → LATIN SMALL LETTER L # →I→
26952697
1028A ; 006C ; MA # ( 𐊊 → l ) LYCIAN LETTER J → LATIN SMALL LETTER L # →I→
26962698
10309 ; 006C ; MA # ( 𐌉 → l ) OLD ITALIC LETTER I → LATIN SMALL LETTER L # →I→
2699+
16D63 ; 006C ; MA # ( 𖵣 → l ) KIRAT RAI VOWEL SIGN AA → LATIN SMALL LETTER L # →|→
26972700

26982701
1D22A ; 004C ; MA #* ( 𝈪 → L ) GREEK INSTRUMENTAL NOTATION SYMBOL-23 → LATIN CAPITAL LETTER L #
26992702
216C ; 004C ; MA # ( Ⅼ → L ) ROMAN NUMERAL FIFTY → LATIN CAPITAL LETTER L #
@@ -2811,6 +2814,17 @@ FE87 ; 006C 0655 ; MA # ( ‎ﺇ‎ → lٕ ) ARABIC LETTER ALEF WITH HAMZA BELO
28112814

28122815
01C7 ; 004C 004A ; MA # ( LJ → LJ ) LATIN CAPITAL LETTER LJ → LATIN CAPITAL LETTER L, LATIN CAPITAL LETTER J #
28132816

2817+
0965 ; 006C 006C ; MA #* ( ॥ → ll ) DEVANAGARI DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2818+
A8CF ; 006C 006C ; MA #* ( ꣏ → ll ) SAURASHTRA DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2819+
104B ; 006C 006C ; MA #* ( ။ → ll ) MYANMAR SIGN SECTION → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2820+
11048 ; 006C 006C ; MA #* ( 𑁈 → ll ) BRAHMI DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2821+
110C1 ; 006C 006C ; MA #* ( 𑃁 → ll ) KAITHI DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2822+
11142 ; 006C 006C ; MA #* ( 𑅂 → ll ) CHAKMA DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2823+
111C6 ; 006C 006C ; MA #* ( 𑇆 → ll ) SHARADA DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2824+
113D5 ; 006C 006C ; MA #* ( 𑏕 → ll ) TULU-TIGALARI DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2825+
1144C ; 006C 006C ; MA #* ( 𑑌 → ll ) NEWA DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2826+
11642 ; 006C 006C ; MA #* ( 𑙂 → ll ) MODI DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
2827+
11C42 ; 006C 006C ; MA #* ( 𑱂 → ll ) BHAIKSUKI DOUBLE DANDA → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
28142828
2016 ; 006C 006C ; MA #* ( ‖ → ll ) DOUBLE VERTICAL LINE → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →∥→→||→
28152829
2225 ; 006C 006C ; MA #* ( ∥ → ll ) PARALLEL TO → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →||→
28162830
2161 ; 006C 006C ; MA # ( Ⅱ → ll ) ROMAN NUMERAL TWO → LATIN SMALL LETTER L, LATIN SMALL LETTER L # →II→
@@ -9964,5 +9978,5 @@ FACE ; 9F9C ; MA # ( 龜 → 龜 ) CJK COMPATIBILITY IDEOGRAPH-FACE → CJK UNIF
99649978

99659979
2FD5 ; 9FA0 ; MA #* ( ⿕ → 龠 ) KANGXI RADICAL FLUTE → CJK UNIFIED IDEOGRAPH-9FA0 #
99669980

9967-
# total: 6562
9981+
# total: 6582
99689982

unicodetools/data/security/dev/confusablesSummary.txt

Lines changed: 35 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusablesSummary.txt
2-
# Date: 2025-10-11, 02:30:37 GMT
2+
# Date: 2025-10-17, 00:06:13 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1057,9 +1057,12 @@
10571057
← (‎ 𑷠点 ‎) 11DE0 70B9 TOLONG SIKI DIGIT ZERO, CJK UNIFIED IDEOGRAPH-70B9
10581058
← (‎ ㍘ ‎) 3358 IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR ZERO
10591059

1060-
# l 𑷚 𑷡 𖺪 I 1 | Ɩ ǀ ӏ ו ן ا ١ ۱ Ι І Ӏ ߊ ᛁ Ⲓ ⵏ ꓲ 𐊊 𐌉 𖼨 ׀ ∣ 𐌠 𞣇 ⏽ 🯱 𜳱 𜳞 Ⅰ ⅼ I l ℐ ℑ ℓ 𞸀 𞺀 ﺍ ﺎ 𝐈 𝐥 𝐼 𝑙 𝑰 𝒍 𝓁 𝓘 𝓵 𝔩 𝕀 𝕝 𝕴 𝖑 𝖨 𝗅 𝗜 𝗹 𝘐 𝘭 𝙄 𝙡 𝙸 𝚕 𝚰 𝛪 𝜤 𝝞 𝞘 𝟏 𝟙 𝟣 𝟭 𝟷 │
1060+
# l । 𖵣 𑏔 𑷚 𑷡 𖺪 I 1 | Ɩ ǀ ӏ ו ן ا ١ ۱ Ι І Ӏ ߊ ᛁ Ⲓ ⵏ ꓲ 𐊊 𐌉 𖼨 ׀ ၊ ∣ ꠰ ꣎ ꩝ 𐌠 𑁇 𑃀 𑅁 𑇅 𑗅 𑙁 𞣇 ⏽ 𑑋 𑱁 🯱 𜳱 𜳞 Ⅰ ⅼ I l ℐ ℑ ℓ 𞸀 𞺀 ﺍ ﺎ 𝐈 𝐥 𝐼 𝑙 𝑰 𝒍 𝓁 𝓘 𝓵 𝔩 𝕀 𝕝 𝕴 𝖑 𝖨 𝗅 𝗜 𝗹 𝘐 𝘭 𝙄 𝙡 𝙸 𝚕 𝚰 𝛪 𝜤 𝝞 𝞘 𝟏 𝟙 𝟣 𝟭 𝟷 │
10611061
(‎ 1 ‎) 0031 DIGIT ONE
10621062
← (‎ l ‎) 006C LATIN SMALL LETTER L
1063+
← (‎ । ‎) 0964 DEVANAGARI DANDA # →|→→l→
1064+
← (‎ 𖵣 ‎) 16D63 KIRAT RAI VOWEL SIGN AA # →|→→l→
1065+
← (‎ 𑏔 ‎) 113D4 TULU-TIGALARI DANDA # →|→→l→
10631066
← (‎ 𑷚 ‎) 11DDA TOLONG SIKI SIGN HECAKA # →|→→l→
10641067
← (‎ 𑷡 ‎) 11DE1 TOLONG SIKI DIGIT ONE # →|→→l→
10651068
← (‎ 𖺪 ‎) 16EAA BERIA ERFE CAPITAL LETTER LAKKO # →I→
@@ -1085,10 +1088,22 @@
10851088
← (‎ 𐌉 ‎) 10309 OLD ITALIC LETTER I # →I→
10861089
← (‎ 𖼨 ‎) 16F28 MIAO LETTER GHA # →I→
10871090
← (‎ ׀ ‎) 05C0 HEBREW PUNCTUATION PASEQ # →|→→l→
1091+
← (‎ ၊ ‎) 104A MYANMAR SIGN LITTLE SECTION # →|→→l→
10881092
← (‎ ∣ ‎) 2223 DIVIDES # →ǀ→→I→
1093+
← (‎ ꠰ ‎) A830 NORTH INDIC FRACTION ONE QUARTER # →।→→|→→l→
1094+
← (‎ ꣎ ‎) A8CE SAURASHTRA DANDA # →|→→l→
1095+
← (‎ ꩝ ‎) AA5D CHAM PUNCTUATION DANDA # →|→→l→
10891096
← (‎ 𐌠 ‎) 10320 OLD ITALIC NUMERAL ONE # →𐌉→→I→
1097+
← (‎ 𑁇 ‎) 11047 BRAHMI DANDA # →|→→l→
1098+
← (‎ 𑃀 ‎) 110C0 KAITHI DANDA # →|→→l→
1099+
← (‎ 𑅁 ‎) 11141 CHAKMA DANDA # →|→→l→
1100+
← (‎ 𑇅 ‎) 111C5 SHARADA DANDA # →|→→l→
1101+
← (‎ 𑗅 ‎) 115C5 SIDDHAM SEPARATOR BAR # →|→→l→
1102+
← (‎ 𑙁 ‎) 11641 MODI DANDA # →|→→l→
10901103
← (‎ 𞣇 ‎) 1E8C7 MENDE KIKAKUI DIGIT ONE # →l→
10911104
← (‎ ⏽ ‎) 23FD POWER ON SYMBOL # →I→
1105+
← (‎ 𑑋 ‎) 1144B NEWA DANDA # →|→→l→
1106+
← (‎ 𑱁 ‎) 11C41 BHAIKSUKI DANDA # →|→→l→
10921107
← (‎ 🯱 ‎) 1FBF1 SEGMENTED DIGIT ONE
10931108
← (‎ 𜳱 ‎) 1CCF1 OUTLINED DIGIT ONE
10941109
← (‎ 𜳞 ‎) 1CCDE OUTLINED LATIN CAPITAL LETTER I # →I→
@@ -1179,17 +1194,32 @@
11791194
← (‎ l𑷠点 ‎) 006C 11DE0 70B9 LATIN SMALL LETTER L, TOLONG SIKI DIGIT ZERO, CJK UNIFIED IDEOGRAPH-70B9
11801195
← (‎ ㍢ ‎) 3362 IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR TEN
11811196

1182-
# ll 𑷚𑷚 II 11 || וו ǁ װ ‖ ∥ Ⅱ
1197+
# ll ।। ၊၊ 𑙁𑙁 𑑋𑑋 𑱁𑱁 II 11 || וו 𑏕 ǁ װ ॥ ။ ‖ ∥ ꣏ 𑁈 𑃁 𑅂 𑇆 𑙂 𑑌 𑱂
11831198
(‎ 11 ‎) 0031 0031 DIGIT ONE, DIGIT ONE
11841199
← (‎ ll ‎) 006C 006C LATIN SMALL LETTER L, LATIN SMALL LETTER L # →‎וו‎→
1185-
← (‎ 𑷚𑷚 ‎) 11DDA 11DDA TOLONG SIKI SIGN HECAKA, TOLONG SIKI SIGN HECAKA # →||→→ll→→‎וו‎→
1200+
← (‎ ।। ‎) 0964 0964 DEVANAGARI DANDA, DEVANAGARI DANDA # →॥→→||→→ll→→‎וו‎→
1201+
← (‎ ၊၊ ‎) 104A 104A MYANMAR SIGN LITTLE SECTION, MYANMAR SIGN LITTLE SECTION # →။→→||→→ll→→‎וו‎→
1202+
← (‎ 𑙁𑙁 ‎) 11641 11641 MODI DANDA, MODI DANDA # →𑙂→→||→→ll→→‎וו‎→
1203+
← (‎ 𑑋𑑋 ‎) 1144B 1144B NEWA DANDA, NEWA DANDA # →𑑌→→||→→ll→→‎וו‎→
1204+
← (‎ 𑱁𑱁 ‎) 11C41 11C41 BHAIKSUKI DANDA, BHAIKSUKI DANDA # →𑱂→→||→→ll→→‎וו‎→
11861205
← (‎ II ‎) 0049 0049 LATIN CAPITAL LETTER I, LATIN CAPITAL LETTER I # →ll→→‎וו‎→
11871206
← (‎ || ‎) 007C 007C VERTICAL LINE, VERTICAL LINE # →ll→→‎וו‎→
11881207
← (‎ וו ‎) 05D5 05D5 HEBREW LETTER VAV, HEBREW LETTER VAV
1208+
← (‎ 𑏕 ‎) 113D5 TULU-TIGALARI DOUBLE DANDA # →||→→ll→→‎וו‎→
11891209
← (‎ ǁ ‎) 01C1 LATIN LETTER LATERAL CLICK # →‖→→∥→→||→→ll→→‎וו‎→
11901210
← (‎ װ ‎) 05F0 HEBREW LIGATURE YIDDISH DOUBLE VAV # →‎וו‎→
1211+
← (‎ ॥ ‎) 0965 DEVANAGARI DOUBLE DANDA # →||→→ll→→‎וו‎→
1212+
← (‎ ။ ‎) 104B MYANMAR SIGN SECTION # →||→→ll→→‎וו‎→
11911213
← (‎ ‖ ‎) 2016 DOUBLE VERTICAL LINE # →∥→→||→→ll→→‎וו‎→
11921214
← (‎ ∥ ‎) 2225 PARALLEL TO # →||→→ll→→‎וו‎→
1215+
← (‎ ꣏ ‎) A8CF SAURASHTRA DOUBLE DANDA # →||→→ll→→‎וו‎→
1216+
← (‎ 𑁈 ‎) 11048 BRAHMI DOUBLE DANDA # →||→→ll→→‎וו‎→
1217+
← (‎ 𑃁 ‎) 110C1 KAITHI DOUBLE DANDA # →||→→ll→→‎וו‎→
1218+
← (‎ 𑅂 ‎) 11142 CHAKMA DOUBLE DANDA # →||→→ll→→‎וו‎→
1219+
← (‎ 𑇆 ‎) 111C6 SHARADA DOUBLE DANDA # →||→→ll→→‎וו‎→
1220+
← (‎ 𑙂 ‎) 11642 MODI DOUBLE DANDA # →||→→ll→→‎וו‎→
1221+
← (‎ 𑑌 ‎) 1144C NEWA DOUBLE DANDA # →||→→ll→→‎וו‎→
1222+
← (‎ 𑱂 ‎) 11C42 BHAIKSUKI DOUBLE DANDA # →||→→ll→→‎וו‎→
11931223
← (‎ Ⅱ ‎) 2161 ROMAN NUMERAL TWO # →II→→ll→→‎וו‎→
11941224

11951225
# ll. 11. ⒒
@@ -8738,14 +8768,6 @@
87388768
← (‎ ੍ ‎) 0A4D GURMUKHI SIGN VIRAMA
87398769
← (‎ ્ ‎) 0ACD GUJARATI SIGN VIRAMA
87408770

8741-
# । ꠰
8742-
(‎ । ‎) 0964 DEVANAGARI DANDA
8743-
← (‎ ꠰ ‎) A830 NORTH INDIC FRACTION ONE QUARTER
8744-
8745-
# ।। ॥
8746-
(‎ ।। ‎) 0964 0964 DEVANAGARI DANDA, DEVANAGARI DANDA
8747-
← (‎ ॥ ‎) 0965 DEVANAGARI DOUBLE DANDA
8748-
87498771
# २ ર ૨
87508772
(‎ २ ‎) 0968 DEVANAGARI DIGIT TWO
87518773
← (‎ ર ‎) 0AB0 GUJARATI LETTER RA # →૨→
@@ -9631,10 +9653,6 @@
96319653
(‎ ၁ ‎) 1041 MYANMAR DIGIT ONE
96329654
← (‎ ၥ ‎) 1065 MYANMAR LETTER WESTERN PWO KAREN THA
96339655

9634-
# ၊၊ ။
9635-
(‎ ၊၊ ‎) 104A 104A MYANMAR SIGN LITTLE SECTION, MYANMAR SIGN LITTLE SECTION
9636-
← (‎ ။ ‎) 104B MYANMAR SIGN SECTION
9637-
96389656
# ၽှ ၾ
96399657
(‎ ၽှ ‎) 107D 103E MYANMAR LETTER SHAN PHA, MYANMAR CONSONANT SIGN MEDIAL HA
96409658
← (‎ ၾ ‎) 107E MYANMAR LETTER SHAN FA
@@ -17243,10 +17261,6 @@
1724317261
(‎ 𑐯 ‎) 1142F NEWA LETTER LHA
1724417262
← (‎ 𑐴𑑂𑐮 ‎) 11434 11442 1142E NEWA LETTER HA, NEWA SIGN VIRAMA, NEWA LETTER LA
1724517263

17246-
# 𑑋𑑋 𑑌
17247-
(‎ 𑑋𑑋 ‎) 1144B 1144B NEWA DANDA, NEWA DANDA
17248-
← (‎ 𑑌 ‎) 1144C NEWA DOUBLE DANDA
17249-
1725017264
# 𑖂 𑗘 𑗙
1725117265
(‎ 𑖂 ‎) 11582 SIDDHAM LETTER I
1725217266
← (‎ 𑗘 ‎) 115D8 SIDDHAM LETTER THREE-CIRCLE ALTERNATE I
@@ -17268,10 +17282,6 @@
1726817282
(‎ 𑖳 ‎) 115B3 SIDDHAM VOWEL SIGN UU
1726917283
← (‎ 𑗝 ‎) 115DD SIDDHAM VOWEL SIGN ALTERNATE UU
1727017284

17271-
# 𑙁𑙁 𑙂
17272-
(‎ 𑙁𑙁 ‎) 11641 11641 MODI DANDA, MODI DANDA
17273-
← (‎ 𑙂 ‎) 11642 MODI DOUBLE DANDA
17274-
1727517285
# 𑫥𑫥 𑫨
1727617286
(‎ 𑫥𑫥 ‎) 11AE5 11AE5 PAU CIN HAU RISING TONE LONG, PAU CIN HAU RISING TONE LONG
1727717287
← (‎ 𑫨 ‎) 11AE8 PAU CIN HAU RISING TONE LONG FINAL
@@ -17329,10 +17339,6 @@
1732917339
← (‎ 𑫳𑫵 ‎) 11AF3 11AF5 PAU CIN HAU LOW-FALLING TONE LONG, PAU CIN HAU GLOTTAL STOP
1733017340
← (‎ 𑫸 ‎) 11AF8 PAU CIN HAU GLOTTAL STOP FINAL # →𑫳𑫵→
1733117341

17332-
# 𑱁𑱁 𑱂
17333-
(‎ 𑱁𑱁 ‎) 11C41 11C41 BHAIKSUKI DANDA, BHAIKSUKI DANDA
17334-
← (‎ 𑱂 ‎) 11C42 BHAIKSUKI DOUBLE DANDA
17335-
1733617342
# 𑲪 𑲲
1733717343
(‎ 𑲪 ‎) 11CAA MARCHEN SUBJOINED LETTER RA
1733817344
← (‎ 𑲲 ‎) 11CB2 MARCHEN VOWEL SIGN U
@@ -17798,5 +17804,5 @@
1779817804
(‎ 𪘀 ‎) 2A600 CJK UNIFIED IDEOGRAPH-2A600
1779917805
← (‎ 𪘀 ‎) 2FA1D CJK COMPATIBILITY IDEOGRAPH-2FA1D
1780017806

17801-
# total : 7606
17807+
# total : 7630
1780217808

unicodetools/data/security/dev/data/source/confusables-source.txt

Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5759,3 +5759,30 @@ A7F1 ; 02E2 # ( ꟱ → ˢ ) MODIFIER LETTER CAPITAL S → MODIFIER LETTER SMAL
57595759

57605760
# Confusables data for U+00A1 INVERTED EXCLAMATION MARK (PAG ref #453)
57615761
00A1 ; 0069
5762+
5763+
# Confusables data for dandas and double dandas (PAG ref #468)
5764+
0964 ; 007C # DEVANAGARI DANDA
5765+
104A ; 007C # MYANMAR SIGN LITTLE SECTION
5766+
A8CE ; 007C # SAURASHTRA DANDA
5767+
11047 ; 007C # BRAHMI DANDA
5768+
110C0 ; 007C # KAITHI DANDA
5769+
11141 ; 007C # CHAKMA DANDA
5770+
111C5 ; 007C # SHARADA DANDA
5771+
1144B ; 007C # NEWA DANDA
5772+
11641 ; 007C # MODI DANDA
5773+
11C41 ; 007C # BHAIKSUKI DANDA
5774+
AA5D ; 007C # CHAM PUNCTUATION DANDA
5775+
113D4 ; 007C # TULU-TIGALARI DANDA
5776+
115C5 ; 007C # SIDDHAM SEPARATOR BAR
5777+
16D63 ; 007C # KIRAT RAI VOWEL SIGN AA
5778+
0965 ; 007C 007C # DEVANAGARI DOUBLE DANDA
5779+
104B ; 007C 007C # MYANMAR SIGN SECTION
5780+
A8CF ; 007C 007C # SAURASHTRA DOUBLE DANDA
5781+
11048 ; 007C 007C # BRAHMI DOUBLE DANDA
5782+
110C1 ; 007C 007C # KAITHI DOUBLE DANDA
5783+
11142 ; 007C 007C # CHAKMA DOUBLE DANDA
5784+
111C6 ; 007C 007C # SHARADA DOUBLE DANDA
5785+
1144C ; 007C 007C # NEWA DOUBLE DANDA
5786+
11642 ; 007C 007C # MODI DOUBLE DANDA
5787+
11C42 ; 007C 007C # BHAIKSUKI DOUBLE DANDA
5788+
113D5 ; 007C 007C # TULU-TIGALARI DOUBLE DANDA

0 commit comments

Comments
 (0)