12e5b6d6dSopenharmony_ci# Copyright (C) 2016 and later: Unicode, Inc. and others. 22e5b6d6dSopenharmony_ci# License & terms of use: http://www.unicode.org/copyright.html 32e5b6d6dSopenharmony_ci# Copyright (c) 2001-2016 International Business Machines 42e5b6d6dSopenharmony_ci# Corporation and others. All Rights Reserved. 52e5b6d6dSopenharmony_ci# 62e5b6d6dSopenharmony_ci# RBBI Test Data 72e5b6d6dSopenharmony_ci# 82e5b6d6dSopenharmony_ci# File: rbbitst.txt 92e5b6d6dSopenharmony_ci# 102e5b6d6dSopenharmony_ci# The format of this file looks vaguely like some kind of xml-ish markup, 112e5b6d6dSopenharmony_ci# but it is NOT. The syntax is this.. 122e5b6d6dSopenharmony_ci# 132e5b6d6dSopenharmony_ci# <word> any following data is for word break testing 142e5b6d6dSopenharmony_ci# <sent> any following data is for sentence break testing 152e5b6d6dSopenharmony_ci# <line> any following data is for line break testing 162e5b6d6dSopenharmony_ci# <char> any following data is for char break testing 172e5b6d6dSopenharmony_ci# <title> any following data is for title break testing 182e5b6d6dSopenharmony_ci# <rules> rules ... </rules> following data is tested against these rules. 192e5b6d6dSopenharmony_ci# Applies until a following occurrence of <word>, <sent>, etc. or another <rules> 202e5b6d6dSopenharmony_ci# <locale locale_name> Switch to the named locale at the next occurrence of <word>, <sent>, etc. 212e5b6d6dSopenharmony_ci# <data> ... </data> test data. May span multiple lines. 222e5b6d6dSopenharmony_ci# <> Break position, status == 0 232e5b6d6dSopenharmony_ci# • Break position, status == 0 (Bullet, \u2022) 242e5b6d6dSopenharmony_ci# <nnn> Break position, status == nnn 252e5b6d6dSopenharmony_ci# \ Escape. Normal ICU unescape applied. 262e5b6d6dSopenharmony_ci# \ at end of line -> Line Continuation. Remove both the backslash and the new line 272e5b6d6dSopenharmony_ci# 282e5b6d6dSopenharmony_ci# In ICU4C, this test data is run by intltest, rbbi/RBBITest/TestExtended. 292e5b6d6dSopenharmony_ci# In ICU4J, this test data is run by com.ibm.icu.dev.test.rbbi.RBBITestExtended 302e5b6d6dSopenharmony_ci# 312e5b6d6dSopenharmony_ci# There are two copies of this file in the source repository, 322e5b6d6dSopenharmony_ci# [ICU4C] source/test/testdata/rbbitst.txt 332e5b6d6dSopenharmony_ci# [ICU4J] main/tests/core/src/com/ibm/icu/dev/test/rbbi/rbbitst.txt 342e5b6d6dSopenharmony_ci# 352e5b6d6dSopenharmony_ci# ICU4C's copy is the primary one. If any changes are made to ICU4J's copy, make sure they 362e5b6d6dSopenharmony_ci# are merged back into ICU4C's copy of the file, lest they get overwritten later. 372e5b6d6dSopenharmony_ci# TODO: figure out how to have a single copy of the file for use by both C and Java. 382e5b6d6dSopenharmony_ci 392e5b6d6dSopenharmony_ci 402e5b6d6dSopenharmony_ci# Temp debugging tests 412e5b6d6dSopenharmony_ci# 422e5b6d6dSopenharmony_ci<word> 432e5b6d6dSopenharmony_ci<data>• 442e5b6d6dSopenharmony_ci•</data> 452e5b6d6dSopenharmony_ci 462e5b6d6dSopenharmony_ci## FILTERED BREAK TESTS 472e5b6d6dSopenharmony_ci 482e5b6d6dSopenharmony_ci# (William Bradford, public domain. http://catalog.hathitrust.org/Record/008651224 ) - edited. 492e5b6d6dSopenharmony_ci<locale en> 502e5b6d6dSopenharmony_ci<sent> 512e5b6d6dSopenharmony_ci<data>\ 522e5b6d6dSopenharmony_ci•In the meantime Mr. •Weston arrived with his small ship, which he had now recovered. •Capt. •Gorges, who informed the Sgt. here that one purpose of his going east was to meet with Mr. •Weston, took this opportunity to call him to account for some abuses he had to lay to his charge.•</data> 532e5b6d6dSopenharmony_ci 542e5b6d6dSopenharmony_ci<locale en@ss=standard> 552e5b6d6dSopenharmony_ci<sent> 562e5b6d6dSopenharmony_ci<data>\ 572e5b6d6dSopenharmony_ci•In the meantime Mr. Weston arrived with his small ship, which he had now recovered. •Capt. Gorges, who informed the Sgt. here that one purpose of his going east was to meet with Mr. Weston, took this opportunity to call him to account for some abuses he had to lay to his charge.•</data> 582e5b6d6dSopenharmony_ci 592e5b6d6dSopenharmony_ci# This hits the case where "D." would match the end of "Ph.D.". 602e5b6d6dSopenharmony_ci<locale en@ss=standard> 612e5b6d6dSopenharmony_ci<sent> 622e5b6d6dSopenharmony_ci<data>\ 632e5b6d6dSopenharmony_ci•Doctor with a D. •As in, Ph.D., you know.•</data> 642e5b6d6dSopenharmony_ci 652e5b6d6dSopenharmony_ci# ICU-21459 logic error. 662e5b6d6dSopenharmony_ci<locale en@ss=standard> 672e5b6d6dSopenharmony_ci<sent> 682e5b6d6dSopenharmony_ci<data>•on. •But after a day in the arena sun, the metal feels hot enough to blister my hands.•</data> 692e5b6d6dSopenharmony_ci 702e5b6d6dSopenharmony_ci# same as root (unless some exceptions are added!) 712e5b6d6dSopenharmony_ci<locale tfg@ss=standard> 722e5b6d6dSopenharmony_ci<sent> 732e5b6d6dSopenharmony_ci<data>\ 742e5b6d6dSopenharmony_ci•In the meantime Mr. •Weston arrived with his small ship, which he had now recovered. •Capt. •Gorges, who informed the Sgt. here that one purpose of his going east was to meet with Mr. •Weston, took this opportunity to call him to account for some abuses he had to lay to his charge.•</data> 752e5b6d6dSopenharmony_ci 762e5b6d6dSopenharmony_ci# same as root (unless some exceptions are added!) 772e5b6d6dSopenharmony_ci<locale ja@ss=standard> 782e5b6d6dSopenharmony_ci<sent> 792e5b6d6dSopenharmony_ci<data>\ 802e5b6d6dSopenharmony_ci•In the meantime Mr. •Weston arrived with his small ship, which he had now recovered. •Capt. •Gorges, who informed the Sgt. here that one purpose of his going east was to meet with Mr. •Weston, took this opportunity to call him to account for some abuses he had to lay to his charge.•</data> 812e5b6d6dSopenharmony_ci 822e5b6d6dSopenharmony_ci## END FILTERED BREAK TESTS 832e5b6d6dSopenharmony_ci 842e5b6d6dSopenharmony_ci 852e5b6d6dSopenharmony_ci######################################################################################## 862e5b6d6dSopenharmony_ci# 872e5b6d6dSopenharmony_ci# 882e5b6d6dSopenharmony_ci# G r a p h e m e C l u s t e r T e s t s 892e5b6d6dSopenharmony_ci# 902e5b6d6dSopenharmony_ci# 912e5b6d6dSopenharmony_ci########################################################################################## 922e5b6d6dSopenharmony_ci<char> 932e5b6d6dSopenharmony_ci 942e5b6d6dSopenharmony_ci<data>•a•b•c• •,•\u0666•</data> # Quick Test 952e5b6d6dSopenharmony_ci<data>•\r•\r•\r\n•\r\n•\n•\r•</data> # don't break CR/LF 962e5b6d6dSopenharmony_ci 972e5b6d6dSopenharmony_ci# Always break after controls. Combining chars don't combine with them. 982e5b6d6dSopenharmony_ci<data>•\u0003•\N{COMBINING GRAVE ACCENT}•\r•\N{COMBINING GRAVE ACCENT}•</data> 992e5b6d6dSopenharmony_ci<data>•\u0085•\N{COMBINING MACRON}•A\N{COMBINING MACRON}•</data> 1002e5b6d6dSopenharmony_ci 1012e5b6d6dSopenharmony_ci# Surrogates 1022e5b6d6dSopenharmony_ci<data>•\U00011000•\U00010020•\U00010000\N{COMBINING MACRON}•</data> 1032e5b6d6dSopenharmony_ci<data>•\ud800\udc00•\udbff\udfff•a•</data> 1042e5b6d6dSopenharmony_ci 1052e5b6d6dSopenharmony_ci# Extend (Combining chars) combine. 1062e5b6d6dSopenharmony_ci<data>•A\N{COMBINING GRAVE ACCENT}•B•</data> 1072e5b6d6dSopenharmony_ci<data>•\N{GREEK SMALL LETTER MU}\N{COMBINING LOW LINE}\N{COMBINING HORN}•</data> 1082e5b6d6dSopenharmony_ci<data>•a\u0301•b\u0302•c\u0303•d\u0304•e\u0305•f\u0306•g\u0307•h\u0308•i\u0309•</data> 1092e5b6d6dSopenharmony_ci 1102e5b6d6dSopenharmony_ci<data>•a\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304\u0301\u0302\u0303\u0304•</data> 1112e5b6d6dSopenharmony_ci 1122e5b6d6dSopenharmony_ci# Don't break Hangul Syllables 1132e5b6d6dSopenharmony_ci# L : \u1100 1142e5b6d6dSopenharmony_ci# V : \u1161 1152e5b6d6dSopenharmony_ci# T : \u11A8 1162e5b6d6dSopenharmony_ci# LV : \uAC00 1172e5b6d6dSopenharmony_ci# LVT : \uAC01 1182e5b6d6dSopenharmony_ci 1192e5b6d6dSopenharmony_ci<data>•\u1100\u1161\u11a8•\u1100\u1161\u11a8•</data> #LVT 1202e5b6d6dSopenharmony_ci<data>•\u1100\u1161•\u1100\u1161•</data> 1212e5b6d6dSopenharmony_ci<data>•\u1100\u1161\u11a8•\u1161•\u1100•\u11a8•\u1161\u1161\u1161\u11a8•</data> 1222e5b6d6dSopenharmony_ci<data>•\u1100\u1100\uac01•\u1100\uac01•\u1100\uac01\u0301•\uac01•</data> 1232e5b6d6dSopenharmony_ci<data>•\u1100\u0301•\u1161\u11a8\u0301•\u11a8•</data> 1242e5b6d6dSopenharmony_ci 1252e5b6d6dSopenharmony_ci 1262e5b6d6dSopenharmony_ci 1272e5b6d6dSopenharmony_ci# Hindi combining chars. (An old test) 1282e5b6d6dSopenharmony_ci# TODO: Update these tests for Unicode 5.1 Extended Grapheme clusters 1292e5b6d6dSopenharmony_ci#<data>•भ••ा•\u0930•\u0924• •\u0938\u0941\u0902•\u0926•\u0930• 1302e5b6d6dSopenharmony_ci#•\u0939•\u094c•\u0964•</data> 1312e5b6d6dSopenharmony_ci#<data>•\u0916\u0947•\u0938\u0941\u0902•\u0926•\u0930•\u0939•\u094c•\u0964•</data> 1322e5b6d6dSopenharmony_ci 1332e5b6d6dSopenharmony_ci 1342e5b6d6dSopenharmony_ci# Bug 1587. Tamil. \u0baa\u0bc1 is an Extended Grpaheme Cluster 1352e5b6d6dSopenharmony_ci<data>•\u0baa\u0bc1•\u0baa\u0bc1•</data> 1362e5b6d6dSopenharmony_ci 1372e5b6d6dSopenharmony_ci# Regression test for bug 1889 1382e5b6d6dSopenharmony_ci<data>•\u0f40\u0f7d•\u0000•\u0f7e•</data> 1392e5b6d6dSopenharmony_ci 1402e5b6d6dSopenharmony_ci 1412e5b6d6dSopenharmony_ci# 0xffff is a legal character, and should not stop the break iterator early. 1422e5b6d6dSopenharmony_ci# (Requires special casing in implementation, which is why it gets a test.) 1432e5b6d6dSopenharmony_ci<data>•\uffff•\uffff• •a•</data> 1442e5b6d6dSopenharmony_ci 1452e5b6d6dSopenharmony_ci# Treat Japanese Half Width voicing marks as combining 1462e5b6d6dSopenharmony_ci<data>•A\uff9e•B\uff9f\uff9e\uff9f•C•</data> 1472e5b6d6dSopenharmony_ci 1482e5b6d6dSopenharmony_ci# Test data originally from Java BreakIteratorTest.TestCharcterBreak() 1492e5b6d6dSopenharmony_ci<data>•S\u0300•i\u0317•m•p•l•e\u0301• •s•a\u0302•m•p•l•e\u0303•.•w•a\u0302•w•a•f•q•\n•\r•\r\n•\n•</data> 1502e5b6d6dSopenharmony_ci 1512e5b6d6dSopenharmony_ci######################################################################################## 1522e5b6d6dSopenharmony_ci# 1532e5b6d6dSopenharmony_ci# 1542e5b6d6dSopenharmony_ci# E x t e n d e d G r a p h e m e C l u s t e r T e s t s 1552e5b6d6dSopenharmony_ci# 1562e5b6d6dSopenharmony_ci# 1572e5b6d6dSopenharmony_ci########################################################################################## 1582e5b6d6dSopenharmony_ci#<xgc> 1592e5b6d6dSopenharmony_ci 1602e5b6d6dSopenharmony_ci# Plain Vanilla grapheme clusters 1612e5b6d6dSopenharmony_ci#<data>•a•b•c•</data> 1622e5b6d6dSopenharmony_ci#<data>•a\u0301\u0302• •b\u0303\u0304•</data> 1632e5b6d6dSopenharmony_ci 1642e5b6d6dSopenharmony_ci# Assorted Hindi combining marks 1652e5b6d6dSopenharmony_ci#<data>•\u0904\u0903• •\u0937\u093E• •\u0904\u093F• •\u0937\u0940• •\u0937\u0949• •\u0937\u094A• •\u0937\u094B• •\u0937\u094C•</data> 1662e5b6d6dSopenharmony_ci 1672e5b6d6dSopenharmony_ci# Thai Clusters 1682e5b6d6dSopenharmony_ci# $Prepend $Extend* $PrependBase $Extend*; 1692e5b6d6dSopenharmony_ci# 1702e5b6d6dSopenharmony_ci#<data>•\u0e40\u0e01•\u0e44\u0301\u0e23\u0302\u0303•\u0e40•\u0e40\u0e02•\u0e02• •</data> 1712e5b6d6dSopenharmony_ci 1722e5b6d6dSopenharmony_ci# 1732e5b6d6dSopenharmony_ci# ICU-13637 and CLDR-10994 - Indic Grapheme Cluster Boundary changes to support aksaras 1742e5b6d6dSopenharmony_ci# New rule: LinkingConsonant ExtCccZwj* Virama ExtCccZwj* × LinkingConsonant 1752e5b6d6dSopenharmony_ci# Sample Chars: LinkingConsonant: \u0915 1762e5b6d6dSopenharmony_ci# Virama: \u094d [also Extend] 1772e5b6d6dSopenharmony_ci# ExtCccZWJ: \u0308 1782e5b6d6dSopenharmony_ci# Extend but not ExtCCCZWJ \u093A 1792e5b6d6dSopenharmony_ci<char> 1802e5b6d6dSopenharmony_ci<data>•\u0915\u094d\u0915•</data> 1812e5b6d6dSopenharmony_ci<data>•\u0915\u0308\u0308\u094d\u0308\u0308\u0915•</data> 1822e5b6d6dSopenharmony_ci<data>•\u0915\u0308\u0308\u094d\u0308\u0308•\u0041•</data> 1832e5b6d6dSopenharmony_ci<data>•\u0915\u0308\u0308\u094d\u093A\u093A•\u0915•</data> 1842e5b6d6dSopenharmony_ci 1852e5b6d6dSopenharmony_ci# 1862e5b6d6dSopenharmony_ci# From cldr/common/testData/segmentation/graphemeCluster/TestSegmenter-Bengali.txt 1872e5b6d6dSopenharmony_ci# 1882e5b6d6dSopenharmony_ci 1892e5b6d6dSopenharmony_ci# ব্যক্তিত্বের ; 1902e5b6d6dSopenharmony_ci<data>•ব্য•ক্তি•ত্বে•র•</data> 1912e5b6d6dSopenharmony_ci# আত্মবিশ্বাস ; 1922e5b6d6dSopenharmony_ci<data>•আ•ত্ম•বি•শ্বা•স•</data> 1932e5b6d6dSopenharmony_ci# ব্যাক্টেরিয়া ; 1942e5b6d6dSopenharmony_ci<data>•ব্যা•ক্টে•রি•য়া•</data> 1952e5b6d6dSopenharmony_ci# সমস্যার ; 1962e5b6d6dSopenharmony_ci<data>•স•ম•স্যা•র•</data> 1972e5b6d6dSopenharmony_ci# মিশ্রণ ; 1982e5b6d6dSopenharmony_ci<data>•মি•শ্র•ণ•</data> 1992e5b6d6dSopenharmony_ci# দুর্গন্ধ ; 2002e5b6d6dSopenharmony_ci<data>•দু•র্গ•ন্ধ•</data> 2012e5b6d6dSopenharmony_ci# পরীক্ষার ; 2022e5b6d6dSopenharmony_ci<data>•প•রী•ক্ষা•র•</data> 2032e5b6d6dSopenharmony_ci# কোলেস্টেরল ; 2042e5b6d6dSopenharmony_ci<data>•কো•লে•স্টে•র•ল•</data> 2052e5b6d6dSopenharmony_ci# ব্যায়ামকে ; 2062e5b6d6dSopenharmony_ci<data>•ব্যা•য়া•ম•কে•</data> 2072e5b6d6dSopenharmony_ci# সপ্তাহে ; 2082e5b6d6dSopenharmony_ci<data>•স•প্তা•হে•</data> 2092e5b6d6dSopenharmony_ci# পরীক্ষার ; 2102e5b6d6dSopenharmony_ci<data>•প•রী•ক্ষা•র•</data> 2112e5b6d6dSopenharmony_ci# চর্বিজাতীয় ; 2122e5b6d6dSopenharmony_ci<data>•চ•র্বি•জা•তী•য়•</data> 2132e5b6d6dSopenharmony_ci# নিয়ণ্ত্রণ ; 2142e5b6d6dSopenharmony_ci<data>•নি•য়•ণ্ত্র•ণ•</data> 2152e5b6d6dSopenharmony_ci# অবশ্যই ; 2162e5b6d6dSopenharmony_ci<data>•অ•ব•শ্য•ই•</data> 2172e5b6d6dSopenharmony_ci# নয়াদিল্লির ; 2182e5b6d6dSopenharmony_ci<data>•ন•য়া•দি•ল্লি•র•</data> 2192e5b6d6dSopenharmony_ci# সমীক্ষাটা ; 2202e5b6d6dSopenharmony_ci<data>•স•মী•ক্ষা•টা•</data> 2212e5b6d6dSopenharmony_ci# #ভূমিকম্পের ; 2222e5b6d6dSopenharmony_ci# <data>•ভূ•মি•ক•ম্পের•</data> # line 17 in TestSegmenter-Bengali.txt 2232e5b6d6dSopenharmony_ci# কেন্দ্রীয় ; 2242e5b6d6dSopenharmony_ci<data>•কে•ন্দ্রী•য়•</data> 2252e5b6d6dSopenharmony_ci# উস্কানিই ; 2262e5b6d6dSopenharmony_ci<data>•উ•স্কা•নি•ই•</data> 2272e5b6d6dSopenharmony_ci# সমীক্ষকরা ; 2282e5b6d6dSopenharmony_ci<data>•স•মী•ক্ষ•ক•রা•</data> 2292e5b6d6dSopenharmony_ci# মুহূর্তে ; 2302e5b6d6dSopenharmony_ci<data>•মু•হূ•র্তে•</data> 2312e5b6d6dSopenharmony_ci# সম্পর্কে ; 2322e5b6d6dSopenharmony_ci<data>•স•ম্প•র্কে•</data> 2332e5b6d6dSopenharmony_ci# পৌষসংক্রান্তির ; 2342e5b6d6dSopenharmony_ci<data>•পৌ•ষ•সং•ক্রা•ন্তি•র•</data> 2352e5b6d6dSopenharmony_ci# মুখ্যমন্ত্রী ; 2362e5b6d6dSopenharmony_ci<data>•মু•খ্য•ম•ন্ত্রী•</data> 2372e5b6d6dSopenharmony_ci 2382e5b6d6dSopenharmony_ci# 2392e5b6d6dSopenharmony_ci# from cldr/common/testData/segmentation/graphemeCluster/TestSegmenter-Devanagari.txt 2402e5b6d6dSopenharmony_ci# 2412e5b6d6dSopenharmony_ci 2422e5b6d6dSopenharmony_ci# संदिग्ध ; 2432e5b6d6dSopenharmony_ci<data>•सं•दि•ग्ध•</data> 2442e5b6d6dSopenharmony_ci# सुरक्षा ; 2452e5b6d6dSopenharmony_ci<data>•सु•र•क्षा•</data> 2462e5b6d6dSopenharmony_ci# टक्कर ; 2472e5b6d6dSopenharmony_ci<data>•ट•क्क•र•</data> 2482e5b6d6dSopenharmony_ci# सत्र ; 2492e5b6d6dSopenharmony_ci<data>•स•त्र•</data> 2502e5b6d6dSopenharmony_ci# दक्षिण ; 2512e5b6d6dSopenharmony_ci<data>•द•क्षि•ण•</data> 2522e5b6d6dSopenharmony_ci# मिश्रणाने ; 2532e5b6d6dSopenharmony_ci<data>•मि•श्र•णा•ने•</data> 2542e5b6d6dSopenharmony_ci# दुर्घटनाग्रस्त ; 2552e5b6d6dSopenharmony_ci<data>•दु•र्घ•ट•ना•ग्र•स्त•</data> 2562e5b6d6dSopenharmony_ci# मुहूर्त ; 2572e5b6d6dSopenharmony_ci<data>•मु•हू•र्त•</data> 2582e5b6d6dSopenharmony_ci# शर्करायुक्त ; 2592e5b6d6dSopenharmony_ci<data>•श•र्क•रा•यु•क्त•</data> 2602e5b6d6dSopenharmony_ci# अंतरराष्ट्रीय ; 2612e5b6d6dSopenharmony_ci<data>•अं•त•र•रा•ष्ट्री•य•</data> 2622e5b6d6dSopenharmony_ci# राष्ट्रपति ; 2632e5b6d6dSopenharmony_ci<data>•रा•ष्ट्र•प•ति•</data> 2642e5b6d6dSopenharmony_ci# फ्रांस ; 2652e5b6d6dSopenharmony_ci<data>•फ्रां•स•</data> 2662e5b6d6dSopenharmony_ci# ट्रैक्टर ; 2672e5b6d6dSopenharmony_ci<data>•ट्रै•क्ट•र•</data> 2682e5b6d6dSopenharmony_ci# सिट्रोनेलाचे ; 2692e5b6d6dSopenharmony_ci<data>•सि•ट्रो•ने•ला•चे•</data> 2702e5b6d6dSopenharmony_ci# टुक्रालाई ; 2712e5b6d6dSopenharmony_ci<data>•टु•क्रा•ला•ई•</data> 2722e5b6d6dSopenharmony_ci# इकट्ठा ; 2732e5b6d6dSopenharmony_ci<data>•इ•क•ट्ठा•</data> 2742e5b6d6dSopenharmony_ci# शास्त्र ; 2752e5b6d6dSopenharmony_ci<data>•शा•स्त्र•</data> 2762e5b6d6dSopenharmony_ci# स्त्री ; 2772e5b6d6dSopenharmony_ci<data>•स्त्री•</data> 2782e5b6d6dSopenharmony_ci# लक्ष्य ; 2792e5b6d6dSopenharmony_ci<data>•ल•क्ष्य•</data> 2802e5b6d6dSopenharmony_ci# तीक्ष्ण ; 2812e5b6d6dSopenharmony_ci<data>•ती•क्ष्ण•</data> 2822e5b6d6dSopenharmony_ci# Words ; 2832e5b6d6dSopenharmony_ci<data>•W•o•r•d•s•</data> 2842e5b6d6dSopenharmony_ci# त्रिवेदी ; 2852e5b6d6dSopenharmony_ci<data>•त्रि•वे•दी•</data> 2862e5b6d6dSopenharmony_ci# कृत्रिम ; 2872e5b6d6dSopenharmony_ci<data>•कृ•त्रि•म•</data> 2882e5b6d6dSopenharmony_ci# मात्रामा ; 2892e5b6d6dSopenharmony_ci<data>•मा•त्रा•मा•</data> 2902e5b6d6dSopenharmony_ci# सिद्धार्थनगर ; 2912e5b6d6dSopenharmony_ci<data>•सि•द्धा•र्थ•न•ग•र•</data> 2922e5b6d6dSopenharmony_ci# श्रद्धालुओं ; 2932e5b6d6dSopenharmony_ci<data>•श्र•द्धा•लु•ओं•</data> 2942e5b6d6dSopenharmony_ci# वृद्धिसँग ; 2952e5b6d6dSopenharmony_ci<data>•वृ•द्धि•सँ•ग•</data> 2962e5b6d6dSopenharmony_ci# अंतःज्ञानी ; 2972e5b6d6dSopenharmony_ci<data>•अं•तः•ज्ञा•नी• •</data> 2982e5b6d6dSopenharmony_ci# गन्नदी॑धिम ; 2992e5b6d6dSopenharmony_ci<data>•ग•न्न•दी॑•धि•म•</data> 3002e5b6d6dSopenharmony_ci# प्प्रप॑द्ये॒ ; 3012e5b6d6dSopenharmony_ci<data>•प्प्र•प॑•द्ये॒•</data> 3022e5b6d6dSopenharmony_ci# मनस्तापः ; 3032e5b6d6dSopenharmony_ci<data>•म•न•स्ता•पः•</data> 3042e5b6d6dSopenharmony_ci# हविष्करोमि ; 3052e5b6d6dSopenharmony_ci<data>•ह•वि•ष्क•रो•मि•</data> 3062e5b6d6dSopenharmony_ci# अहर्पतिः ; 3072e5b6d6dSopenharmony_ci<data>•अ•ह•र्प•तिः•</data> 3082e5b6d6dSopenharmony_ci# गच्छति ; 3092e5b6d6dSopenharmony_ci<data>•ग•च्छ•ति•</data> 3102e5b6d6dSopenharmony_ci# अयम् ; 3112e5b6d6dSopenharmony_ci<data>•अ•य•म्•</data> 3122e5b6d6dSopenharmony_ci# शिवश्चोदति ; 3132e5b6d6dSopenharmony_ci<data>•शि•व•श्चो•द•ति•</data> 3142e5b6d6dSopenharmony_ci# मनष्टालयति ; 3152e5b6d6dSopenharmony_ci<data>•म•न•ष्टा•ल•य•ति•</data> 3162e5b6d6dSopenharmony_ci# अश्वष्ठक्कस्य ; 3172e5b6d6dSopenharmony_ci<data>•अ•श्व•ष्ठ•क्क•स्य•</data> 3182e5b6d6dSopenharmony_ci# दुष्पुत्रः ; 3192e5b6d6dSopenharmony_ci<data>•दु•ष्पु•त्रः•</data> 3202e5b6d6dSopenharmony_ci# द्विःपक्वम् ; 3212e5b6d6dSopenharmony_ci<data>•द्विः•प•क्व•म्•</data> 3222e5b6d6dSopenharmony_ci# द्विष्कामः ; 3232e5b6d6dSopenharmony_ci<data>•द्वि•ष्का•मः•</data> 3242e5b6d6dSopenharmony_ci# भर्तुर्भोगः ; 3252e5b6d6dSopenharmony_ci<data>•भ•र्तु•र्भो•गः•</data> 3262e5b6d6dSopenharmony_ci# शॆत्युल ; 3272e5b6d6dSopenharmony_ci<data>•शॆ•त्यु•ल• •</data> 3282e5b6d6dSopenharmony_ci# महारॆन्य ; 3292e5b6d6dSopenharmony_ci<data>•म•हा•रॆ•न्य•</data> 3302e5b6d6dSopenharmony_ci# सॆक्युल ; 3312e5b6d6dSopenharmony_ci<data>•सॆ•क्यु•ल•</data> 3322e5b6d6dSopenharmony_ci# ल्यॊदुर ; 3332e5b6d6dSopenharmony_ci<data>•ल्यॊ•दु•र•</data> 3342e5b6d6dSopenharmony_ci# फयॊक ; 3352e5b6d6dSopenharmony_ci<data>•फ•यॊ•क•</data> 3362e5b6d6dSopenharmony_ci# मॊहन्युव ; 3372e5b6d6dSopenharmony_ci<data>•मॊ•ह•न्यु•व•</data> 3382e5b6d6dSopenharmony_ci# अन्यर ; 3392e5b6d6dSopenharmony_ci<data>•अ•न्य•र•</data> 3402e5b6d6dSopenharmony_ci# ख्वजि ; 3412e5b6d6dSopenharmony_ci<data>•ख्व•जि•</data> 3422e5b6d6dSopenharmony_ci# खॅरिन्य ; 3432e5b6d6dSopenharmony_ci<data>•खॅ•रि•न्य•</data> 3442e5b6d6dSopenharmony_ci# उच्छ्वास ; 3452e5b6d6dSopenharmony_ci<data>•उ•च्छ्वा•स•</data> 3462e5b6d6dSopenharmony_ci# व्यक्तिमत्व ; 3472e5b6d6dSopenharmony_ci<data>•व्य•क्ति•म•त्व•</data> 3482e5b6d6dSopenharmony_ci# दातांच्यामध्ये ; 3492e5b6d6dSopenharmony_ci<data>•दा•तां•च्या•म•ध्ये•</data> 3502e5b6d6dSopenharmony_ci# दुर्गंधी ; 3512e5b6d6dSopenharmony_ci<data>•दु•र्गं•धी•</data> 3522e5b6d6dSopenharmony_ci# दुर्गंधीपासूनसुद्धा ; 3532e5b6d6dSopenharmony_ci<data>•दु•र्गं•धी•पा•सू•न•सु•द्धा•</data> 3542e5b6d6dSopenharmony_ci# नित्यकर्मामध्ये ; 3552e5b6d6dSopenharmony_ci<data>•नि•त्य•क•र्मा•म•ध्ये•</data> 3562e5b6d6dSopenharmony_ci# आजारांपासूनसुद्धा ; 3572e5b6d6dSopenharmony_ci<data>•आ•जा•रां•पा•सू•न•सु•द्धा•</data> 3582e5b6d6dSopenharmony_ci# भाज्यांमध्ये ; 3592e5b6d6dSopenharmony_ci<data>•भा•ज्यां•म•ध्ये•</data> 3602e5b6d6dSopenharmony_ci# उच्छ्वासाच्या ; 3612e5b6d6dSopenharmony_ci<data>•उ•च्छ्वा•सा•च्या•</data> 3622e5b6d6dSopenharmony_ci# सुकिल्लीं ; 3632e5b6d6dSopenharmony_ci<data>•सु•कि•ल्लीं•</data> 3642e5b6d6dSopenharmony_ci# स्लिपां ; 3652e5b6d6dSopenharmony_ci<data>•स्लि•पां•</data> 3662e5b6d6dSopenharmony_ci# मिसळिल्ल्यान ; 3672e5b6d6dSopenharmony_ci<data>•मि•स•ळि•ल्ल्या•न•</data> 3682e5b6d6dSopenharmony_ci# रोंप्यांची ; 3692e5b6d6dSopenharmony_ci<data>•रों•प्यां•ची•</data> 3702e5b6d6dSopenharmony_ci# वर्सांतल्यान ; 3712e5b6d6dSopenharmony_ci<data>•व•र्सां•त•ल्या•न•</data> 3722e5b6d6dSopenharmony_ci# रोंप्याच्या ; 3732e5b6d6dSopenharmony_ci<data>•रों•प्या•च्या•</data> 3742e5b6d6dSopenharmony_ci# नाशिल्ल्यान ; 3752e5b6d6dSopenharmony_ci<data>•ना•शि•ल्ल्या•न•</data> 3762e5b6d6dSopenharmony_ci# जिल्ल्याच्या ; 3772e5b6d6dSopenharmony_ci<data>•जि•ल्ल्या•च्या•</data> 3782e5b6d6dSopenharmony_ci# कुरुक्षेत्रांतल्या ; 3792e5b6d6dSopenharmony_ci<data>•कु•रु•क्षे•त्रां•त•ल्या•</data> 3802e5b6d6dSopenharmony_ci# भाज्ज्यांची ; 3812e5b6d6dSopenharmony_ci<data>•भा•ज्ज्यां•ची•</data> 3822e5b6d6dSopenharmony_ci# सिट्रोनेलाका ; 3832e5b6d6dSopenharmony_ci<data>•सि•ट्रो•ने•ला•का•</data> 3842e5b6d6dSopenharmony_ci# गरिनुपर्छ ; 3852e5b6d6dSopenharmony_ci<data>•ग•रि•नु•प•र्छ•</data> 3862e5b6d6dSopenharmony_ci# सामान्यतः ; 3872e5b6d6dSopenharmony_ci<data>•सा•मा•न्य•तः•</data> 3882e5b6d6dSopenharmony_ci# वृद्धिसँग ; 3892e5b6d6dSopenharmony_ci<data>•वृ•द्धि•सँ•ग•</data> 3902e5b6d6dSopenharmony_ci# रिपोर्टनि ; 3912e5b6d6dSopenharmony_ci<data>•रि•पो•र्ट•नि•</data> 3922e5b6d6dSopenharmony_ci# टोस्टर्ज़ ; 3932e5b6d6dSopenharmony_ci<data>•टो•स्ट•र्ज़•</data> 3942e5b6d6dSopenharmony_ci# वक्तव्य ; 3952e5b6d6dSopenharmony_ci<data>•व•क्त•व्य•</data> 3962e5b6d6dSopenharmony_ci# प्रक्रिया ; 3972e5b6d6dSopenharmony_ci<data>•प्र•क्रि•या•</data> 3982e5b6d6dSopenharmony_ci# निर्दिष्ट ; 3992e5b6d6dSopenharmony_ci<data>•नि•र्दि•ष्ट•</data> 4002e5b6d6dSopenharmony_ci# अस्वीकृत ; 4012e5b6d6dSopenharmony_ci<data>•अ•स्वी•कृ•त•</data> 4022e5b6d6dSopenharmony_ci# प्रयोक्ता ; 4032e5b6d6dSopenharmony_ci<data>•प्र•यो•क्ता•</data> 4042e5b6d6dSopenharmony_ci# प्रकार्यक ; 4052e5b6d6dSopenharmony_ci<data>•प्र•का•र्य•क•</data> 4062e5b6d6dSopenharmony_ci# ट्रेक्टरु ; 4072e5b6d6dSopenharmony_ci<data>•ट्रे•क्ट•रु•</data> 4082e5b6d6dSopenharmony_ci# स्थानधारक ; 4092e5b6d6dSopenharmony_ci<data>•स्था•न•धा•र•क•</data> 4102e5b6d6dSopenharmony_ci# प्रकार्यक ; 4112e5b6d6dSopenharmony_ci<data>•प्र•का•र्य•क•</data> 4122e5b6d6dSopenharmony_ci# अनुच्छेदसँ ; 4132e5b6d6dSopenharmony_ci<data>•अ•नु•च्छे•द•सँ•</data> 4142e5b6d6dSopenharmony_ci# गर्मीपदु ; 4152e5b6d6dSopenharmony_ci<data>•ग•र्मी•प•दु•</data> 4162e5b6d6dSopenharmony_ci# शास्त्रु ; 4172e5b6d6dSopenharmony_ci<data>•शा•स्त्रु•</data> 4182e5b6d6dSopenharmony_ci# इन्द्री ; 4192e5b6d6dSopenharmony_ci<data>•इ•न्द्री•</data> 4202e5b6d6dSopenharmony_ci# श्रधालू ; 4212e5b6d6dSopenharmony_ci<data>•श्र•धा•लू•</data> 4222e5b6d6dSopenharmony_ci# आस्तिकु ; 4232e5b6d6dSopenharmony_ci<data>•आ•स्ति•कु•</data> 4242e5b6d6dSopenharmony_ci# सकार्थो ; 4252e5b6d6dSopenharmony_ci<data>•स•का•र्थो•</data> 4262e5b6d6dSopenharmony_ci# सन्ॿंधु ; 4272e5b6d6dSopenharmony_ci<data>•स•न्ॿं•धु•</data> 4282e5b6d6dSopenharmony_ci# मनुक्खो ; 4292e5b6d6dSopenharmony_ci<data>•म•नु•क्खो•</data> 4302e5b6d6dSopenharmony_ci# हानिफ्राय ; 4312e5b6d6dSopenharmony_ci<data>•हा•नि•फ्रा•य•</data> 4322e5b6d6dSopenharmony_ci# दैथाइहरग्रा ; 4332e5b6d6dSopenharmony_ci<data>•दै•था•इ•ह•र•ग्रा•</data> 4342e5b6d6dSopenharmony_ci# बोसोरब्रै ; 4352e5b6d6dSopenharmony_ci<data>•बो•सो•र•ब्रै•</data> 4362e5b6d6dSopenharmony_ci# रांखान्थियारि ; 4372e5b6d6dSopenharmony_ci<data>•रां•खा•न्थि•या•रि•</data> 4382e5b6d6dSopenharmony_ci# खान्थियाव ; 4392e5b6d6dSopenharmony_ci<data>•खा•न्थि•या•व•</data> 4402e5b6d6dSopenharmony_ci# स्लिप्स ; 4412e5b6d6dSopenharmony_ci<data>•स्लि•प्स•</data> 4422e5b6d6dSopenharmony_ci 4432e5b6d6dSopenharmony_ci# 4442e5b6d6dSopenharmony_ci# cldr/common/testData/segmentation/graphemeCluster/TestSegmenter-Gujarati.txt 4452e5b6d6dSopenharmony_ci# 4462e5b6d6dSopenharmony_ci 4472e5b6d6dSopenharmony_ci# અંગ્રેજી ; 4482e5b6d6dSopenharmony_ci<data>•અં•ગ્રે•જી•</data> 4492e5b6d6dSopenharmony_ci# શબ્દકોશ ; 4502e5b6d6dSopenharmony_ci<data>•શ•બ્દ•કો•શ•</data> 4512e5b6d6dSopenharmony_ci# બાપ્તિસ્મા ; 4522e5b6d6dSopenharmony_ci<data>•બા•પ્તિ•સ્મા•</data> 4532e5b6d6dSopenharmony_ci# મિસ્ત્રી ; 4542e5b6d6dSopenharmony_ci<data>•મિ•સ્ત્રી•</data> 4552e5b6d6dSopenharmony_ci# સિક્કા ; 4562e5b6d6dSopenharmony_ci<data>•સિ•ક્કા•</data> 4572e5b6d6dSopenharmony_ci#6 ; એકત્રીસમું ; એ÷કત્રી÷સ÷મું 4582e5b6d6dSopenharmony_ci 4592e5b6d6dSopenharmony_ci# સ્વસ્થાને ; 4602e5b6d6dSopenharmony_ci<data>•સ્વ•સ્થા•ને•</data> 4612e5b6d6dSopenharmony_ci# પ્રશ્નાર્થ ; 4622e5b6d6dSopenharmony_ci<data>•પ્ર•શ્ના•ર્થ•</data> 4632e5b6d6dSopenharmony_ci# વર્તમાનકૃદંત ; 4642e5b6d6dSopenharmony_ci<data>•વ•ર્ત•મા•ન•કૃ•દં•ત•</data> 4652e5b6d6dSopenharmony_ci# વાક્યની ; 4662e5b6d6dSopenharmony_ci<data>•વા•ક્ય•ની•</data> 4672e5b6d6dSopenharmony_ci# સાર્વજનિક ; 4682e5b6d6dSopenharmony_ci<data>•સા•ર્વ•જ•નિ•ક•</data> 4692e5b6d6dSopenharmony_ci# સમાપ્તિ ; 4702e5b6d6dSopenharmony_ci<data>•સ•મા•પ્તિ•</data> 4712e5b6d6dSopenharmony_ci# પધાર્યા ; 4722e5b6d6dSopenharmony_ci<data>•પ•ધા•ર્યા•</data> 4732e5b6d6dSopenharmony_ci# વ્યક્તિત્વને ; 4742e5b6d6dSopenharmony_ci<data>•વ્ય•ક્તિ•ત્વ•ને•</data> 4752e5b6d6dSopenharmony_ci# આત્મવિશ્વાસ ; 4762e5b6d6dSopenharmony_ci<data>•આ•ત્મ•વિ•શ્વા•સ•</data> 4772e5b6d6dSopenharmony_ci# વ્યાયામથી ; 4782e5b6d6dSopenharmony_ci<data>•વ્યા•યા•મ•થી•</data> 4792e5b6d6dSopenharmony_ci# યુક્ત ; 4802e5b6d6dSopenharmony_ci<data>•યુ•ક્ત•</data> 4812e5b6d6dSopenharmony_ci# #18 ; પુરુષોત્તમ ; 4822e5b6d6dSopenharmony_ci<data>•પ•રુ•ષો•ત્ત•મ•</data> 4832e5b6d6dSopenharmony_ci# કેન્દ્રીય ; 4842e5b6d6dSopenharmony_ci<data>•કે•ન્દ્રી•ય•</data> 4852e5b6d6dSopenharmony_ci# ક્ષત્રિય ; 4862e5b6d6dSopenharmony_ci<data>•ક્ષ•ત્રિ•ય•</data> 4872e5b6d6dSopenharmony_ci# ફોર્મ્યુલા ; 4882e5b6d6dSopenharmony_ci<data>•ફો•ર્મ્યુ•લા•</data> 4892e5b6d6dSopenharmony_ci# કેન્દ્રમાં ; 4902e5b6d6dSopenharmony_ci<data>•કે•ન્દ્ર•માં•</data> 4912e5b6d6dSopenharmony_ci# સ્પ્રિંગ ; 4922e5b6d6dSopenharmony_ci<data>•સ્પ્રિં•ગ•</data> 4932e5b6d6dSopenharmony_ci# પ્રારંભ ; 4942e5b6d6dSopenharmony_ci<data>•પ્રા•રં•ભ•</data> 4952e5b6d6dSopenharmony_ci# વિદ્યાર્થીઓ ; 4962e5b6d6dSopenharmony_ci<data>•વિ•દ્યા•ર્થી•ઓ•</data> 4972e5b6d6dSopenharmony_ci# સ્વાર્થની ; 4982e5b6d6dSopenharmony_ci<data>•સ્વા•ર્થ•ની•</data> 4992e5b6d6dSopenharmony_ci# લોન્ગયરબ્યેન ; 5002e5b6d6dSopenharmony_ci<data>•લો•ન્ગ•ય•ર•બ્યે•ન•</data> 5012e5b6d6dSopenharmony_ci# સ્થાનિક ; 5022e5b6d6dSopenharmony_ci<data>•સ્થા•નિ•ક•</data> 5032e5b6d6dSopenharmony_ci# બિલ્ડિંગની ; 5042e5b6d6dSopenharmony_ci<data>•બિ•લ્ડિં•ગ•ની•</data> 5052e5b6d6dSopenharmony_ci# ઉત્પાદક ; 5062e5b6d6dSopenharmony_ci<data>•ઉ•ત્પા•દ•ક•</data> 5072e5b6d6dSopenharmony_ci# ઝૂકાવ્યું ; 5082e5b6d6dSopenharmony_ci<data>•ઝૂ•કા•વ્યું•</data> 5092e5b6d6dSopenharmony_ci# પ્રપૌત્ર ; 5102e5b6d6dSopenharmony_ci<data>•પ્ર•પૌ•ત્ર•</data> 5112e5b6d6dSopenharmony_ci 5122e5b6d6dSopenharmony_ci 5132e5b6d6dSopenharmony_ci# 5142e5b6d6dSopenharmony_ci# cldr/common/testData/segmentation/graphemeCluster/TestSegmenter-Malayalam.txt 5152e5b6d6dSopenharmony_ci# 5162e5b6d6dSopenharmony_ci 5172e5b6d6dSopenharmony_ci# സ്ഥാനമൊഴിയുക ; 5182e5b6d6dSopenharmony_ci<data>•സ്ഥാ•ന•മൊ•ഴി•യു•ക•</data> 5192e5b6d6dSopenharmony_ci# വ്യക്തി ; 5202e5b6d6dSopenharmony_ci<data>•വ്യ•ക്തി•</data> 5212e5b6d6dSopenharmony_ci# കൗമാരക്കാരി ; 5222e5b6d6dSopenharmony_ci<data>•കൗ•മാ•ര•ക്കാ•രി•</data> 5232e5b6d6dSopenharmony_ci# കല്യാണം ; 5242e5b6d6dSopenharmony_ci<data>•ക•ല്യാ•ണം•</data> 5252e5b6d6dSopenharmony_ci# റദ്ദാക്കിയ ; 5262e5b6d6dSopenharmony_ci<data>•റ•ദ്ദാ•ക്കി•യ•</data> 5272e5b6d6dSopenharmony_ci# വ്യാപിക്കുക ; 5282e5b6d6dSopenharmony_ci<data>•വ്യാ•പി•ക്കു•ക•</data> 5292e5b6d6dSopenharmony_ci# സ്തുതി ; 5302e5b6d6dSopenharmony_ci<data>•സ്തു•തി•</data> 5312e5b6d6dSopenharmony_ci# ഭക്ഷ്യസുരക്ഷ ; 5322e5b6d6dSopenharmony_ci<data>•ഭ•ക്ഷ്യ•സു•ര•ക്ഷ•</data> 5332e5b6d6dSopenharmony_ci# പൂഴ്ത്തിവെക്കുക ; 5342e5b6d6dSopenharmony_ci<data>•പൂ•ഴ്ത്തി•വെ•ക്കു•ക•</data> 5352e5b6d6dSopenharmony_ci# നിശ്ചയിച്ച ; 5362e5b6d6dSopenharmony_ci<data>•നി•ശ്ച•യി•ച്ച•</data> 5372e5b6d6dSopenharmony_ci# പ്രശ്നം ; 5382e5b6d6dSopenharmony_ci<data>•പ്ര•ശ്നം•</data> 5392e5b6d6dSopenharmony_ci# സംസ്ഥാനം ; 5402e5b6d6dSopenharmony_ci<data>•സം•സ്ഥാ•നം•</data> 5412e5b6d6dSopenharmony_ci# പ്രോത്സാഹം ; 5422e5b6d6dSopenharmony_ci<data>•പ്രോ•ത്സാ•ഹം•</data> 5432e5b6d6dSopenharmony_ci# ഉദ്യോഗസ്ഥ ; 5442e5b6d6dSopenharmony_ci<data>•ഉ•ദ്യോ•ഗ•സ്ഥ•</data> 5452e5b6d6dSopenharmony_ci# സ്ഥാപനം ; 5462e5b6d6dSopenharmony_ci<data>•സ്ഥാ•പ•നം•</data> 5472e5b6d6dSopenharmony_ci# അത്ഭുതം ; 5482e5b6d6dSopenharmony_ci<data>•അ•ത്ഭു•തം•</data> 5492e5b6d6dSopenharmony_ci# പ്രഖ്യാപനം ; 5502e5b6d6dSopenharmony_ci<data>•പ്ര•ഖ്യാ•പ•നം•</data> 5512e5b6d6dSopenharmony_ci 5522e5b6d6dSopenharmony_ci# 5532e5b6d6dSopenharmony_ci# cldr/common/testData/segmentation/graphemeCluster/TestSegmenter-Odia.txt 5542e5b6d6dSopenharmony_ci# 5552e5b6d6dSopenharmony_ci 5562e5b6d6dSopenharmony_ci# ଅସ୍ବରାନ୍ତ ; 5572e5b6d6dSopenharmony_ci<data>•ଅ•ସ୍ବ•ରା•ନ୍ତ•</data> 5582e5b6d6dSopenharmony_ci# ଅକର୍ମଣ୍ୟତା ; 5592e5b6d6dSopenharmony_ci<data>•ଅ•କ•ର୍ମ•ଣ୍ୟ•ତା•</data> 5602e5b6d6dSopenharmony_ci# ଅକର୍ମା ; 5612e5b6d6dSopenharmony_ci<data>•ଅ•କ•ର୍ମା•</data> 5622e5b6d6dSopenharmony_ci# ଆକର୍ଣ୍ଣ ; 5632e5b6d6dSopenharmony_ci<data>•ଆ•କ•ର୍ଣ୍ଣ•</data> 5642e5b6d6dSopenharmony_ci# ଆକସ୍ମିକୀ ; 5652e5b6d6dSopenharmony_ci<data>•ଆ•କ•ସ୍ମି•କୀ•</data> 5662e5b6d6dSopenharmony_ci# ଇଞ୍ଚମନୌତୀ ; 5672e5b6d6dSopenharmony_ci<data>•ଇ•ଞ୍ଚ•ମ•ନୌ•ତୀ•</data> 5682e5b6d6dSopenharmony_ci# ଅଗତ୍ୟା ; 5692e5b6d6dSopenharmony_ci<data>•ଅ•ଗ•ତ୍ୟା•</data> 5702e5b6d6dSopenharmony_ci# ଇନ୍ଦ୍ରଧ୍ବଜ ; 5712e5b6d6dSopenharmony_ci<data>•ଇ•ନ୍ଦ୍ର•ଧ୍ବ•ଜ•</data> 5722e5b6d6dSopenharmony_ci# ଊରୁତ୍ରାଣ ; 5732e5b6d6dSopenharmony_ci<data>•ଊ•ରୁ•ତ୍ରା•ଣ•</data> 5742e5b6d6dSopenharmony_ci# ଐଶ୍ବର୍ଯ୍ୟ ; 5752e5b6d6dSopenharmony_ci<data>•ଐ•ଶ୍ବ•ର୍ଯ୍ୟ•</data> 5762e5b6d6dSopenharmony_ci# ଅଗତ୍ୟା ; 5772e5b6d6dSopenharmony_ci<data>•ଅ•ଗ•ତ୍ୟା•</data> 5782e5b6d6dSopenharmony_ci# ଔପନ୍ୟାସିକ ; 5792e5b6d6dSopenharmony_ci<data>•ଔ•ପ•ନ୍ୟା•ସି•କ•</data> 5802e5b6d6dSopenharmony_ci# ଔଷ୍ଠ୍ୟ ; 5812e5b6d6dSopenharmony_ci<data>•ଔ•ଷ୍ଠ୍ୟ•</data> 5822e5b6d6dSopenharmony_ci# ଯଜୁର୍ବେଦୀ ; 5832e5b6d6dSopenharmony_ci<data>•ଯ•ଜୁ•ର୍ବେ•ଦୀ•</data> 5842e5b6d6dSopenharmony_ci# ପକ୍ଷ୍ମ ; 5852e5b6d6dSopenharmony_ci<data>•ପ•କ୍ଷ୍ମ•</data> 5862e5b6d6dSopenharmony_ci# ପଞ୍ଚଭୌତିକ ; 5872e5b6d6dSopenharmony_ci<data>•ପ•ଞ୍ଚ•ଭୌ•ତି•କ•</data> 5882e5b6d6dSopenharmony_ci# ତନ୍ତ୍ରିକାତନ୍ତ୍ର ; 5892e5b6d6dSopenharmony_ci<data>•ତ•ନ୍ତ୍ରି•କା•ତ•ନ୍ତ୍ର•</data> 5902e5b6d6dSopenharmony_ci# ସ୍ନାୟୁତନ୍ତ୍ର ; 5912e5b6d6dSopenharmony_ci<data>•ସ୍ନା•ୟୁ•ତ•ନ୍ତ୍ର•</data> 5922e5b6d6dSopenharmony_ci# ତପ୍ତକୁଣ୍ଡ ; 5932e5b6d6dSopenharmony_ci<data>•ତ•ପ୍ତ•କୁ•ଣ୍ଡ•</data> 5942e5b6d6dSopenharmony_ci# ଚଣ୍ଡୋଦରୀ ; 5952e5b6d6dSopenharmony_ci<data>•ଚ•ଣ୍ଡୋ•ଦ•ରୀ•</data> 5962e5b6d6dSopenharmony_ci# ଝଙ୍କାର ; 5972e5b6d6dSopenharmony_ci<data>•ଝ•ଙ୍କା•ର•</data> 5982e5b6d6dSopenharmony_ci# କଙ୍କପୃଷ୍ଠୀ ; 5992e5b6d6dSopenharmony_ci<data>•କ•ଙ୍କ•ପୃ•ଷ୍ଠୀ•</data> 6002e5b6d6dSopenharmony_ci# ଖଣ୍ଡନୀୟ ; 6012e5b6d6dSopenharmony_ci<data>•ଖ•ଣ୍ଡ•ନୀ•ୟ•</data> 6022e5b6d6dSopenharmony_ci# ଖମ୍ଭାବତୀ ; 6032e5b6d6dSopenharmony_ci<data>•ଖ•ମ୍ଭା•ବ•ତୀ•</data> 6042e5b6d6dSopenharmony_ci# ଘାଣେନ୍ଦ୍ରିୟ ; 6052e5b6d6dSopenharmony_ci<data>•ଘା•ଣେ•ନ୍ଦ୍ରି•ୟ•</data> 6062e5b6d6dSopenharmony_ci# ଘୁଞ୍ଚାଇବା ; 6072e5b6d6dSopenharmony_ci<data>•ଘୁ•ଞ୍ଚା•ଇ•ବା•</data> 6082e5b6d6dSopenharmony_ci 6092e5b6d6dSopenharmony_ci# 6102e5b6d6dSopenharmony_ci# cldr/common/testData/segmentation/graphemeCluster/TestSegmenter-Telugu.txt 6112e5b6d6dSopenharmony_ci# 6122e5b6d6dSopenharmony_ci 6132e5b6d6dSopenharmony_ci# అదృశ్యం ; 6142e5b6d6dSopenharmony_ci<data>•అ•దృ•శ్యం•</data> 6152e5b6d6dSopenharmony_ci# ఉద్యోగాలు ; 6162e5b6d6dSopenharmony_ci<data>•ఉ•ద్యో•గా•లు•</data> 6172e5b6d6dSopenharmony_ci# ఉన్నాయన్న ; 6182e5b6d6dSopenharmony_ci<data>•ఉ•న్నా•య•న్న•</data> 6192e5b6d6dSopenharmony_ci# కార్యదర్శి ; 6202e5b6d6dSopenharmony_ci<data>•కా•ర్య•ద•ర్శి•</data> 6212e5b6d6dSopenharmony_ci# సామర్థ్యం ; 6222e5b6d6dSopenharmony_ci<data>•సా•మ•ర్థ్యం•</data> 6232e5b6d6dSopenharmony_ci# అభిప్రాయం ; 6242e5b6d6dSopenharmony_ci<data>•అ•భి•ప్రా•యం•</data> 6252e5b6d6dSopenharmony_ci# రాష్ట్రస్థాయి ; 6262e5b6d6dSopenharmony_ci<data>•రా•ష్ట్ర•స్థా•యి•</data> 6272e5b6d6dSopenharmony_ci# నిర్లక్ష్యం ; 6282e5b6d6dSopenharmony_ci<data>•ని•ర్ల•క్ష్యం•</data> 6292e5b6d6dSopenharmony_ci# వ్యాజ్యాలన్నీ ; 6302e5b6d6dSopenharmony_ci<data>•వ్యా•జ్యా•ల•న్నీ•</data> 6312e5b6d6dSopenharmony_ci# న్యాయవ్యవస్థ ; 6322e5b6d6dSopenharmony_ci<data>•న్యా•య•వ్య•వ•స్థ•</data> 6332e5b6d6dSopenharmony_ci# వ్యాఖ్యలు ; 6342e5b6d6dSopenharmony_ci<data>•వ్యా•ఖ్య•లు•</data> 6352e5b6d6dSopenharmony_ci# నేతృత్వం ; 6362e5b6d6dSopenharmony_ci<data>•నే•తృ•త్వం•</data> 6372e5b6d6dSopenharmony_ci# ఉద్రిక్తత ; 6382e5b6d6dSopenharmony_ci<data>•ఉ•ద్రి•క్త•త•</data> 6392e5b6d6dSopenharmony_ci# వ్యాఖ్యలు ; 6402e5b6d6dSopenharmony_ci<data>•వ్యా•ఖ్య•లు•</data> 6412e5b6d6dSopenharmony_ci# అత్యున్నత ; 6422e5b6d6dSopenharmony_ci<data>•అ•త్యు•న్న•త•</data> 6432e5b6d6dSopenharmony_ci# మనస్పర్ధలు ; 6442e5b6d6dSopenharmony_ci<data>•మ•న•స్ప•ర్ధ•లు•</data> 6452e5b6d6dSopenharmony_ci# కార్యక్రమం ; 6462e5b6d6dSopenharmony_ci<data>•కా•ర్య•క్ర•మం•</data> 6472e5b6d6dSopenharmony_ci# గుప్పిస్తున్నారు ; 6482e5b6d6dSopenharmony_ci<data>•గు•ప్పి•స్తు•న్నా•రు•</data> 6492e5b6d6dSopenharmony_ci# నటిస్తున్నారు ; 6502e5b6d6dSopenharmony_ci<data>•న•టి•స్తు•న్నా•రు•</data> 6512e5b6d6dSopenharmony_ci# ద్వితీయార్ధం ; 6522e5b6d6dSopenharmony_ci<data>•ద్వి•తీ•యా•ర్ధం•</data> 6532e5b6d6dSopenharmony_ci# జీర్ణవ్యవస్థ ; 6542e5b6d6dSopenharmony_ci<data>•జీ•ర్ణ•వ్య•వ•స్థ•</data> 6552e5b6d6dSopenharmony_ci# ఉత్సాహం ; 6562e5b6d6dSopenharmony_ci<data>•ఉ•త్సా•హం•</data> 6572e5b6d6dSopenharmony_ci# హృద్రోగాలు ; 6582e5b6d6dSopenharmony_ci<data>•హృ•ద్రో•గా•లు•</data> 6592e5b6d6dSopenharmony_ci# పాల్గొనాల్సింది ; 6602e5b6d6dSopenharmony_ci<data>•పా•ల్గొ•నా•ల్సిం•ది•</data> 6612e5b6d6dSopenharmony_ci# మార్గదర్శకాలు ; 6622e5b6d6dSopenharmony_ci<data>•మా•ర్గ•ద•ర్శ•కా•లు•</data> 6632e5b6d6dSopenharmony_ci 6642e5b6d6dSopenharmony_ci 6652e5b6d6dSopenharmony_ci######################################################################################## 6662e5b6d6dSopenharmony_ci# 6672e5b6d6dSopenharmony_ci# 6682e5b6d6dSopenharmony_ci# W o r d B o u n d a r y T e s t s 6692e5b6d6dSopenharmony_ci# 6702e5b6d6dSopenharmony_ci# 6712e5b6d6dSopenharmony_ci########################################################################################## 6722e5b6d6dSopenharmony_ci 6732e5b6d6dSopenharmony_ci<word> 6742e5b6d6dSopenharmony_ci# 6752e5b6d6dSopenharmony_ci# Quick sanity test 6762e5b6d6dSopenharmony_ci# 6772e5b6d6dSopenharmony_ci<data>•hello<200> •there<200> •goodbye<200></data> 6782e5b6d6dSopenharmony_ci<data>•hello<200> •12345<100> •,•</data> 6792e5b6d6dSopenharmony_ci 6802e5b6d6dSopenharmony_ci 6812e5b6d6dSopenharmony_ci# 6822e5b6d6dSopenharmony_ci# Test data originally in RBBIAPITest::TestFirstNextFollowing() and TestLastPreviousPreceding() 6832e5b6d6dSopenharmony_ci# 6842e5b6d6dSopenharmony_ci 6852e5b6d6dSopenharmony_ci<word> 6862e5b6d6dSopenharmony_ci<data>•This<200> •is<200> •a<200> •word<200> •break<200>.• •Isn't<200> •it<200>?• •2.25<100></data> 6872e5b6d6dSopenharmony_ci 6882e5b6d6dSopenharmony_ci 6892e5b6d6dSopenharmony_ci 6902e5b6d6dSopenharmony_ci# 6912e5b6d6dSopenharmony_ci# Data originally from TestDefaultRuleBasedWordIteration() 6922e5b6d6dSopenharmony_ci# 6932e5b6d6dSopenharmony_ci<data>•Write<200> •wordrules<200>.• •123.456<100> •alpha\u00adbeta\u00adgamma<200> •\u092f\u0939<200> •</data> 6942e5b6d6dSopenharmony_ci<data>• •\u0939\u093f\u0928\u094d\u200d\u0926\u0940<200> •\u0939\u0948<200> •\u0905\u093e\u092a<200> •\u0938\u093f\u0916\u094b\u0917\u0947<200>?•</data> 6952e5b6d6dSopenharmony_ci 6962e5b6d6dSopenharmony_ci#Hindi Numbers 6972e5b6d6dSopenharmony_ci<data>• •\u0968\u0966.\u0969\u096f<100> •\u0967\u0966\u0966.\u0966\u0966<100> •\N{RUPEE SIGN}•\u0967,\u0967\u0966\u0966.\u0966\u0966<100> •\u0905\u092e\u091c<200>\n•</data> 6982e5b6d6dSopenharmony_ci 6992e5b6d6dSopenharmony_ci<data>•\u0938\u094d\u200d\u0935\u0924\u0902deadTA\u0930<200>\r•It's<200> •$•30.10<100> •12,34<100>¢•£•¤•¥•alpha\u05f3beta\u05f4gamma<200> •</data> 7002e5b6d6dSopenharmony_ci 7012e5b6d6dSopenharmony_ci<data>•Badges<200>?• •BADGES<200>!•?•!• •We<200> •don't<200> •need<200> •no<200> •STINKING<200> •BADGES<200>!•!•1000,233,456.000<100> •1,23.322<100>%•123.1222<100>$•123,000.20<100> •179.01<100>%•X<200> •Now<200>\r•is<200>\n•the<200>\r\n•time<200> •</data> 7022e5b6d6dSopenharmony_ci 7032e5b6d6dSopenharmony_ci#Hangul 7042e5b6d6dSopenharmony_ci<data>•\uc5f0\ud569<200> •\uc7a5\ub85c\uad50\ud68c<200> •\u1109\u1161\u11bc\u1112\u1161\u11bc<200> •\u1112\u1161\u11ab\u110b\u1175\u11ab<200> •Hello<200>,• •how<200> •are<200> •you<200> •</data> 7052e5b6d6dSopenharmony_ci 7062e5b6d6dSopenharmony_ci<data>•Hello<200>,• •how<200> •are<200> •you<200> •\uc5f0\ud569<200> •\uc7a5\ub85c\uad50\ud68c<200> •\u1109\u1161\u11bc\u1112\u1161\u11bc<200> •\u1112\u1161\u11ab\u110b\u1175\u11ab<200> •</data> 7072e5b6d6dSopenharmony_ci 7082e5b6d6dSopenharmony_ci# Words containing non-BMP letters 7092e5b6d6dSopenharmony_ci<data>•abc\U00010300<200> •abc\N{DESERET SMALL LETTER ENG}<200> •abc\N{MATHEMATICAL BOLD SMALL Z}<200> •abc\N{MATHEMATICAL SANS-SERIF BOLD ITALIC PI SYMBOL}<200> •</data> 7102e5b6d6dSopenharmony_ci 7112e5b6d6dSopenharmony_ci# Unassigned code points 7122e5b6d6dSopenharmony_ci<data>•abc<200>\U0001D800•def<200>\U0001D3FF• •</data> 7132e5b6d6dSopenharmony_ci 7142e5b6d6dSopenharmony_ci# Hiragana & Katakana stay together, but separates from each other and Latin. 7152e5b6d6dSopenharmony_ci# *** what to do about theoretical combos of chars? i.e. hiragana + accent 7162e5b6d6dSopenharmony_ci#<data>•abc<200>\N{HIRAGANA LETTER SMALL A}<400>\N{HIRAGANA LETTER VU}\N{COMBINING ACUTE ACCENT}<400>\N{HIRAGANA ITERATION MARK}<400>\N{KATAKANA LETTER SMALL A}\N{KATAKANA ITERATION MARK}\N{HALFWIDTH KATAKANA LETTER WO}\N{HALFWIDTH KATAKANA LETTER N}<400>def<200>#•</data> 7172e5b6d6dSopenharmony_ci 7182e5b6d6dSopenharmony_ci# test normalization/dictionary handling of halfwidth katakana: same dictionary phrase in fullwidth and halfwidth 7192e5b6d6dSopenharmony_ci<data>•芽キャベツ<400>芽キャベツ<400></data> 7202e5b6d6dSopenharmony_ci 7212e5b6d6dSopenharmony_ci# more Japanese tests 7222e5b6d6dSopenharmony_ci# TODO: some script=common characters in the Hiragana and the Katakana block may not be treated correctly 7232e5b6d6dSopenharmony_ci# (was formerly true for U+30FC); need to check and fix if so. 7242e5b6d6dSopenharmony_ci#<data>•どー<400>せ<400>日本語<400>を<400>勉強<400>する<400>理由<400>について<400> •て<400>こと<400>は<400>我<400>でも<400>知<400>ら<400>も<400>い<400>こと<400>なん<400>だ<400>。•</data> 7252e5b6d6dSopenharmony_ci<data>•日本語<400>を<400>勉強<400>する<400>理由<400>について<400> •て<400>こと<400>は<400>我<400>でも<400>知<400>ら<400>も<400>い<400>こと<400>なん<400>だ<400>。•</data> 7262e5b6d6dSopenharmony_ci 7272e5b6d6dSopenharmony_ci# Testing of word boundary for dictionary word containing both kanji and kana 7282e5b6d6dSopenharmony_ci<data>•中だるみ<400>蔵王の森<400>ウ離島<400></data> 7292e5b6d6dSopenharmony_ci 7302e5b6d6dSopenharmony_ci# Testing of Chinese segmentation (taken from a Chinese news article) 7312e5b6d6dSopenharmony_ci<data>•400<100>余<400>名<400>中央<400>委员<400>和<400>中央<400>候补<400>委员<400>都<400>领<400>到了<400>“•推荐<400>票<400>”•,•有<400>资格<400>在<400>200<100>多<400>名<400>符合<400>条件<400>的<400>63<100>岁<400>以下<400>中共<400>正<400>部<400>级<400>干部<400>中<400>,•选出<400>他们<400>属意<400>的<400>中央<400>政治局<400>委员<400>以<400>向<400>政治局<400>常委<400>会<400>举荐<400>。•</data> 7322e5b6d6dSopenharmony_ci 7332e5b6d6dSopenharmony_ci# Words with interior formatting characters 7342e5b6d6dSopenharmony_ci<data>•def\N{COMBINING ACUTE ACCENT}\N{SYRIAC ABBREVIATION MARK}ghi<200> •</data> 7352e5b6d6dSopenharmony_ci 7362e5b6d6dSopenharmony_ci# to test for bug #4097779 7372e5b6d6dSopenharmony_ci<data>•aa\N{COMBINING GRAVE ACCENT}a<200> •</data> 7382e5b6d6dSopenharmony_ci 7392e5b6d6dSopenharmony_ci# fullwidth numeric, midletter characters etc should be treated like their halfwidth counterparts 7402e5b6d6dSopenharmony_ci<data>•ISN'T<200> •19<100>日<400></data> 7412e5b6d6dSopenharmony_ci 7422e5b6d6dSopenharmony_ci# to test for bug #4098467 7432e5b6d6dSopenharmony_ci# What follows is a string of Korean characters (I found it in the Yellow Pages 7442e5b6d6dSopenharmony_ci# ad for the Korean Presbyterian Church of San Francisco, and I hope I transcribed 7452e5b6d6dSopenharmony_ci# it correctly), first as precomposed syllables, and then as conjoining jamo. 7462e5b6d6dSopenharmony_ci# Both sequences should be semantically identical and break the same way. 7472e5b6d6dSopenharmony_ci# precomposed syllables... 7482e5b6d6dSopenharmony_ci<data>•\uc0c1\ud56d<200> •\ud55c\uc778<200> •\uc5f0\ud569<200> •\uc7a5\ub85c\uad50\ud68c<200> •\u1109\u1161\u11bc\u1112\u1161\u11bc<200> •\u1112\u1161\u11ab\u110b\u1175\u11ab<200> •\u110b\u1167\u11ab\u1112\u1161\u11b8<200> •\u110c\u1161\u11bc\u1105\u1169\u1100\u116d\u1112\u116c<200> •</data> 7492e5b6d6dSopenharmony_ci 7502e5b6d6dSopenharmony_ci# more Korean tests (Jamo not tested here, not counted as dictionary characters) 7512e5b6d6dSopenharmony_ci# Disable them now because we don't include a Korean dictionary. 7522e5b6d6dSopenharmony_ci#<data>•\ud55c\uad6d<200>\ub300\ud559\uad50<200>\uc790\uc5f0<200>\uacfc\ud559<200>\ub300\ud559<200>\ubb3c\ub9ac\ud559\uacfc<200></data> 7532e5b6d6dSopenharmony_ci#<data>•\ud604\uc7ac<200>\ub294<200> •\uac80\ucc30<200>\uc774<200> •\ubd84\uc2dd<200>\ud68c\uacc4<200>\ubb38\uc81c<200>\ub97c<200> •\uc870\uc0ac<200>\ud560<200> •\uac00\ub2a5\uc131<200>\uc740<200> •\uc5c6\ub2e4<200>\u002e•</data> 7542e5b6d6dSopenharmony_ci 7552e5b6d6dSopenharmony_ci<data>•abc<200>\u4e01<400>\u4e02<400>\u3005<400>\u4e03\u4e03<400>abc<200> •</data> 7562e5b6d6dSopenharmony_ci 7572e5b6d6dSopenharmony_ci<data>•\u06c9<200>\uc799\ufffa•</data> 7582e5b6d6dSopenharmony_ci 7592e5b6d6dSopenharmony_ci 7602e5b6d6dSopenharmony_ci# 7612e5b6d6dSopenharmony_ci# Try some words from other scripts. 7622e5b6d6dSopenharmony_ci# 7632e5b6d6dSopenharmony_ci 7642e5b6d6dSopenharmony_ci# Try some words from other scripts. 7652e5b6d6dSopenharmony_ci# Greek, Cyrillic, Hebrew, Arabic, Arabic, Georgian, Latin 7662e5b6d6dSopenharmony_ci# 7672e5b6d6dSopenharmony_ci<data>•ΑΒΓ<200> •БВГ<200> •אבג֓<200> •ابت<200> •١٢٣<100> •\u10A0\u10A1\u10A2<200> •ABC<200> •</data> 7682e5b6d6dSopenharmony_ci 7692e5b6d6dSopenharmony_ci<data>•\u0301•A<200></data> 7702e5b6d6dSopenharmony_ci 7712e5b6d6dSopenharmony_ci 7722e5b6d6dSopenharmony_ci# 7732e5b6d6dSopenharmony_ci# Hindi word break tests, imported from the old RBBI tests. 7742e5b6d6dSopenharmony_ci# An historical note: a much earlier version of ICU break iterators had a number 7752e5b6d6dSopenharmony_ci# of special case rules for Hindi, which were tested by an earlier version of 7762e5b6d6dSopenharmony_ci# this test data. The current RBBI rules do not special case Hindi in 7772e5b6d6dSopenharmony_ci# any way, making this test data much less signfificant. 7782e5b6d6dSopenharmony_ci# 7792e5b6d6dSopenharmony_ci<data>•\u0917\u092a\u00ad\u0936\u092a<200>!•\u092f\u0939<200> •\u0939\u093f\u0928\u094d\u200d\u0926\u0940<200> •\u0939\u0948<200> •\u0905\u093e\u092a<200> •\u0938\u093f\u0916\u094b\u0917\u0947<200>?•\n•:•\u092a\u094d\u0930\u093e\u092f\u0903<200> 7802e5b6d6dSopenharmony_ci•\u0935\u0930\u094d\u0937\u093e<200>\r\n•\u092a\u094d\u0930\u0915\u093e\u0936<200>,•\u0924\u0941\u092e\u093e\u0930\u094b<200> •\u092e\u093f\u0924\u094d\u0930<200> •\u0915\u093e<200> •\u092a\u0924\u094d\u0930<200> •\u092a\u095d\u094b<200> •\u0938\u094d\u0924\u094d\u0930\u093f<200>.• •\u0968\u0966.\u0969\u096f<100> •\u0967\u0966\u0966.\u0966\u0966<100>\u20a8•\u0967,\u0967\u0966\u0966.\u0966\u0966<100> •\u0905\u092e\u091c<200>\n•\u0938\u094d\u200d\u0935\u0924\u0902\u0924\u094d\u0930<200>\r•</data> 7812e5b6d6dSopenharmony_ci 7822e5b6d6dSopenharmony_ci# 7832e5b6d6dSopenharmony_ci# Failures from monkey tests 7842e5b6d6dSopenharmony_ci# 7852e5b6d6dSopenharmony_ci<data>•\u8527<400>\u02ba<200>\u0027\u0d42•\u00b7•\u09ea<100></data> 7862e5b6d6dSopenharmony_ci 7872e5b6d6dSopenharmony_ci# 7882e5b6d6dSopenharmony_ci# Jitterbug 5276 - treat Japanese half width voicing marks as Grapheme Extend 7892e5b6d6dSopenharmony_ci# 7902e5b6d6dSopenharmony_ci<data>•A\uff9e\uff9fBC<200> •1\uff9e\uff9f23<100></data> 7912e5b6d6dSopenharmony_ci 7922e5b6d6dSopenharmony_ci# User guide example: 7932e5b6d6dSopenharmony_ci<data>•Parlez<200>-•vous<200> •français<200> •?•</data> 7942e5b6d6dSopenharmony_ci 7952e5b6d6dSopenharmony_ci# Test for #11673 7962e5b6d6dSopenharmony_ci<word> 7972e5b6d6dSopenharmony_ci<data>•ジョージア<400> •</data> 7982e5b6d6dSopenharmony_ci 7992e5b6d6dSopenharmony_ci<word> 8002e5b6d6dSopenharmony_ci<data>•[<0>携帯<400>電話<400>]<0>お金<400>が<400>かかる<400>ん<400>です<400>。<0></data> 8012e5b6d6dSopenharmony_ci 8022e5b6d6dSopenharmony_ci# Test for #11723 8032e5b6d6dSopenharmony_ci<word> 8042e5b6d6dSopenharmony_ci<data>•アレルギー性<400>結膜炎<400></data> 8052e5b6d6dSopenharmony_ci<data>•アテ<400>ローム<400>性<400>動脈硬化<400></data> 8062e5b6d6dSopenharmony_ci 8072e5b6d6dSopenharmony_ci# Ticket #11996 8082e5b6d6dSopenharmony_ci<locale en> 8092e5b6d6dSopenharmony_ci<word> 8102e5b6d6dSopenharmony_ci<data>•栃木<400>県<400>足利<400>市<400>で<400>の<400>撮影<400>が<400>公開<400></data> 8112e5b6d6dSopenharmony_ci<data>•栃木<400>県<400>足利<400>市<400>で<400>の<400>撮影<400>が<400>公開<400>さ<400>れ<400>た<400></data> 8122e5b6d6dSopenharmony_ci 8132e5b6d6dSopenharmony_ci# Ticket #11999 8142e5b6d6dSopenharmony_ci# Unhandled Break Engine was consuming all characters, not just unhandled. 8152e5b6d6dSopenharmony_ci# \U00011700 is AHOM LETTER KA. There is no dictionary for AHOM, triggering the unhandled engine, 8162e5b6d6dSopenharmony_ci# which then incorrectly also consumed the following Japanese text. (ICU4J only) 8172e5b6d6dSopenharmony_ci<word> 8182e5b6d6dSopenharmony_ci<locale en> 8192e5b6d6dSopenharmony_ci<data>•ロ<400>から<400>売却<400>完了<400>時<400>の<400>時価<400>が<400>提示<400>さ<400>れ<400>て<400>いる<400></data> 8202e5b6d6dSopenharmony_ci<data>•\U00011700<200>ロ<400>から<400>売却<400>完了<400>時<400>の<400>時価<400>が<400>提示<400>さ<400>れ<400>て<400>いる<400></data> 8212e5b6d6dSopenharmony_ci 8222e5b6d6dSopenharmony_ci# 8232e5b6d6dSopenharmony_ci# Ticket #13549 8242e5b6d6dSopenharmony_ci# CjiBreakEngine::divideUpDictionaryRange: assertion failure. 8252e5b6d6dSopenharmony_ci# 8262e5b6d6dSopenharmony_ci<locale en> 8272e5b6d6dSopenharmony_ci<word> 8282e5b6d6dSopenharmony_ci<data>•\U00020029<400>\u3300<400>\U0002C400<400></data> 8292e5b6d6dSopenharmony_ci<data>•\uFAD7<400>\u331B<400>\u87DF<400>\u006D<200>\uFFFD•</data> 8302e5b6d6dSopenharmony_ci 8312e5b6d6dSopenharmony_ci# 8322e5b6d6dSopenharmony_ci# What Is Unicode in Japanese 8332e5b6d6dSopenharmony_ci# From http://unicode.org/standard/translations/japanese.html 8342e5b6d6dSopenharmony_ci 8352e5b6d6dSopenharmony_ci<locale en> 8362e5b6d6dSopenharmony_ci<word> 8372e5b6d6dSopenharmony_ci<data><0>ユニ<400>コード<400>と<400>は<400>何<400>か<400>?<0></data> 8382e5b6d6dSopenharmony_ci<data><0>ユニ<400>コード<400>は<400>、<0>すべて<400>の<400>文字<400>に<400>固有<400>の<400>番号<400>を<400>付与<400>し<400>ます<400></data> 8392e5b6d6dSopenharmony_ci<data><0>プラットフォーム<400>に<400>は<400>依存<400>しま<400>せん<400></data> 8402e5b6d6dSopenharmony_ci<data><0>プログラム<400>に<400>も<400>依存<400>しま<400>せん<400></data> 8412e5b6d6dSopenharmony_ci<data><0>言語<400>に<400>も<400>依存<400>しま<400>せん<400></data> 8422e5b6d6dSopenharmony_ci 8432e5b6d6dSopenharmony_ci<data><0>コンピューター<400>は<400>、<0>本質<400>的<400>に<400>は<400>数字<400>しか<400>扱う<400>こと<400>が<400>でき<400>ま<400>せん<400>。<0>\ 8442e5b6d6dSopenharmony_ciコンピューター<400>は<400>、<0>文字<400>や<400>記号<400>など<400>の<400>それぞれに<400>番号<400>を<400>割り振る<400>こと<400>によって<400>扱える<400>\ 8452e5b6d6dSopenharmony_ciよう<400>にし<400>ます<400>。<0>ユニ<400>コード<400>が<400>出来る<400>まで<400>は<400>、<0>これらの<400>番号<400>を<400>割り振る<400>仕組み<400>が<400>\ 8462e5b6d6dSopenharmony_ci何<400>百<400>種類<400>も<400>存在<400>しま<400>した<400>。<0>どの<400>一つ<400>を<400>とっても<400>、<0>十分<400>な<400>文字<400>を<400>含<400>\ 8472e5b6d6dSopenharmony_ciんで<400>は<400>いま<400>せん<400>で<400>した<400>。<0>例えば<400>、<0>欧州<400>連合<400>一つ<400>を<400>見<400>て<400>も<400>、<0>その<400>\ 8482e5b6d6dSopenharmony_ciすべて<400>の<400>言語<400>を<400>カバー<400>する<400>ため<400>に<400>は<400>、<0>いくつか<400>の<400>異なる<400>符号<400>化<400>の<400>仕組み<400>\ 8492e5b6d6dSopenharmony_ciが<400>必要<400>で<400>した<400>。<0>英語<400>の<400>よう<400>な<400>一つ<400>の<400>言語<400>に<400>限<400>って<400>も<400>、<0>一つ<400>だけ<400>\ 8502e5b6d6dSopenharmony_ciの<400>符号<400>化<400>の<400>仕組み<400>では<400>、<0>一般<400>的<400>に<400>使<400>われる<400>すべて<400>の<400>文字<400>、<0>句読点<400>、<0>\ 8512e5b6d6dSopenharmony_ci技術<400>的<400>な<400>記号<400>など<400>を<400>扱う<400>に<400>は<400>不十分<400>で<400>した<400>。<0></data> 8522e5b6d6dSopenharmony_ci 8532e5b6d6dSopenharmony_ci<data><0>これらの<400>符号<400>化<400>の<400>仕組み<400>は<400>、<0>相互<400>に<400>矛盾<400>する<400>もの<400>でも<400>ありま<400>した<400>。<0>\ 8542e5b6d6dSopenharmony_ci二つ<400>の<400>異なる<400>符号<400>化<400>の<400>仕組み<400>が<400>、<0>二つ<400>の<400>異なる<400>文字<400>に<400>同一<400>の<400>番号<400>\ 8552e5b6d6dSopenharmony_ciを<400>付ける<400>こと<400>も<400>できる<400>し<400>、<0>同じ<400>文字<400>に<400>異なる<400>番号<400>を<400>付ける<400>こと<400>も<400>できる<400>\ 8562e5b6d6dSopenharmony_ciの<400>です<400>。<0>どの<400>よう<400>な<400>コンピューター<400>も<400>(<0>特に<400>サーバー<400>は<400>)<0>多く<400>の<400>異<400>な<400>っ<400>\ 8572e5b6d6dSopenharmony_ciた<400>符号<400>化<400>の<400>仕組み<400>を<400>サポート<400>する<400>必要<400>が<400>あり<400>ます<400>。<0>たとえ<400>データ<400>が<400>異なる<400>\ 8582e5b6d6dSopenharmony_ci符号<400>化<400>の<400>仕組み<400>や<400>プラットフォーム<400>を<400>通過<400>し<400>て<400>も<400>、<0>いつ<400>どこ<400>で<400>データ<400>が<400>\ 8592e5b6d6dSopenharmony_ci乱れる<400>か<400>分<400>から<400>ない<400>危険<400>を<400>冒す<400>こと<400>の<400>なる<400>の<400>です<400>。<0></data> 8602e5b6d6dSopenharmony_ci 8612e5b6d6dSopenharmony_ci<data><0>ユニ<400>コード<400>は<400>すべて<400>を<400>変<400>え<400>ます<400></data> 8622e5b6d6dSopenharmony_ci 8632e5b6d6dSopenharmony_ci<data><0>ユニ<400>コード<400>は<400>、<0>プラットフォーム<400>に<400>係<400>わら<400>ず<400>、<0>プログラム<400>に<400>係<400>わら<400>ず<400>、<0>\ 8642e5b6d6dSopenharmony_ci言語<400>に<400>係<400>わら<400>ず<400>、<0>すべて<400>の<400>文字<400>に<400>独立<400>した<400>番号<400>を<400>与<400>え<400>ます<400>。<0>\ 8652e5b6d6dSopenharmony_ciユニ<400>コード<400>標準<400>は<400>、<0>アップル<400>、<0>ヒュー<400>レット<400>パッ<400>カード<400>、<0>IBM<200>、<0>ジャスト<400>システム<400>\ 8662e5b6d6dSopenharmony_ci、<0>マイクロ<400>ソフト<400>、<0>オラクル<400>、<0>SAP<200>、<0>サン<400>、<0>サイ<400>ベース<400>など<400>の<400>産業<400>界<400>の<400>\ 8672e5b6d6dSopenharmony_ci主導<400>的<400>企業<400>と<400>他の<400>多く<400>の<400>企業<400>に<400>採用<400>さ<400>れ<400>てい<400>ます<400>。<0>ユニ<400>コード<400>\ 8682e5b6d6dSopenharmony_ciは<400>、<0>XML<200>、<0>Java<200>、<0>ECMAScript<200>(<0>JavaScript<200>)<0>、<0>LDAP<200>、<0>CORBA<200> <0>3.0<100>など<400>\ 8692e5b6d6dSopenharmony_ciの<400>最先端<400>の<400>標準<400>の<400>前提<400>と<400>な<400>って<400>おり<400>、<0>ユニ<400>コード<400>を<400>実装<400>す<400>れ<400>ば<400>\ 8702e5b6d6dSopenharmony_ci、<0>ISO<200>/<0>IEC<200></data> 8712e5b6d6dSopenharmony_ci<data><0> <0>10646<100>に<400>適合<400>する<400>ことに<400>なり<400>ます<400>。<0>ユニ<400>コード<400>は<400>、<0>多く<400>の<400>\ 8722e5b6d6dSopenharmony_ciオペレーティングシステム<400>と<400>すべて<400>の<400>最新<400>の<400>ブラウザー<400>と<400>他の<400>多く<400>の<400>製品<400>で<400>サポート<400>\ 8732e5b6d6dSopenharmony_ciさ<400>れ<400>てい<400>ます<400>。<0>ユニ<400>コード<400>標準<400>の<400>出現<400>と<400>ユニ<400>コード<400>を<400>サポート<400>する<400>\ 8742e5b6d6dSopenharmony_ciツール<400>類<400>は<400>、<0>昨今<400>顕著<400>に<400>な<400>って<400>いる<400>ソフトウエア<400>技術<400>の<400>グローバル<400>化<400>の<400>\ 8752e5b6d6dSopenharmony_ci流れ<400>に対して<400>、<0>特に<400>役<400>に<400>立<400>って<400>い<400>ます<400>。<0></data> 8762e5b6d6dSopenharmony_ci 8772e5b6d6dSopenharmony_ci<data><0>ユニ<400>コード<400>を<400>ク<400>ライアン<400>ト<400>サーバー<400>型<400>の<400>アプリケーション<400>や<400>、<0>多層<400>構造<400>\ 8782e5b6d6dSopenharmony_ciを<400>持つ<400>アプリケーション<400>、<0>ウェブサイト<400>など<400>に<400>に<400>組み込む<400>こと<400>で<400>、<0>従来<400>の<400>文字<400>\ 8792e5b6d6dSopenharmony_ciコードセット<400>を<400>用いる<400>より<400>も<400>明らか<400>な<400>コスト<400>削減<400>が<400>可能<400>です<400>。<0>ユニ<400>コード<400>は<400>\ 8802e5b6d6dSopenharmony_ci、<0>単一<400>の<400>ソフトウエア<400>製品<400>、<0>単一<400>の<400>ウェブサイト<400>に<400>、<0>何ら<400>手<400>を<400>加える<400>こと<400>なく<400>\ 8812e5b6d6dSopenharmony_ci、<0>複数<400>の<400>プラットフォーム<400>、<0>複数<400>の<400>言語<400>、<0>複数<400>の<400>国<400>を<400>カバー<400>する<400>こと<400>が<400>\ 8822e5b6d6dSopenharmony_ci出来る<400>の<400>です<400>。<0>ユニ<400>コード<400>は<400>、<0>データ<400>が<400>多く<400>の<400>異なる<400>システム<400>の<400>間<400>を<400>、<0>\ 8832e5b6d6dSopenharmony_ci何<400>の<400>乱れ<400>も<400>なし<400>に<400>転送<400>する<400>こと<400>を<400>可能<400>と<400>する<400>の<400>です<400>。<0></data> 8842e5b6d6dSopenharmony_ci 8852e5b6d6dSopenharmony_ci<data><0>ユニ<400>コード<400>コンソーシアム<400>について<400></data> 8862e5b6d6dSopenharmony_ci 8872e5b6d6dSopenharmony_ci<data><0>ユニ<400>コード<400>コンソーシアム<400>は<400>、<0>最新<400>の<400>ソフトウエア<400>製品<400>と<400>標準<400>において<400>テキスト<400>\ 8882e5b6d6dSopenharmony_ciを<400>表現<400>する<400>こと<400>を<400>意味<400>する<400>“<0>ユニ<400>コード<400>標準<400>”<0>の<400>構築<400>、<0>発展<400>、<0>普及<400>、<0>\ 8892e5b6d6dSopenharmony_ci利用<400>促進<400>を<400>目的<400>として<400>設立<400>さ<400>れ<400>た<400>非<400>営利<400>組織<400>です<400>。<0>同<400>コンソーシアム<400>\ 8902e5b6d6dSopenharmony_ciの<400>会員<400>は<400>、<0>コンピューター<400>と<400>情報処理<400>に<400>係わる<400>広汎<400>な<400>企業<400>や<400>組織<400>から<400>構成<400>\ 8912e5b6d6dSopenharmony_ciさ<400>れ<400>てい<400>ます<400>。<0>同<400>コンソーシアム<400>は<400>、<0>財政<400>的<400>に<400>は<400>、<0>純粋<400>に<400>会費<400>のみ<400>\ 8922e5b6d6dSopenharmony_ciによって<400>運営<400>さ<400>れ<400>てい<400>ます<400>。<0>ユニ<400>コード<400>標準<400>を<400>支持<400>し<400>、<0>その<400>拡張<400>と<400>\ 8932e5b6d6dSopenharmony_ci実装<400>を<400>支援<400>する<400>世界中<400>の<400>組織<400>や<400>個人<400>は<400>、<0>だれ<400>も<400>が<400>ユニ<400>コード<400>\ 8942e5b6d6dSopenharmony_ciコンソーシアム<400>の<400>会員<400>なる<400>こと<400>が<400>でき<400>ます<400>。<0></data> 8952e5b6d6dSopenharmony_ci 8962e5b6d6dSopenharmony_ci<data><0>より<400>詳しい<400>こと<400>を<400>お<400>知<400>り<400>に<400>なり<400>たい<400>方<400>は<400>、<0>Glossary<200>,<0> <0>\ 8972e5b6d6dSopenharmony_ciTechnical<200> <0>Introduction<200> <0>および<400> <0>Useful<200> <0>Resources<200>を<400>ご<400>参照<400>くだ<400>さい<400>。<0></data> 8982e5b6d6dSopenharmony_ci 8992e5b6d6dSopenharmony_ci# 9002e5b6d6dSopenharmony_ci# Bug ICU-13334 Report of different boundaries between Windows and Linux. 9012e5b6d6dSopenharmony_ci# 9022e5b6d6dSopenharmony_ci<data><0>丙酮<400></data> 9032e5b6d6dSopenharmony_ci 9042e5b6d6dSopenharmony_ci######################################################################################## 9052e5b6d6dSopenharmony_ci# 9062e5b6d6dSopenharmony_ci# 9072e5b6d6dSopenharmony_ci# S e n t e n c e B o u n d a r y T e s t s 9082e5b6d6dSopenharmony_ci# 9092e5b6d6dSopenharmony_ci# 9102e5b6d6dSopenharmony_ci########################################################################################## 9112e5b6d6dSopenharmony_ci 9122e5b6d6dSopenharmony_ci 9132e5b6d6dSopenharmony_ci# 9142e5b6d6dSopenharmony_ci# Test data originally from RBBI RBBITest::TestDefaultRuleBasedSentenceIteration() 9152e5b6d6dSopenharmony_ci# 9162e5b6d6dSopenharmony_ci<sent> 9172e5b6d6dSopenharmony_ci 9182e5b6d6dSopenharmony_ci 9192e5b6d6dSopenharmony_ci<sent> 9202e5b6d6dSopenharmony_ci<data>•This\n<100></data> 9212e5b6d6dSopenharmony_ci<data>•Hello! •how are you? •I'am fine. •Thankyou. •How are you \ 9222e5b6d6dSopenharmony_cidoing? •This\n<100> costs $20,00,000. •</data> 9232e5b6d6dSopenharmony_ci 9242e5b6d6dSopenharmony_ci 9252e5b6d6dSopenharmony_ci# Sentence ending in a quote. 9262e5b6d6dSopenharmony_ci<data>•"Sentence ending with a quote." •Bye.•</data> 9272e5b6d6dSopenharmony_ci 9282e5b6d6dSopenharmony_ci# Sentence, and test data, ending without a period or other terminator. 9292e5b6d6dSopenharmony_ci<data>•Here is a random sentence, no ending period<100></data> 9302e5b6d6dSopenharmony_ci 9312e5b6d6dSopenharmony_ci 9322e5b6d6dSopenharmony_ci<data>• (This is it). •Testing the sentence iterator. •\ 9332e5b6d6dSopenharmony_ci"This isn't it." •Hi! \ 9342e5b6d6dSopenharmony_ci•This is a simple sample sentence. •(This is it.) •This is a simple sample sentence. •\ 9352e5b6d6dSopenharmony_ci"This isn't it." •\ 9362e5b6d6dSopenharmony_ciHi! •This is a simple sample sentence. •It does not have to make any sense as you can see. •Nel mezzo del cammin di nostra vita, mi ritrovai in una selva oscura. •Che la dritta via aveo smarrita. •He said, that I said, that you said!! •Don't rock the boat.\u2029•Because I am the daddy, that is why. 9372e5b6d6dSopenharmony_ci•Not on my time (el timo.)! •</data> 9382e5b6d6dSopenharmony_ci 9392e5b6d6dSopenharmony_ci<data>•Hello. •So what!!\u2029•"But now," he said, \ 9402e5b6d6dSopenharmony_ci"I know!" •\ 9412e5b6d6dSopenharmony_ciHarris thumbed down several, including "Away We Go" (which became the huge success Oklahoma!). •One species, B. anthracis, is highly virulent. 9422e5b6d6dSopenharmony_ci•Wolf said about Sounder:\ 9432e5b6d6dSopenharmony_ci"Beautifully thought-out and directed." •\ 9442e5b6d6dSopenharmony_ciHave you ever said, "This is where\tI shall live"? •He answered, \ 9452e5b6d6dSopenharmony_ci"You may not!" •Another popular saying is: "How do you do?". \n•\ 9462e5b6d6dSopenharmony_ciYet another popular saying is: \ 9472e5b6d6dSopenharmony_ci'I'm fine thanks.' •\ 9482e5b6d6dSopenharmony_ciWhat is the proper use of the abbreviation pp.? •Yes, I am definatelly 12" tall!!\ 9492e5b6d6dSopenharmony_ci•Now\r<100>is\n<100>the\r\n<100>time\n<100>\r<100>for\r<100>\r<100></data> 9502e5b6d6dSopenharmony_ci 9512e5b6d6dSopenharmony_ci<data>•No breaks when . is surrounded by UPPER.Case letters. •</data> 9522e5b6d6dSopenharmony_ci<data>•No breaks when . is followed by Numeric .4 a.4 C.4 3.1 .•</data> 9532e5b6d6dSopenharmony_ci<data>•No breaks when . is followed by a lower, with possible intervening punct .,a .$a .)a. •</data> 9542e5b6d6dSopenharmony_ci 9552e5b6d6dSopenharmony_ci# 9562e5b6d6dSopenharmony_ci# Sentence Breaks: no break at the boundary between CJK and other letters. TestBug4111338 9572e5b6d6dSopenharmony_ci# 9582e5b6d6dSopenharmony_ci<data>•\u5487\u67ff\ue591\u5017\u61b3\u60a1\u9510\u8165:"JAVA\u821c\u8165\u7fc8\u51ce\u306d,\u2494\u56d8\u4ec0\u60b1\u8560\u51ba\u611d\u57b6\u2510\u5d46".\u2029\ 9592e5b6d6dSopenharmony_ci•\u5487\u67ff\ue591\u5017\u61b3\u60a1\u9510\u8165\u9de8\u97e4JAVA\u821c\u8165\u7fc8\u51ce\u306d\ue30b\u2494\u56d8\u4ec0\u60b1\u8560\u51ba\u611d\u57b6\u2510\u5d46\u97e5\u7751\u3002\ 9602e5b6d6dSopenharmony_ci•\u5487\u67ff\ue591\u5017\u61b3\u60a1\u9510\u8165\u9de8\u97e4\u6470\u8790JAVA\u821c\u8165\u7fc8\u51ce\u306d\ue30b\u2494\u56d8\u4ec0\u60b1\u8560\u51ba\u611d\u57b6\u2510\u5d46\u97e5\u7751\u2048\ 9612e5b6d6dSopenharmony_ci•He said, "I can go there."\u2029•Bye, now.•</data> 9622e5b6d6dSopenharmony_ci 9632e5b6d6dSopenharmony_ci# 9642e5b6d6dSopenharmony_ci# Treat fullwidth variants of .!? the same as their 9652e5b6d6dSopenharmony_ci# normal counterparts 9662e5b6d6dSopenharmony_ci# 9672e5b6d6dSopenharmony_ci<data>•I know I'm right\uff0e •Right\uff1f •Right\uff01 •</data> 9682e5b6d6dSopenharmony_ci 9692e5b6d6dSopenharmony_ci 9702e5b6d6dSopenharmony_ci# 9712e5b6d6dSopenharmony_ci# Don't break sentences at boundary between CJK and digits 9722e5b6d6dSopenharmony_ci# 9732e5b6d6dSopenharmony_ci<data>•\u5487\u67ff\ue591\u5017\u61b3\u60a1\u9510\u8165\u9de8\u97e48888\u821c\u8165\u7fc8\u51ce\u306d\ue30b\u2494\u56d8\u4ec0\u60b1\u8560\u51ba\u611d\u57b6\u2510\u5d46\u97e5\u7751\u3002•Bye, now<100></data> 9742e5b6d6dSopenharmony_ci 9752e5b6d6dSopenharmony_ci# 9762e5b6d6dSopenharmony_ci# Breaks around '(' following a sentence TERM. (Rule 9) 9772e5b6d6dSopenharmony_ci# 9782e5b6d6dSopenharmony_ci<data>•How do you do?(•Fine). •</data> 9792e5b6d6dSopenharmony_ci<data>•How do you do? •(Fine). •</data> 9802e5b6d6dSopenharmony_ci<data>•How do you do?(•fine). •</data> 9812e5b6d6dSopenharmony_ci<data>•How do you do? •(fine). •</data> 9822e5b6d6dSopenharmony_ci 9832e5b6d6dSopenharmony_ci# 9842e5b6d6dSopenharmony_ci<data>•Hello.123<100></data> # Rule 6 9852e5b6d6dSopenharmony_ci<data>•Hello?•123<100></data> 9862e5b6d6dSopenharmony_ci 9872e5b6d6dSopenharmony_ci<data>•HELLO.Bye<100></data> # Rule 7 9882e5b6d6dSopenharmony_ci<data>•HELLO?•Bye<100></data> 9892e5b6d6dSopenharmony_ci 9902e5b6d6dSopenharmony_ci<data>•Hello.goodbye<100></data> #Rule 8 9912e5b6d6dSopenharmony_ci<data>•Hello. •Goodbye<100></data> 9922e5b6d6dSopenharmony_ci<data>•Hello. goodbye<100></data> 9932e5b6d6dSopenharmony_ci 9942e5b6d6dSopenharmony_ci 9952e5b6d6dSopenharmony_ci 9962e5b6d6dSopenharmony_ci# 9972e5b6d6dSopenharmony_ci# test for bug #4158381: No breaks when there are no terminators around 9982e5b6d6dSopenharmony_ci# 9992e5b6d6dSopenharmony_ci<data>•\<P>Provides a set of "lightweight" (all-java\<FONT SIZE="-2">\<SUP>TM\</SUP>\</FONT> language) components that, to the maximum degree possible, work the same on all platforms. •</data> 10002e5b6d6dSopenharmony_ci<data>•Another test.\u2029•</data> 10012e5b6d6dSopenharmony_ci 10022e5b6d6dSopenharmony_ci# test for bug #4143071: Make sure sentences that end with digits 10032e5b6d6dSopenharmony_ci# work right 10042e5b6d6dSopenharmony_ci# 10052e5b6d6dSopenharmony_ci<data>•Today is the 27th of May, 1998. •</data> 10062e5b6d6dSopenharmony_ci<data>•Tomorrow with be 28 May 1998. •</data> 10072e5b6d6dSopenharmony_ci<data>•The day after will be the 30th.\u2029•</data> 10082e5b6d6dSopenharmony_ci 10092e5b6d6dSopenharmony_ci# test for bug #4152416: Make sure sentences ending with a capital 10102e5b6d6dSopenharmony_ci# letter are treated correctly 10112e5b6d6dSopenharmony_ci# 10122e5b6d6dSopenharmony_ci<data>•The type of all primitive \<code>boolean\</code> values accessed in the target VM. •\ 10132e5b6d6dSopenharmony_ciCalls to xxx will return an implementor of this interface. \u2029•</data> 10142e5b6d6dSopenharmony_ci 10152e5b6d6dSopenharmony_ci# test for bug #4152117: Make sure sentence breaking is handling 10162e5b6d6dSopenharmony_ci# punctuation correctly [COULD NOT REPRODUCE THIS BUG, BUT TEST IS 10172e5b6d6dSopenharmony_ci# HERE TO MAKE SURE IT DOESN'T CROP UP] 10182e5b6d6dSopenharmony_ci# 10192e5b6d6dSopenharmony_ci<data>•Constructs a randomly generated BigInteger, uniformly distributed over the range \<tt>0\</tt> to\ 10202e5b6d6dSopenharmony_ci \<tt>(2\<sup>numBits\</sup> - 1\)\</tt>, inclusive. \ 10212e5b6d6dSopenharmony_ci •The uniformity of the distribution assumes that a fair source of random bits is provided in \<tt>rnd\</tt>. \ 10222e5b6d6dSopenharmony_ci •Note that this constructor always constructs a non-negative biginteger. \n•Ahh abc.•</data> 10232e5b6d6dSopenharmony_ci 10242e5b6d6dSopenharmony_ci# sentence breaks for hindi which used Devanagari script 10252e5b6d6dSopenharmony_ci# make sure there is sentence break after ?,danda(hindi phrase separator), 10262e5b6d6dSopenharmony_ci# fullstop followed by space. (VERY old test) 10272e5b6d6dSopenharmony_ci# 10282e5b6d6dSopenharmony_ci<data>•\u0928\u092e\u0938\u094d\u200d\u0924\u0947 \u0930\u092e\u0947\u0936\u0905\u093e\u092a\u0915\u0948\u0938\u0947 \u0939\u0948?\ 10292e5b6d6dSopenharmony_ci•\u092e\u0948 \u0905\u091a\u094d\u200d \u091b\u093e \u0939\u0942\u0901\u0964 •\u0905\u093e\u092a\r\n<100>\ 10302e5b6d6dSopenharmony_ci\u0915\u0948\u0938\u0947 \u0939\u0948?•\u0935\u0939 \u0915\u094d\u200d\u092f\u093e\n\ 10312e5b6d6dSopenharmony_ci<100>\u0939\u0948?•\u092f\u0939 \u0905\u093e\u092e \u0939\u0948. •\u092f\u0939 means "this". \ 10322e5b6d6dSopenharmony_ci•"\u092a\u095d\u093e\u0908" meaning "education" or "studies". \ 10332e5b6d6dSopenharmony_ci•\u0905\u093e\u091c(\u0938\u094d\u200d\u0935\u0924\u0902\u0924\u094d\u0930 \u0926\u093f\u0935\u093e\u0938) \u0939\u0948\u0964 •Let's end here. •</data> 10342e5b6d6dSopenharmony_ci 10352e5b6d6dSopenharmony_ci# Regression test for bug #1984, Sentence break in Arabic text. 10362e5b6d6dSopenharmony_ci 10372e5b6d6dSopenharmony_ci<data>\ 10382e5b6d6dSopenharmony_ci•\u0623\u0633\u0627\u0633\u064b\u0627\u060c\u0020\u062a\u062a\u0639\u0627"\u0645\u0644\u0020\u0627\u0644\u062d\u0648\u0627\u0633\u064a\u0628\u0020"\u0641\u0642\u0637\u0020\u0645\u0639\u0020\u0627\u0644\u0623\u0631\u0642\u0627\u0645\u060c\u0648\u062a\u0642\u0648\u0645\u0020\u0628\u062a\u062e\u0632\u064a\u0646\u0020\u0627\u0644\u0623\u062d\u0631\u0641\u0020\u0648\u0627\u0644\u0645\u062d\u0627\u0631\u0641\u0020\u0627\u0644\u0623\u062e\u0631\u0649\u0020\u0628\u0639\u062f\u0020\u0623\u0646\u062a\u064f\u0639\u0637\u064a\u0020\u0631\u0642\u0645\u0627\u0020\u0645\u0639\u064a\u0646\u0627\u0020\u0644\u0643\u0644\u0020\u0648\u0627\u062d\u062f\u0020\u0645\u0646\u0647\u0627\u002e\u0020•\u0648\u0642\u0628\u0644\u0020\u0627\u062e\u062a\u0631\u0627\u0639\u0022\u064a\u0648\u0646\u0650\u0643\u0648\u062f\u0022\u060c\u0020\u0643\u0627\u0646\u0020\u0647\u0646\u0627\u0643\u0020\u0645\u0626\u0627\u062a\u0020\u0627\u0644\u0623\u0646\u0638\u0645\u0629\u0020\u0644\u0644\u062a\u0634\u0641\u064a\u0631\u0648\u062a\u062e\u0635\u064a\u0635\u0020\u0647\u0630\u0647\u0020\u0627\u0644\u0623\u0631\u0642\u0627\u0645\u0020\u0644\u0644\u0645\u062d\u0627\u0631\u0641\u060c\u0020\u0648\u0644\u0645\u0020\u064a\u0648\u062c\u062f\u0020\u0646\u0638\u0627\u0645\u062a\u0634\u0641\u064a\u0020\u0639\u0644\u0649\u0020\u062c\u0645\u064a\u0639\u0020\u0627\u0644\u0645\u062d\u0627\u0631\u0641\u0020\u0627\u0644\u0636\u0631\u0648\u0631\u064a\u0629. •</data> 10392e5b6d6dSopenharmony_ci 10402e5b6d6dSopenharmony_ci# Try a few more of the less common sentence endings. 10412e5b6d6dSopenharmony_ci<data>•Hello, world\u3002 •Hello, world\u1803 •Hello, world\u2048 •Hello, world\u203c •Let's end here. •</data> 10422e5b6d6dSopenharmony_ci 10432e5b6d6dSopenharmony_ci 10442e5b6d6dSopenharmony_ci 10452e5b6d6dSopenharmony_ci 10462e5b6d6dSopenharmony_ci################################################################ 10472e5b6d6dSopenharmony_ci# 10482e5b6d6dSopenharmony_ci# 10492e5b6d6dSopenharmony_ci# L I N E B R E A K 10502e5b6d6dSopenharmony_ci# 10512e5b6d6dSopenharmony_ci# 10522e5b6d6dSopenharmony_ci################################################################ 10532e5b6d6dSopenharmony_ci 10542e5b6d6dSopenharmony_ci<line> 10552e5b6d6dSopenharmony_ci# 10562e5b6d6dSopenharmony_ci# Test Character for each of the line break classes. 10572e5b6d6dSopenharmony_ci# 10582e5b6d6dSopenharmony_ci# 00A1;AI # INVERTED EXCLAMATION MARK ¡ 10592e5b6d6dSopenharmony_ci# 0041;AL # LATIN CAPITAL LETTER A 10602e5b6d6dSopenharmony_ci# 0009;BA # <control> 10612e5b6d6dSopenharmony_ci# 00B4;BB # ACUTE ACCENT 10622e5b6d6dSopenharmony_ci# 000C;BK # <control> 10632e5b6d6dSopenharmony_ci# 2014;B2 # EM DASH 10642e5b6d6dSopenharmony_ci# FFFC;CB # OBJECT REPLACEMENT CHARACTER 10652e5b6d6dSopenharmony_ci# 0029;CL # RIGHT PARENTHESIS 10662e5b6d6dSopenharmony_ci# 0301;CM # COMBINING ACUTE ACCENT 10672e5b6d6dSopenharmony_ci# 0021;EX # EXCLAMATION MARK 10682e5b6d6dSopenharmony_ci# 00A0;GL # NO-BREAK SPACE 10692e5b6d6dSopenharmony_ci# 002D;HY # HYPHEN-MINUS 10702e5b6d6dSopenharmony_ci# 4E00;ID # <CJK Ideograph, First> 10712e5b6d6dSopenharmony_ci# 2024;IN # ONE DOT LEADER 10722e5b6d6dSopenharmony_ci# 002C;IS # COMMA 10732e5b6d6dSopenharmony_ci# 000A;LF # <control> 10742e5b6d6dSopenharmony_ci# 0E5A;NS # THAI CHARACTER ANGKHANKHU 10752e5b6d6dSopenharmony_ci# 0032;NU # DIGIT TWO 10762e5b6d6dSopenharmony_ci# 0028;OP # LEFT PARENTHESIS 10772e5b6d6dSopenharmony_ci# 0025;PO # PERCENT SIGN 10782e5b6d6dSopenharmony_ci# 0024;PR # DOLLAR SIGN 10792e5b6d6dSopenharmony_ci# 0022;QU # QUOTATION MARK 10802e5b6d6dSopenharmony_ci# 0E01;SA # THAI CHARACTER KO KAI 10812e5b6d6dSopenharmony_ci# DB7F;SG # Surrogate 10822e5b6d6dSopenharmony_ci# 0020;SP # SPACE 10832e5b6d6dSopenharmony_ci# 002F;SY # SOLIDUS / 10842e5b6d6dSopenharmony_ci# F8FF;XX # Private Use 10852e5b6d6dSopenharmony_ci# 200B;ZW # ZERO WIDTH SPACE 10862e5b6d6dSopenharmony_ci 10872e5b6d6dSopenharmony_ci 10882e5b6d6dSopenharmony_ci# 2b Always break at end of text 10892e5b6d6dSopenharmony_ci 10902e5b6d6dSopenharmony_ci<data>• •\u00A1•</data> 10912e5b6d6dSopenharmony_ci<data>• •\u0041•</data> 10922e5b6d6dSopenharmony_ci<data>• •\u0009•</data> 10932e5b6d6dSopenharmony_ci<data>• •\u00B4•</data> 10942e5b6d6dSopenharmony_ci<data>• \u000C<100></data> # LB3C × BK 10952e5b6d6dSopenharmony_ci<data>• •\u2014•</data> 10962e5b6d6dSopenharmony_ci<data>• •\uFFFC•</data> 10972e5b6d6dSopenharmony_ci<data>• \u0029•</data> # LB 8 × CL 10982e5b6d6dSopenharmony_ci# <data>• • \u0301•</data> # LB 7a Treat SP CM* as if it were ID #TODO: SP CM 10992e5b6d6dSopenharmony_ci<data>• \u0021•</data> # LB 8 × EX 11002e5b6d6dSopenharmony_ci#<data>• \u00A0•</data> # LB 11b × GL TODO: fix. 11012e5b6d6dSopenharmony_ci<data>• •\u002D•</data> 11022e5b6d6dSopenharmony_ci<data>• •\u4E00•</data> 11032e5b6d6dSopenharmony_ci<data>• •\u2024•</data> 11042e5b6d6dSopenharmony_ci<data>• \u002C•</data> # LB 8 × IS 11052e5b6d6dSopenharmony_ci<data>• \u000A<100></data> # LB3C × ( BK | CR | LF | NL ) 11062e5b6d6dSopenharmony_ci<data>• •\u0E5A•</data> 11072e5b6d6dSopenharmony_ci<data>• •\u0032•</data> 11082e5b6d6dSopenharmony_ci<data>• •\u0028•</data> 11092e5b6d6dSopenharmony_ci<data>• •\u0025•</data> 11102e5b6d6dSopenharmony_ci<data>• •\u0024•</data> 11112e5b6d6dSopenharmony_ci<data>• •\u0022•</data> 11122e5b6d6dSopenharmony_ci<data>• •\u0E01•</data> 11132e5b6d6dSopenharmony_ci<data>• •\uDB7F•</data> 11142e5b6d6dSopenharmony_ci<data>• \u0020•</data> # LB4 - don't break before space. 11152e5b6d6dSopenharmony_ci<data>• \u002F•</data> # LB 8 × SY 11162e5b6d6dSopenharmony_ci<data>• •\uF8FF•</data> 11172e5b6d6dSopenharmony_ci<data>• \u200B•</data> # LB4 - don't break before ZA 11182e5b6d6dSopenharmony_ci 11192e5b6d6dSopenharmony_ci 11202e5b6d6dSopenharmony_ci# 3a Always break after hard line breaks. 11212e5b6d6dSopenharmony_ci# 3c Never break before hard line breaks. 11222e5b6d6dSopenharmony_ci 11232e5b6d6dSopenharmony_ci<data>• •\u00A1\u2028<100>\u00A1•</data> 11242e5b6d6dSopenharmony_ci<data>• •\u0041\u2028<100>\u0041•</data> 11252e5b6d6dSopenharmony_ci<data>• •\u0009\u2028<100>\u0009•</data> 11262e5b6d6dSopenharmony_ci<data>• •\u00B4\u2028<100>\u00B4•</data> 11272e5b6d6dSopenharmony_ci<data>• \u000C<100>\u2028<100>\u000C<100></data> 11282e5b6d6dSopenharmony_ci<data>• •\u2014\u2028<100>\u2014•</data> 11292e5b6d6dSopenharmony_ci<data>• •\uFFFC\u2028<100>\uFFFC•</data> 11302e5b6d6dSopenharmony_ci<data>• \u0029\u2028<100>\u0029•</data> 11312e5b6d6dSopenharmony_ci<data>• •\u0301\u2028<100>\u0301•</data> 11322e5b6d6dSopenharmony_ci<data>• \u0021\u2028<100>\u0021•</data> 11332e5b6d6dSopenharmony_ci<data>• •\u00A0\u2028<100>\u00A0•</data> 11342e5b6d6dSopenharmony_ci<data>• •\u002D\u2028<100>\u002D•</data> 11352e5b6d6dSopenharmony_ci<data>• •\u4E00\u2028<100>\u4E00•</data> 11362e5b6d6dSopenharmony_ci<data>• •\u2024\u2028<100>\u2024•</data> 11372e5b6d6dSopenharmony_ci<data>• \u002C\u2028<100>\u002C•</data> 11382e5b6d6dSopenharmony_ci<data>• \u000A<100>\u2028<100>\u000A<100></data> 11392e5b6d6dSopenharmony_ci<data>• •\u0E5A\u2028<100>\u0E5A•</data> 11402e5b6d6dSopenharmony_ci<data>• •\u0032\u2028<100>\u0032•</data> 11412e5b6d6dSopenharmony_ci<data>• •\u0028\u2028<100>\u0028•</data> 11422e5b6d6dSopenharmony_ci<data>• •\u0025\u2028<100>\u0025•</data> 11432e5b6d6dSopenharmony_ci<data>• •\u0024\u2028<100>\u0024•</data> 11442e5b6d6dSopenharmony_ci<data>• •\u0022\u2028<100>\u0022•</data> 11452e5b6d6dSopenharmony_ci<data>• •\u0E01\u2028<100>\u0E01•</data> 11462e5b6d6dSopenharmony_ci<data>• •\uDB7F\u2028<100>\uDB7F•</data> 11472e5b6d6dSopenharmony_ci<data>• \u0020\u2028<100>\u0020•</data> 11482e5b6d6dSopenharmony_ci<data>• \u002F\u2028<100>\u002F•</data> 11492e5b6d6dSopenharmony_ci<data>• •\uF8FF\u2028<100>\uF8FF•</data> 11502e5b6d6dSopenharmony_ci<data>• \u200B\u2028<100>\u200B•</data> 11512e5b6d6dSopenharmony_ci 11522e5b6d6dSopenharmony_ci# Regional Indicator sequences. They group in pairs. 11532e5b6d6dSopenharmony_ci# Sequences are long enough that the non-exaustive monkey test won't reliably pick up problems. 11542e5b6d6dSopenharmony_ci 11552e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•</data> 11562e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•\U0001F1E6•</data> 11572e5b6d6dSopenharmony_ci 11582e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6\u00a0\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•</data> 11592e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6\u00a0\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•\U0001F1E6•</data> 11602e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1E6•\U0001F1E6\u00a0\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•</data> 11612e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1E6•\U0001F1E6\u00a0\U0001F1E6\U0001F1E6•\U0001F1E6\U0001F1E6•\U0001F1E6•</data> 11622e5b6d6dSopenharmony_ci 11632e5b6d6dSopenharmony_ci 11642e5b6d6dSopenharmony_ci# User Guide example 11652e5b6d6dSopenharmony_ci 11662e5b6d6dSopenharmony_ci<data>•Parlez-•vous •français ?•</data> 11672e5b6d6dSopenharmony_ci 11682e5b6d6dSopenharmony_ci# 11692e5b6d6dSopenharmony_ci# Old Line Break Test data. Originally located in RBBITest::TestDefaultRuleBasedLineIteration() 11702e5b6d6dSopenharmony_ci# 11712e5b6d6dSopenharmony_ci 11722e5b6d6dSopenharmony_ci<line> 11732e5b6d6dSopenharmony_ci 11742e5b6d6dSopenharmony_ci<data>•Multi-•Level •example •of •a •semi-•idiotic •non-•sensical •(non-•important) •sentence. 11752e5b6d6dSopenharmony_ci<100>Hi •Hello •How\n<100>are\r<100>you\u2028<100>fine.\t•good. •Now\r<100>is\n<100>the\r\n<100>time\n<100>\r<100>for\r<100>\r<100>all•</data> 11762e5b6d6dSopenharmony_ci 11772e5b6d6dSopenharmony_ci<line> 11782e5b6d6dSopenharmony_ci<data>•Hello! •how\r\n<100> •(are)\r<100> •you? •I'am •fine- •Thankyou. •foo\u00a0bar 11792e5b6d6dSopenharmony_ci<100>How, •are, •you? •This, •costs •$20,00,000.•</data> 11802e5b6d6dSopenharmony_ci 11812e5b6d6dSopenharmony_ci# test for bug #4068133 11822e5b6d6dSopenharmony_ci# 11832e5b6d6dSopenharmony_ci<data>•\u96f6•\u4e00\u3002•\u4e8c\u3001•\u4e09\u3002\u3001•\u56db\u3001\u3002\u3001•\u4e94,•\u516d.•\u4e03.\u3001,\u3002•\u516b•</data> 11842e5b6d6dSopenharmony_ci 11852e5b6d6dSopenharmony_ci# to test for bug #4086052 11862e5b6d6dSopenharmony_ci<data>•foo\u00a0bar•</data> 11872e5b6d6dSopenharmony_ci 11882e5b6d6dSopenharmony_ci# to test for bug #4097920 11892e5b6d6dSopenharmony_ci<data>•dog,cat,mouse •(one)•(two)\n<100></data> 11902e5b6d6dSopenharmony_ci 11912e5b6d6dSopenharmony_ci# to test for bug #4035266 11922e5b6d6dSopenharmony_ci<data>•The •balance •is •$-23,456.78, •not •-•$32,456.78!\n<100></data> 11932e5b6d6dSopenharmony_ci 11942e5b6d6dSopenharmony_ci 11952e5b6d6dSopenharmony_ci# to test for bug #4098467 11962e5b6d6dSopenharmony_ci# What follows is a string of Korean characters (I found it in the Yellow Pages 11972e5b6d6dSopenharmony_ci# ad for the Korean Presbyterian Church of San Francisco, and I hope I transcribed 11982e5b6d6dSopenharmony_ci# it correctly), first as precomposed syllables, and then as conjoining jamo. 11992e5b6d6dSopenharmony_ci# Both sequences should be semantically identical and break the same way. 12002e5b6d6dSopenharmony_ci# precomposed syllables... (I == Rich Gillam?) 12012e5b6d6dSopenharmony_ci# 12022e5b6d6dSopenharmony_ci<data>•\uc0c1•\ud56d •\ud55c•\uc778 •\uc5f0•\ud569 •\uc7a5•\ub85c•\uad50•\ud68c•</data> 12032e5b6d6dSopenharmony_ci 12042e5b6d6dSopenharmony_ci# Bug 4450804 estLineBreakContractions 12052e5b6d6dSopenharmony_ci# 12062e5b6d6dSopenharmony_ci<line> 12072e5b6d6dSopenharmony_ci<data>•These •are •'foobles'. •Don't •you •like •them?•</data> 12082e5b6d6dSopenharmony_ci 12092e5b6d6dSopenharmony_ci 12102e5b6d6dSopenharmony_ci# conjoining jamo... 12112e5b6d6dSopenharmony_ci<data>•\u1109\u1161\u11bc•\u1112\u1161\u11bc •\u1112\u1161\u11ab•\u110b\u1175\u11ab •\u110b\u1167\u11ab•\u1112\u1161\u11b8 •\u110c\u1161\u11bc•\u1105\u1169•\u1100\u116d•\u1112\u116c•</data> 12122e5b6d6dSopenharmony_ci 12132e5b6d6dSopenharmony_ci# to test for bug #4117554: Fullwidth .!? should be treated as postJwrd 12142e5b6d6dSopenharmony_ci<data>•\u4e01\uff0e•\u4e02\uff01•\u4e03\uff1f•</data> 12152e5b6d6dSopenharmony_ci 12162e5b6d6dSopenharmony_ci# Surrogate line break tests. 12172e5b6d6dSopenharmony_ci# 12182e5b6d6dSopenharmony_ci<data>•\u4e01•\ud840\udc01•\u4e02•abc •\ue000 •\udb80\udc01•</data> #This line and the following are equivalent. 12192e5b6d6dSopenharmony_ci<data>•\u4e01•\U00020001•\u4e02•abc •\ue000 •\U000f0001•</data> 12202e5b6d6dSopenharmony_ci 12212e5b6d6dSopenharmony_ci# Regression for bug 836 12222e5b6d6dSopenharmony_ci# Note: Unicode 5.1 changed this behavior 12232e5b6d6dSopenharmony_ci# Unicode 5.2 changed it again, there is no break following the '(' 12242e5b6d6dSopenharmony_ci<data>•AAA(AAA •</data> 12252e5b6d6dSopenharmony_ci 12262e5b6d6dSopenharmony_ci# Try some words from other scripts. 12272e5b6d6dSopenharmony_ci# Greek, Cyrillic, Hebrew, Arabic, Arabic, Georgian, Latin 12282e5b6d6dSopenharmony_ci# 12292e5b6d6dSopenharmony_ci<data>•ΑΒΓ •БВГ •אבג֓ •ابت •١٢٣ •\u10A0\u10A1\u10A2 •ABC •</data> 12302e5b6d6dSopenharmony_ci 12312e5b6d6dSopenharmony_ci# 12322e5b6d6dSopenharmony_ci# ticket #4853: unpaired surrogates should behave like AL 12332e5b6d6dSopenharmony_ci# 12342e5b6d6dSopenharmony_ci<data>•abc\ud801xyz•</data> 12352e5b6d6dSopenharmony_ci 12362e5b6d6dSopenharmony_ci# a character sequence such as "X11" or "30F3" or "native2ascii" should 12372e5b6d6dSopenharmony_ci# be kept together as a single word. 12382e5b6d6dSopenharmony_ci<data>•X11 •30F3 •native2ascii•</data> 12392e5b6d6dSopenharmony_ci 12402e5b6d6dSopenharmony_ci# 12412e5b6d6dSopenharmony_ci# Regression tests for failures that originally came from the monkey test. 12422e5b6d6dSopenharmony_ci# Monkey test failure lines can, with slight reformatting, be copied into this section 12432e5b6d6dSopenharmony_ci# as test cases. The error display from here is more informative. 12442e5b6d6dSopenharmony_ci# 12452e5b6d6dSopenharmony_ci<data>•\ufffc•\u30e3\u000c<100>\u1b39•\u300a\u002f\u203a\u200b•\ufffc•\uaf64•\udcfb•</data> 12462e5b6d6dSopenharmony_ci<data>•\u114d\u31f3•\ube44\u002d•\u0362\u24e2\u276e\u2014\u205f\ufe16•\uc877•\u0fd0\u000a<100>\u20a3•</data> 12472e5b6d6dSopenharmony_ci<data>•\u080a\u215b\U0001d7d3\u002c\u2025\U000e012e•\u02df\u118d\u0029\ua8d6\u0085<100>\u6cc4\u2024\u202f\ufffc•</data> 12482e5b6d6dSopenharmony_ci 12492e5b6d6dSopenharmony_ci# Test for #10176 (in root) 12502e5b6d6dSopenharmony_ci<line> 12512e5b6d6dSopenharmony_ci<data>•abc/•s •def•</data> 12522e5b6d6dSopenharmony_ci<data>•abc/\u05D9 •def•</data> 12532e5b6d6dSopenharmony_ci<data>•\u05E7\u05D7/\u05D9 •\u05DE\u05E2\u05D9\u05DC•</data> 12542e5b6d6dSopenharmony_ci<data>•\u05D3\u05E8\u05D5\u05E9\u05D9\u05DD •\u05E9\u05D7\u05E7\u05E0\u05D9\u05DD/\u05D9\u05D5\u05EA•</data> 12552e5b6d6dSopenharmony_ci 12562e5b6d6dSopenharmony_ci# Ticket #11556 don't break "R$" or "JP¥" 12572e5b6d6dSopenharmony_ci<locale en> 12582e5b6d6dSopenharmony_ci<line> 12592e5b6d6dSopenharmony_ci<data>•R$ •JP¥ •a9 •3a •H% •CA$ •Travi$ •Scott •Ke$ha •Curren$y •A$AP •Rocky•</data> 12602e5b6d6dSopenharmony_ci 12612e5b6d6dSopenharmony_ci# Test Bug 4146175 Lines 12622e5b6d6dSopenharmony_ci# the fullwidth comma should stick to the preceding Japanese character 12632e5b6d6dSopenharmony_ci<line> 12642e5b6d6dSopenharmony_ci<data>•\u7d42\uff0c•\u308f•</data> 12652e5b6d6dSopenharmony_ci 12662e5b6d6dSopenharmony_ci# Empty String 12672e5b6d6dSopenharmony_ci<line> 12682e5b6d6dSopenharmony_ci<data>•</data> 12692e5b6d6dSopenharmony_ci 12702e5b6d6dSopenharmony_ci# Bug #13664 Mixed Thai & English text had different boundaries between forward & reverse. 12712e5b6d6dSopenharmony_ci# 12722e5b6d6dSopenharmony_ci<line> 12732e5b6d6dSopenharmony_ci<data>•Northern •Thailand •is •a •mountainous •area. •Parallel •mountain •ranges •extend •from •the •\ 12742e5b6d6dSopenharmony_ciDaen •Lao •Range •(ทิว•เขา•แดน•ลาว), •in •the •southern •region •of •the •Shan •Hills, •in •a •\ 12752e5b6d6dSopenharmony_cinorth/•south •direction, •the •Dawna •Range •(ทิว•เขา•ดอย•มอน•กุ•จู) •forming •the •western •border •of •Thailand •\ 12762e5b6d6dSopenharmony_cibetween •Mae •Hong •Son •and •the •Salween •River, •the •Thanon •Thong •Chai •Range •(เทือก•เขา•ถนน•ธงชัย), •\ 12772e5b6d6dSopenharmony_cithe •Khun •Tan •Range •(ดอย•ขุน•ตาน), •the •Phi •Pan •Nam •Range •(ทิว•เขา•ผี•ปัน•น้ำ), •as •well •as •the •western •\ 12782e5b6d6dSopenharmony_cipart •of •the •Luang •Prabang •Range •(ทิว•เขา•หลวง•พระ•บาง).•</data> 12792e5b6d6dSopenharmony_ci 12802e5b6d6dSopenharmony_ci# Breaking around numbers that begin with a decimal point. 12812e5b6d6dSopenharmony_ci# Bug ICU-12017 12822e5b6d6dSopenharmony_ci 12832e5b6d6dSopenharmony_ci<line> 12842e5b6d6dSopenharmony_ci<data>•start •.789 •end•</data> 12852e5b6d6dSopenharmony_ci<data>•start .abc •end•</data> 12862e5b6d6dSopenharmony_ci<data>•start •( .789 •end)•</data> 12872e5b6d6dSopenharmony_ci<data>•start •.\u0301789 •end•</data> # \u0301 is a CM (COMBINING ACUTE ACCENT) 12882e5b6d6dSopenharmony_ci<data>•start •.\u200D789 •end•</data> # \u200D is ZWJ 12892e5b6d6dSopenharmony_ci<data>•start •.\u200D\u0301789 •end•</data> 12902e5b6d6dSopenharmony_ci<data>•start •.\u0301\u200D789 •end•</data> 12912e5b6d6dSopenharmony_ci<data>•start .\u0301\u200D•</data> 12922e5b6d6dSopenharmony_ci<data>•start .\u0301\u200D •</data> 12932e5b6d6dSopenharmony_ci<data>•start ; •end •</data> 12942e5b6d6dSopenharmony_ci 12952e5b6d6dSopenharmony_ci######################################################################################## 12962e5b6d6dSopenharmony_ci# 12972e5b6d6dSopenharmony_ci# 12982e5b6d6dSopenharmony_ci# T i t l e B o u n d a r y T e s t s 12992e5b6d6dSopenharmony_ci# 13002e5b6d6dSopenharmony_ci# 13012e5b6d6dSopenharmony_ci########################################################################################## 13022e5b6d6dSopenharmony_ci<title> 13032e5b6d6dSopenharmony_ci<data>•Here •is •a •short •sample •sentence. •And •another.•</data> 13042e5b6d6dSopenharmony_ci<data>•HERE •IS •A •SHORT •SAMPLE •SENTENCE. •AND •ANOTHER.•</data> 13052e5b6d6dSopenharmony_ci<data>• •Start •and •end •with •spaces •</data> 13062e5b6d6dSopenharmony_ci<data>•Include 123 456 ^& •some 54332 •numbers 4445•abc123•abc •ending 1223 •</data> 13072e5b6d6dSopenharmony_ci 13082e5b6d6dSopenharmony_ci<data>•Combining\u0301 \u0301•ma\u0306rks •bye •</data> 13092e5b6d6dSopenharmony_ci<data>•123 •Start •with •a •number.•</data> 13102e5b6d6dSopenharmony_ci 13112e5b6d6dSopenharmony_ci<data>•'•start •with •a •case-•ignorable •cha'r'a'cter•</data> 13122e5b6d6dSopenharmony_ci<data>•' '' •start •with •case-•ignorable & •case-•insensitive •cha'r'a'cter•</data> 13132e5b6d6dSopenharmony_ci<data>• ''•aaa' •bbb '•ccc' '•ddd''' '''•eee '''•fff''' •ggg ''•</data> 13142e5b6d6dSopenharmony_ci# Note: apostrophe is case-ignorable. space is not cased. 13152e5b6d6dSopenharmony_ci 13162e5b6d6dSopenharmony_ci########################################################################################## 13172e5b6d6dSopenharmony_ci# 13182e5b6d6dSopenharmony_ci# Thai Tests 13192e5b6d6dSopenharmony_ci# 13202e5b6d6dSopenharmony_ci########################################################################################## 13212e5b6d6dSopenharmony_ci<locale th> 13222e5b6d6dSopenharmony_ci<word> 13232e5b6d6dSopenharmony_ci# 13242e5b6d6dSopenharmony_ci# Test data originally from the test code source file 13252e5b6d6dSopenharmony_ci# // @suwit -- Thai sample data from GVT Guideline 13262e5b6d6dSopenharmony_ci# 13272e5b6d6dSopenharmony_ci<data>•\u0E2B\u0E19\u0E36\u0E48\u0E07<200>\u0E04\u0E33<200>\u0E44\u0E17\u0E22<200>\ 13282e5b6d6dSopenharmony_ci\u0E2A\u0E32\u0E21\u0E32\u0E23\u0E16<200>\u0E1B\u0E23\u0E30\u0E01\u0E2D\u0E1A<200>\ 13292e5b6d6dSopenharmony_ci\u0E14\u0E49\u0E27\u0E22<200>\u0e2b\u0e25\u0e32\u0e22<200>\ 13302e5b6d6dSopenharmony_ci\u0e1e\u0e22\u0e32\u0e07\u0e04\u0e4c<200></data> 13312e5b6d6dSopenharmony_ci 13322e5b6d6dSopenharmony_ci# Test data originally from 13332e5b6d6dSopenharmony_ci# ICU-8576 svn r30327 = git commit 62d26cc5fe8cc66696143c44a43d664c2d7be100 13342e5b6d6dSopenharmony_ci<data>•กู<200> •กิน<200>กุ้ง<200> •ปิ้่<200>งอ<200>ยู่<200>ใน<200>ถ้ำ<200></data> 13352e5b6d6dSopenharmony_ci 13362e5b6d6dSopenharmony_ci<data>•\u0E01\u0E39<200>\u0020•\u0E01\u0E34\u0E19<200>\u0E01\u0E38\u0E49\u0E07<200>\ 13372e5b6d6dSopenharmony_ci\u0020•\u0E1B\u0E34\u0E49\u0E48<200>\u0E07\u0E2D<200>\u0E22\u0E39\u0E48<200>\ 13382e5b6d6dSopenharmony_ci\u0E43\u0E19<200>\u0E16\u0E49\u0E33<200></data> 13392e5b6d6dSopenharmony_ci 13402e5b6d6dSopenharmony_ci<line> 13412e5b6d6dSopenharmony_ci<data>•0E01\u0E39\u0020•\u0E01\u0E34\u0E19•\u0E01\u0E38\u0E49\u0E07\ 13422e5b6d6dSopenharmony_ci\u0020•\u0E1B\u0E34\u0E49\u0E48•\u0E07\u0E2D•\u0E22\u0E39\u0E48•\ 13432e5b6d6dSopenharmony_ci\u0E43\u0E19•\u0E16\u0E49\u0E33•</data> 13442e5b6d6dSopenharmony_ci 13452e5b6d6dSopenharmony_ci# Data originally from intltest RBBITest::TestThaiLineBreak() 13462e5b6d6dSopenharmony_ci# 13472e5b6d6dSopenharmony_ci# \u0e2f-- the Thai paiyannoi character-- isn't a letter. It's a symbol that 13482e5b6d6dSopenharmony_ci# represents elided letters at the end of a long word. It should be bound to 13492e5b6d6dSopenharmony_ci# the end of the word and not treated as an independent punctuation mark. 13502e5b6d6dSopenharmony_ci# 13512e5b6d6dSopenharmony_ci# the one time where the paiyannoi occurs somewhere other than at the end 13522e5b6d6dSopenharmony_ci# of a word is in the Thai abbreviation for "etc.", which both begins and 13532e5b6d6dSopenharmony_ci# ends with a paiyannoi 13542e5b6d6dSopenharmony_ci# 13552e5b6d6dSopenharmony_ci<line> 13562e5b6d6dSopenharmony_ci<data>•\u0e2a\u0e16\u0e32\u0e19\u0e35\u0e2f•\ 13572e5b6d6dSopenharmony_ci\u0e08\u0e30•\ 13582e5b6d6dSopenharmony_ci\u0e23\u0e30\u0e14\u0e21•\ 13592e5b6d6dSopenharmony_ci\u0e40\u0e08\u0e49\u0e32•\ 13602e5b6d6dSopenharmony_ci\u0e2b\u0e19\u0e49\u0e32\u0e17\u0e35\u0e48•\ 13612e5b6d6dSopenharmony_ci\u0e2d\u0e2d\u0e01•\ 13622e5b6d6dSopenharmony_ci\u0e21\u0e32•\ 13632e5b6d6dSopenharmony_ci\u0e40\u0e23\u0e48\u0e07•\ 13642e5b6d6dSopenharmony_ci\u0e23\u0e30\u0e1a\u0e32\u0e22•\ 13652e5b6d6dSopenharmony_ci\u0e2d\u0e22\u0e48\u0e32\u0e07•\ 13662e5b6d6dSopenharmony_ci\u0e40\u0e15\u0e47\u0e21•\ 13672e5b6d6dSopenharmony_ci\u0e2f\u0e25\u0e2f•\ 13682e5b6d6dSopenharmony_ci\u0e17\u0e35\u0e48•\ 13692e5b6d6dSopenharmony_ci\u0e19\u0e31\u0e49\u0e19•</data> 13702e5b6d6dSopenharmony_ci 13712e5b6d6dSopenharmony_ci# Data originally from RBBITest::TestMixedThaiLineBreak() 13722e5b6d6dSopenharmony_ci# @suwit -- Test Arabic numerals, Thai numerals, Punctuation and English characters start 13732e5b6d6dSopenharmony_ci# 13742e5b6d6dSopenharmony_ci<line> 13752e5b6d6dSopenharmony_ci<data>•\u0E1B\u0E35•\ 13762e5b6d6dSopenharmony_ci\u0E1E\u0E38\u0E17\u0E18\u0E28\u0E31\u0E01\u0E23\u0E32\u0E0A •\ 13772e5b6d6dSopenharmony_ci2545 •\ 13782e5b6d6dSopenharmony_ci\u0E40\u0E1B\u0E47\u0E19•\ 13792e5b6d6dSopenharmony_ci\u0E1B\u0E35•\ 13802e5b6d6dSopenharmony_ci\u0E09\u0E25\u0E2D\u0E07•\ 13812e5b6d6dSopenharmony_ci\u0E04\u0E23\u0E1A•\ 13822e5b6d6dSopenharmony_ci\u0E23\u0E2D\u0E1A •\ 13832e5b6d6dSopenharmony_ci\"\u0E52\u0E52\u0E50 •\ 13842e5b6d6dSopenharmony_ci\u0E1b\u0E35\" •\ 13852e5b6d6dSopenharmony_ci\u0E02\u0E2d\u0E07•\ 13862e5b6d6dSopenharmony_ci\u0E01\u0E23\u0E38\u0E07•\ 13872e5b6d6dSopenharmony_ci\u0E23\u0E31\u0E15\u0E19\u0E42\u0E01\u0E2A\u0E34\u0E19\u0E17\u0E23\u0E4C •\ 13882e5b6d6dSopenharmony_ci(\u0E01\u0E23\u0E38\u0E07\u0E40\u0E17\u0E1e\u0E2F•\ 13892e5b6d6dSopenharmony_ci\u0E2B\u0E23\u0E37\u0E2D •\ 13902e5b6d6dSopenharmony_ciBangkok)•</data> 13912e5b6d6dSopenharmony_ci 13922e5b6d6dSopenharmony_ci# Data originally from RBBITest::TestMaiyamok() 13932e5b6d6dSopenharmony_ci# The Thai maiyamok character is a shorthand symbol that means "repeat the previous 13942e5b6d6dSopenharmony_ci# word". Instead of appearing as a word unto itself, however, it's kept together 13952e5b6d6dSopenharmony_ci# with the word before it. 13962e5b6d6dSopenharmony_ci# 13972e5b6d6dSopenharmony_ci<line> 13982e5b6d6dSopenharmony_ci<data>•\u0e44\u0e1b\u0e46•\ 13992e5b6d6dSopenharmony_ci\u0e21\u0e32\u0e46•\ 14002e5b6d6dSopenharmony_ci\u0e23\u0e30\u0e2b\u0e27\u0e48\u0e32\u0e07•\ 14012e5b6d6dSopenharmony_ci\u0e01\u0e23\u0e38\u0e07•\ 14022e5b6d6dSopenharmony_ci\u0e40\u0e17\u0e1e•\ 14032e5b6d6dSopenharmony_ci\u0e41\u0e25\u0e30•\ 14042e5b6d6dSopenharmony_ci\u0e40\u0e03\u0e35•\ 14052e5b6d6dSopenharmony_ci\u0e22\u0e07•\ 14062e5b6d6dSopenharmony_ci\u0e43\u0e2b\u0e21\u0e48•</data> 14072e5b6d6dSopenharmony_ci 14082e5b6d6dSopenharmony_ci# Test for #10296 14092e5b6d6dSopenharmony_ci<line> 14102e5b6d6dSopenharmony_ci<data>•ใช•มั้ย•</data> 14112e5b6d6dSopenharmony_ci<data>•มั๊ยล่ะ•ที่รัก•</data> 14122e5b6d6dSopenharmony_ci 14132e5b6d6dSopenharmony_ci# Test for #10593 14142e5b6d6dSopenharmony_ci<line> 14152e5b6d6dSopenharmony_ci<data>•เล่น•ผ่าน•ทาง•บลูทูธ•บน•อุปกรณ์•</data> 14162e5b6d6dSopenharmony_ci 14172e5b6d6dSopenharmony_ci# Test for city names #10691 14182e5b6d6dSopenharmony_ci<line> 14192e5b6d6dSopenharmony_ci<data>•ไป•ที่•ซานฟรานซิสโก•</data> 14202e5b6d6dSopenharmony_ci 14212e5b6d6dSopenharmony_ci# Test for #10630, #10631 14222e5b6d6dSopenharmony_ci<line> 14232e5b6d6dSopenharmony_ci<data>•แท็ก•แอปพลิเคชัน•เป็น•พิเศษ•</data> 14242e5b6d6dSopenharmony_ci 14252e5b6d6dSopenharmony_ci# Test for #11019 14262e5b6d6dSopenharmony_ci<line> 14272e5b6d6dSopenharmony_ci<data>•เบ•เบราว์เซอร์•โพ•โพสต์•โพสท์•</data> 14282e5b6d6dSopenharmony_ci 14292e5b6d6dSopenharmony_ci# Test for #11688 14302e5b6d6dSopenharmony_ci<line> 14312e5b6d6dSopenharmony_ci<data>•อัปเดต•อีเวนต์•</data> 14322e5b6d6dSopenharmony_ci 14332e5b6d6dSopenharmony_ci<line> 14342e5b6d6dSopenharmony_ci<data>•แอป์•นี้•ไม่•รองรับ •Android •เวอร์ชัน•ล่าสุด •หาก•แอป•นี้•เข้า•ถึง•ไฟล์•เพลง•และ•ไฟล์•เสียง•ได้ •ก็•จะ•ได้•รับ•อนุญาต•ให้•เข้า•ถึง•รูปภาพ•และ•วิดีโอ•ด้วย•</data> 14352e5b6d6dSopenharmony_ci 14362e5b6d6dSopenharmony_ci########################################################################################## 14372e5b6d6dSopenharmony_ci# 14382e5b6d6dSopenharmony_ci# Lao Tests 14392e5b6d6dSopenharmony_ci# 14402e5b6d6dSopenharmony_ci########################################################################################## 14412e5b6d6dSopenharmony_ci<locale en> 14422e5b6d6dSopenharmony_ci# Basic check for #7647 14432e5b6d6dSopenharmony_ci<line> 14442e5b6d6dSopenharmony_ci<data>•ສະບາຍດີ•</data> 14452e5b6d6dSopenharmony_ci<data>•ດີ•ຂອບໃຈ•</data> 14462e5b6d6dSopenharmony_ci<data>•ເຈົ້າ•ເວົ້າ•ພາສາ•ອັງກິດ•ໄດ້•ບໍ່•</data> 14472e5b6d6dSopenharmony_ci<data>•ກະລຸນາ•ເວົ້າ•ຊ້າ•ໆ•</data> 14482e5b6d6dSopenharmony_ci# More checks for ICU-21413 (check both moved and added entries) 14492e5b6d6dSopenharmony_ci<line> 14502e5b6d6dSopenharmony_ci<data>•ກໍ•ກໍ່•ກໍ້•</data> 14512e5b6d6dSopenharmony_ci<data>•ກວາກຊວາກ•ກະເຈາະກະຈອກ•</data> 14522e5b6d6dSopenharmony_ci<data>•ກະຊວງປ້ອງກັນຄວາມສະຫງົບ•</data> 14532e5b6d6dSopenharmony_ci<data>•ໄຮ•ໄຮ່•ໄຮ້•</data> 14542e5b6d6dSopenharmony_ci# Checks for ICU-21409 (and ICU-21384) 14552e5b6d6dSopenharmony_ci<line> 14562e5b6d6dSopenharmony_ci<data>•ຣະຄັງ•ຣະມາ•ລູຊີເຟີ•</data> 14572e5b6d6dSopenharmony_ci 14582e5b6d6dSopenharmony_ci########################################################################################## 14592e5b6d6dSopenharmony_ci# 14602e5b6d6dSopenharmony_ci# Burmese/Myanmar Tests 14612e5b6d6dSopenharmony_ci# 14622e5b6d6dSopenharmony_ci########################################################################################## 14632e5b6d6dSopenharmony_ci<locale en> 14642e5b6d6dSopenharmony_ci# Basic sanity check for #10326 (some text from http://www.unicode.org/udhr/d/udhr_mya.txt) 14652e5b6d6dSopenharmony_ci<line> 14662e5b6d6dSopenharmony_ci<data>•လူ•တိုင်း•သည် •တူညီ •လွတ်လပ်•သော •ဂုဏ်•သိ•က္•ခါ•ဖြ•င့် •လည်းကောင်း၊ •</data> 14672e5b6d6dSopenharmony_ci<data>•တူညီ•လွတ်လပ်•သော •အ•ခွ•င့်•အရေး•များ•ဖြ•င့် •လည်းကောင်း၊ •မွေး•ဖွား•လာ•သူများ •ဖြစ်သည်။•</data> 14682e5b6d6dSopenharmony_ci<data>•ထို•သူ•တို့၌ •ပိုင်းခြား •ဝေဖန်•တတ်•သော •ဉာဏ်•နှ•င့် •ကျ•င့်•ဝတ် •သိတတ်•သော •စိတ်•တို့•ရှိ•ကြ၍ •</data> 14692e5b6d6dSopenharmony_ci<data>•ထို•သူ•တို့သည် •အချင်းချင်း •မေတ္တာ•ထား၍ •ဆက်ဆံ•ကျ•င့်•သုံး•</data> 14702e5b6d6dSopenharmony_ci 14712e5b6d6dSopenharmony_ci########################################################################################## 14722e5b6d6dSopenharmony_ci# 14732e5b6d6dSopenharmony_ci# Khmer Tests 14742e5b6d6dSopenharmony_ci# 14752e5b6d6dSopenharmony_ci########################################################################################## 14762e5b6d6dSopenharmony_ci 14772e5b6d6dSopenharmony_ci# Test data originally from 14782e5b6d6dSopenharmony_ci# ICU-8576 svn r30327 = git commit 62d26cc5fe8cc66696143c44a43d664c2d7be100 14792e5b6d6dSopenharmony_ci# from the file testdata/wordsegments.txt 14802e5b6d6dSopenharmony_ci<locale en> 14812e5b6d6dSopenharmony_ci<word> 14822e5b6d6dSopenharmony_ci 14832e5b6d6dSopenharmony_ci<data>•តើ<200>លោក<200>មក<200>ពី<200>ប្រទេស<200>ណា<200></data> 14842e5b6d6dSopenharmony_ci<data>•សណ្ដូក<200>ក<200>បណ្ដែត<200>ខ្លួន<200></data> 14852e5b6d6dSopenharmony_ci<data>•ពណ៌ស<200>ម្ដេច<200>ថា<200>ខ្មៅ<200></data> 14862e5b6d6dSopenharmony_ci#ប្រយោគ|ពី|របៀប|រួបរួម|និង|ភាព|ផ្សេងគ្នា|ដែល|អាច|ចូល<200></data> 14872e5b6d6dSopenharmony_ci<data>•ប្រយោគ<200>ពី<200>របៀប<200>ដែល<200>និង<200>ភាព<200>ផ្សេងគ្នា<200>ដែល<200>អាច<200>ចូល<200></data> 14882e5b6d6dSopenharmony_ci#ប្រយោគ|ពី|របៀប|ជា|មួយ|និង|ភាព|ផ្សេងគ្នា|ដែល|អាច|ចូល<200></data> 14892e5b6d6dSopenharmony_ci<data>•សូម<200>ចំណាយពេល<200>បន្តិច<200>ដើម្បី<200>អធិស្ឋាន<200>អរព្រះគុណ<200>ដល់<200>ព្រះអង្គ<200></data> 14902e5b6d6dSopenharmony_ci<data>•ការ<200>ថោកទាប<200>បរិប្បូណ៌<200>ដោយ<200></data> 14912e5b6d6dSopenharmony_ci<data>•ប្រើប្រាស់<200>ស្អាត<200>ទាំង<200>ចិត្ត<200>សិស្ស<200>នោះ<200></data> 14922e5b6d6dSopenharmony_ci<data>•បើ<200>អ្នក<200>ប្រព្រឺត្ត<200>អំពើអាក្រក់<200>មុខ<200>ជា<200>មាន<200></data> 14932e5b6d6dSopenharmony_ci<data>•ប្រដាប់<200>ប្រដា<200>រ<200>រៀនសូត្រ<200>បន្ទប់<200>រៀន<200></data> 14942e5b6d6dSopenharmony_ci<data>•ដើរតួ<200>មនុស្សគ<200>ឥត<200>បញ្ចេញ<200>យោបល់<200>សោះ<200>ឡើយ<200></data> 14952e5b6d6dSopenharmony_ci<data>•មិន<200>អាច<200>ឲ្យ<200>យើង<200>ធ្វើ<200>កសិកម្ម<200>បាន<200>ឡើយ<200></data> 14962e5b6d6dSopenharmony_ci<data>•បន្ត<200>សេចក្ត<200>ទៅទៀត<200></data> 14972e5b6d6dSopenharmony_ci<data>•ក្រុម<200>ប៉ូលិស<200>បណ្តាក់<200>គ្នា<200></data> 14982e5b6d6dSopenharmony_ci<data>•គ្មាន<200>សុខ<200>សំរាន្ត<200>ដង<200>ណា<200></data> 14992e5b6d6dSopenharmony_ci<data>•បាន<200>សុខភាព<200>បរិប្បូណ៌<200></data> 15002e5b6d6dSopenharmony_ci<data>•ជា<200>មេចោរ<200>ខ្ញុំ<200>នឹង<200>ស្លាប់<200>ទៅវិញ<200>ជា<200>មេចោរ<200></data> 15012e5b6d6dSopenharmony_ci<data>•ឯ<200>ការ<200>វាយ<200>ផ្ចាល<200>ដែល<200>នាំ<200></data> 15022e5b6d6dSopenharmony_ci<data>•គេ<200>ដឹក<200>ទៅ<200>សំឡាប់<200></data> 15032e5b6d6dSopenharmony_ci#អ្នក|ដែល|ជា|មន្ត្រី|ធំ|លើ|គាត់|ទេ<200></data> 15042e5b6d6dSopenharmony_ci<data>•យក<200>ទៅ<200>សម្លាប់ចោល<200>ស្ងាត់<200></data> 15052e5b6d6dSopenharmony_ci<data>•ត្រូវ<200>បាន<200>គេ<200>សម្លាប់<200></data> 15062e5b6d6dSopenharmony_ci<data>•នៅក្នុង<200>ស្រុក<200>ខ្ល<200>ងហ្ស៊ុន<200></data> 15072e5b6d6dSopenharmony_ci 15082e5b6d6dSopenharmony_ci 15092e5b6d6dSopenharmony_ci# 15102e5b6d6dSopenharmony_ci# Jitterbug 3671 Test Case 15112e5b6d6dSopenharmony_ci# 15122e5b6d6dSopenharmony_ci<data>•สวัสดี<200>ครับ<200>สบาย<200>ดี<200>ไหม<200> •ครับ<200></data> 15132e5b6d6dSopenharmony_ci 15142e5b6d6dSopenharmony_ci# 15152e5b6d6dSopenharmony_ci# Trac ticket 5595 Test Case 15162e5b6d6dSopenharmony_ci<data>•บท<200>ที่๑พายุ<200>ไซโคลน<200>โด<200>โรธี<200>อาศัย<200>อยู่<200>ท่ามกลาง<200>\ 15172e5b6d6dSopenharmony_ciทุ่งใหญ่<200>ใน<200>แคนซัส<200>กับ<200>ลุง<200>เฮ<200>นรี<200>ชาวไร่<200>และ<200>ป้า<200>เอ็ม<200>\ 15182e5b6d6dSopenharmony_ciภรรยา<200>ชาวไร่<200>บ้าน<200>ของ<200>พวก<200>เขา<200>หลัง<200>เล็ก<200>เพราะ<200>ไม้<200>\ 15192e5b6d6dSopenharmony_ciสร้าง<200>บ้าน<200>ต้อง<200>ขน<200>มา<200>ด้วย<200>เกวียน<200>เป็น<200>ระยะ<200>ทาง<200>หลาย<200>\ 15202e5b6d6dSopenharmony_ciไมล์<200></data> 15212e5b6d6dSopenharmony_ci 15222e5b6d6dSopenharmony_ci#################################################################################### 15232e5b6d6dSopenharmony_ci# 15242e5b6d6dSopenharmony_ci# Tailored (locale specific) breaking. 15252e5b6d6dSopenharmony_ci# 15262e5b6d6dSopenharmony_ci#################################################################################### 15272e5b6d6dSopenharmony_ci 15282e5b6d6dSopenharmony_ci# Japanese line break tailoring test 15292e5b6d6dSopenharmony_ci 15302e5b6d6dSopenharmony_ci<locale ja> 15312e5b6d6dSopenharmony_ci<line> 15322e5b6d6dSopenharmony_ci<data>•\u3041•\u3043•\u3045•\u31f1•</data> 15332e5b6d6dSopenharmony_ci<locale en> 15342e5b6d6dSopenharmony_ci<line> 15352e5b6d6dSopenharmony_ci<data>•\u3041\u3043\u3045\u31f1•</data> 15362e5b6d6dSopenharmony_ci 15372e5b6d6dSopenharmony_ci# The following data was originally in RBBITest::TestJapaneseWordBreak() 15382e5b6d6dSopenharmony_ci<locale ja> 15392e5b6d6dSopenharmony_ci<word> 15402e5b6d6dSopenharmony_ci<data>•\u4ECA\u65E5<400>\u306F<400>\u3044\u3044<400>\u5929\u6C17<400>\u3067\u3059<400>\u306D<400>\u3002•\u000D\u000A•</data> 15412e5b6d6dSopenharmony_ci 15422e5b6d6dSopenharmony_ci# UBreakIteratorType UBRK_WORD, Locale "ja" 15432e5b6d6dSopenharmony_ci# Don't break in runs of hiragana or runs of ideograph, where the latter includes \u3005 \u3007 \u303B (cldrbug #2009). 15442e5b6d6dSopenharmony_ci# \u79C1\u9054\u306B\u4E00\u3007\u3007\u3007\u306E\u30B3\u30F3\u30D4\u30E5\u30FC\u30BF\u304C\u3042\u308B\u3002\u5948\u3005\u306F\u30EF\u30FC\u30C9\u3067\u3042\u308B\u3002 15452e5b6d6dSopenharmony_ci# modified to work with dbbi code - should verify 15462e5b6d6dSopenharmony_ci 15472e5b6d6dSopenharmony_ci<locale ja> 15482e5b6d6dSopenharmony_ci<word> 15492e5b6d6dSopenharmony_ci<data>•私<400>達<400>に<400>一<400>〇<400>〇〇<400>の<400>コンピュータ<400>が<400>ある<400>。<0>奈々<400>は<400>ワード<400>で<400>ある<400>。•</data> 15502e5b6d6dSopenharmony_ci 15512e5b6d6dSopenharmony_ci# Test for #10176 (in ja) 15522e5b6d6dSopenharmony_ci<line> 15532e5b6d6dSopenharmony_ci<data>•abc/•s •def•</data> 15542e5b6d6dSopenharmony_ci<data>•abc/\u05D9 •def•</data> 15552e5b6d6dSopenharmony_ci<data>•\u05E7\u05D7/\u05D9 •\u05DE\u05E2\u05D9\u05DC•</data> 15562e5b6d6dSopenharmony_ci<data>•\u05D3\u05E8\u05D5\u05E9\u05D9\u05DD •\u05E9\u05D7\u05E7\u05E0\u05D9\u05DD/\u05D9\u05D5\u05EA•</data> 15572e5b6d6dSopenharmony_ci 15582e5b6d6dSopenharmony_ci 15592e5b6d6dSopenharmony_ci<locale root> 15602e5b6d6dSopenharmony_ci<word> 15612e5b6d6dSopenharmony_ci<data>•私<400>達<400>に<400>一<400>〇<400>〇〇<400>の<400>コンピュータ<400>が<400>ある<400>。<0>奈々<400>は<400>ワード<400>で<400>ある<400>。•</data> 15622e5b6d6dSopenharmony_ci# The following test is for #10300 15632e5b6d6dSopenharmony_ci<data>•例えば<400>オーストラリア<400>。•</data> 15642e5b6d6dSopenharmony_ci# The following test is for #10571 15652e5b6d6dSopenharmony_ci<data>•一部<400>の<400>地域<400>では<400>、<0>ブラジル<400>、<0>インドネシア<400>、<0>オーストリア<400>、<0>ニュージーランド<400>で<400>ある<400>。•</data> 15662e5b6d6dSopenharmony_ci 15672e5b6d6dSopenharmony_ci# UBreakIteratorType UBRK_SENTENCE, Locale "el" 15682e5b6d6dSopenharmony_ci# Add break after Greek question mark (cldrbug #2069). 15692e5b6d6dSopenharmony_ci# "\u0391\u03B2, \u03B3\u03B4; \u0395 \u03B6\u03B7\u037E \u0398 \u03B9\u03BA. " 15702e5b6d6dSopenharmony_ci# "\u039B\u03BC \u03BD\u03BE! \u039F\u03C0, \u03A1\u03C2? \u03A3" 15712e5b6d6dSopenharmony_ci# which is "Αβ, γδ; Ε ζη; Θ ικ. Λμ νξ! Οπ, Ρς? Σ" 15722e5b6d6dSopenharmony_ci 15732e5b6d6dSopenharmony_ci<locale root> 15742e5b6d6dSopenharmony_ci<sent> 15752e5b6d6dSopenharmony_ci<data>•Αβ, γδ; Ε ζη; Θ ικ. •Λμ νξ! •Οπ, Ρς? •Σ<100></data> 15762e5b6d6dSopenharmony_ci 15772e5b6d6dSopenharmony_ci<locale el> 15782e5b6d6dSopenharmony_ci<sent> 15792e5b6d6dSopenharmony_ci<data>•Αβ, γδ; •Ε ζη; •Θ ικ. •Λμ νξ! •Οπ, Ρς? •Σ<100></data> 15802e5b6d6dSopenharmony_ci 15812e5b6d6dSopenharmony_ci# UBreakIteratorType UBRK_WORD, Locale "en_US_POSIX" 15822e5b6d6dSopenharmony_ci# Words don't include colon or period (cldrbug #1969). 15832e5b6d6dSopenharmony_ci 15842e5b6d6dSopenharmony_ci<locale en_US> 15852e5b6d6dSopenharmony_ci<word> 15862e5b6d6dSopenharmony_ci<data>•Can't<200> •have<200> •breaks<200> •in<200> •xx<200>:•yy<200> •or<200> •struct.field<200> \ 15872e5b6d6dSopenharmony_ci•for<200> •CS<200>-•types<200>.•</data> 15882e5b6d6dSopenharmony_ci<data>•\uFF92\uFF76\uFF9E<400> •</data> 15892e5b6d6dSopenharmony_ci<data>•xx@yy<200>.•</data> 15902e5b6d6dSopenharmony_ci 15912e5b6d6dSopenharmony_ci<locale en_US_POSIX> 15922e5b6d6dSopenharmony_ci<word> 15932e5b6d6dSopenharmony_ci<data>•Can't<200> •have<200> •breaks<200> •in<200> •xx<200>:•yy<200> •or<200> •struct<200>.•field<200> \ 15942e5b6d6dSopenharmony_ci•for<200> •CS<200>-•types<200>.•</data> 15952e5b6d6dSopenharmony_ci<data>•\u06c9<200>\uc799\ufffa•</data> 15962e5b6d6dSopenharmony_ci<data>•\uFF92\uFF76\uFF9E<400> •</data> 15972e5b6d6dSopenharmony_ci<data>•xx@yy<200>.•</data> 15982e5b6d6dSopenharmony_ci 15992e5b6d6dSopenharmony_ci<locale fi> 16002e5b6d6dSopenharmony_ci<word> 16012e5b6d6dSopenharmony_ci<data>•Can't<200> •have<200> •breaks<200> •in<200> •xx:yy<200> •or<200> •struct.field<200> \ 16022e5b6d6dSopenharmony_ci•for<200> •CS<200>-•types<200>.•</data> 16032e5b6d6dSopenharmony_ci<data>•\uFF92\uFF76\uFF9E<400> •</data> 16042e5b6d6dSopenharmony_ci<data>•xx@yy<200>.•</data> 16052e5b6d6dSopenharmony_ci 16062e5b6d6dSopenharmony_ci<locale sv> 16072e5b6d6dSopenharmony_ci<word> 16082e5b6d6dSopenharmony_ci<data>•Can't<200> •have<200> •breaks<200> •in<200> •xx:yy<200> •or<200> •struct.field<200> \ 16092e5b6d6dSopenharmony_ci•for<200> •CS<200>-•types<200>.•</data> 16102e5b6d6dSopenharmony_ci<data>•\uFF92\uFF76\uFF9E<400> •</data> 16112e5b6d6dSopenharmony_ci<data>•xx@yy<200>.•</data> 16122e5b6d6dSopenharmony_ci 16132e5b6d6dSopenharmony_ci 16142e5b6d6dSopenharmony_ci# UBreakIteratorType UBRK_CHARACTER, Locale "th" 16152e5b6d6dSopenharmony_ci# Clusters should not include spacing Thai/Lao vowels (prefix or postfix), except for [SARA] AM (cldrbug #2161). 16162e5b6d6dSopenharmony_ci# Update: As of Unicode 6.1 root has same behavior as th for this. 16172e5b6d6dSopenharmony_ci# 16182e5b6d6dSopenharmony_ci# "\u0E01\u0E23\u0E30\u0E17\u0E48\u0E2D\u0E21\u0E23\u0E08\u0E19\u0E32 " 16192e5b6d6dSopenharmony_ci# "(\u0E2A\u0E38\u0E0A\u0E32\u0E15\u0E34-\u0E08\u0E38\u0E11\u0E32\u0E21\u0E32\u0E28) " 16202e5b6d6dSopenharmony_ci# "\u0E40\u0E14\u0E47\u0E01\u0E21\u0E35\u0E1B\u0E31\u0E0D\u0E2B\u0E32 " 16212e5b6d6dSopenharmony_ci# which is "กระท่อมรจนา (สุชาติ-จุฑามาศ) เด็กมีปัญหา " 16222e5b6d6dSopenharmony_ci 16232e5b6d6dSopenharmony_ci<locale th> 16242e5b6d6dSopenharmony_ci<char> 16252e5b6d6dSopenharmony_ci<data>•\u0E01•\u0E23•\u0E30•\u0E17\u0E48•\u0E2D•\u0E21•\u0E23•\u0E08•\u0E19•\u0E32• •\ 16262e5b6d6dSopenharmony_ci(•\u0E2A\u0E38•\u0E0A•\u0E32•\u0E15\u0E34•-•\u0E08\u0E38•\u0E11•\u0E32•\u0E21•\u0E32•\u0E28•)• •\ 16272e5b6d6dSopenharmony_ci\u0E40•\u0E14\u0E47•\u0E01•\u0E21\u0E35•\u0E1B\u0E31•\u0E0D•\u0E2B•\u0E32• •</data> 16282e5b6d6dSopenharmony_ci 16292e5b6d6dSopenharmony_ci# Finnish line breaking 16302e5b6d6dSopenharmony_ci# 16312e5b6d6dSopenharmony_ci# These rules deal with hyphens when there is a space on the leading side. 16322e5b6d6dSopenharmony_ci# When followed by a letter, there should be a break opportunity between 16332e5b6d6dSopenharmony_ci# the space and the hyphen, and not after the hyphen. 16342e5b6d6dSopenharmony_ci# See CLDR ticket 3029. 16352e5b6d6dSopenharmony_ci# See ICU ticket 8151 16362e5b6d6dSopenharmony_ci# As of ICU 63, the Finnish tailoring behavior is moved to root. 16372e5b6d6dSopenharmony_ci 16382e5b6d6dSopenharmony_ci<locale root> 16392e5b6d6dSopenharmony_ci<line> 16402e5b6d6dSopenharmony_ci<data>•abc •- •def •abc •-def •abc- •def •abc-•def•</data> # With ASCII hyphen 16412e5b6d6dSopenharmony_ci<data>•abc •‐ •def •abc •‐def •abc‐ •def •abc‐•def•</data> # With Unicode u2010 hyphen 16422e5b6d6dSopenharmony_ci 16432e5b6d6dSopenharmony_ci<locale fi> 16442e5b6d6dSopenharmony_ci<line> 16452e5b6d6dSopenharmony_ci<data>•abc •- •def •abc •-def •abc- •def •abc-•def•</data> # With ASCII hyphen 16462e5b6d6dSopenharmony_ci<data>•abc •‐ •def •abc •‐def •abc‐ •def •abc‐•def•</data> # With Unicode u2010 hyphen 16472e5b6d6dSopenharmony_ci 16482e5b6d6dSopenharmony_ci# Test for #10176 (in fi) 16492e5b6d6dSopenharmony_ci<line> 16502e5b6d6dSopenharmony_ci<data>•abc/•s •def•</data> 16512e5b6d6dSopenharmony_ci<data>•abc/\u05D9 •def•</data> 16522e5b6d6dSopenharmony_ci<data>•\u05E7\u05D7/\u05D9 •\u05DE\u05E2\u05D9\u05DC•</data> 16532e5b6d6dSopenharmony_ci<data>•\u05D3\u05E8\u05D5\u05E9\u05D9\u05DD •\u05E9\u05D7\u05E7\u05E0\u05D9\u05DD/\u05D9\u05D5\u05EA•</data> 16542e5b6d6dSopenharmony_ci 16552e5b6d6dSopenharmony_ci#################################################################################### 16562e5b6d6dSopenharmony_ci# 16572e5b6d6dSopenharmony_ci# Test CSS line break variants: strict, normal, loose 16582e5b6d6dSopenharmony_ci# 16592e5b6d6dSopenharmony_ci#################################################################################### 16602e5b6d6dSopenharmony_ci 16612e5b6d6dSopenharmony_ci<locale ja@lb=strict> 16622e5b6d6dSopenharmony_ci<line> 16632e5b6d6dSopenharmony_ci# •no brk before 3063 •no brk before 301C•no brk btw 2026 •no brk before FF01• 16642e5b6d6dSopenharmony_ci<data>•\u3084\u3063•\u3071•\u308A\u0020•\u0031\u301C\u0020•\u2026\u2026\u0020•\u30A2\uFF01\u0020•</data> 16652e5b6d6dSopenharmony_ci 16662e5b6d6dSopenharmony_ci<locale ja@lb=normal> 16672e5b6d6dSopenharmony_ci<line> 16682e5b6d6dSopenharmony_ci# •brk OK before 3063 •brk OK before 301C •no brk btw 2026 •no brk before FF01• 16692e5b6d6dSopenharmony_ci<data>•\u3084•\u3063•\u3071•\u308A\u0020•\u0031•\u301C\u0020•\u2026\u2026\u0020•\u30A2\uFF01\u0020•</data> 16702e5b6d6dSopenharmony_ci 16712e5b6d6dSopenharmony_ci# •no brk before 2010 • 16722e5b6d6dSopenharmony_ci<data>•\u3042\u2010•\u0031\u0020•\u0061\u2010•\u0031\u0020•</data> 16732e5b6d6dSopenharmony_ci 16742e5b6d6dSopenharmony_ci<locale ja@lb=loose> 16752e5b6d6dSopenharmony_ci<line> 16762e5b6d6dSopenharmony_ci# •brk OK before 3063 •brk OK before 301C •brk OK btw 2026 •brk OK before FF01• 16772e5b6d6dSopenharmony_ci<data>•\u3084•\u3063•\u3071•\u308A\u0020•\u0031•\u301C\u0020•\u2026•\u2026\u0020•u30A2•\uFF01\u0020•</data> 16782e5b6d6dSopenharmony_ci 16792e5b6d6dSopenharmony_ci# •no brk before 2010 except ok after ID • 16802e5b6d6dSopenharmony_ci<data>•\u3042•\u2010•\u0031\u0020•\u0061\u2010•\u0031\u0020•</data> 16812e5b6d6dSopenharmony_ci 16822e5b6d6dSopenharmony_ci<locale en@lb=strict> 16832e5b6d6dSopenharmony_ci<line> 16842e5b6d6dSopenharmony_ci# •no brk before 3063 •no brk before 301C•no brk btw 2026 •no brk before FF01• 16852e5b6d6dSopenharmony_ci<data>•\u3084\u3063•\u3071•\u308A\u0020•\u0031\u301C\u0020•\u2026\u2026\u0020•\u30A2\uFF01\u0020•</data> 16862e5b6d6dSopenharmony_ci 16872e5b6d6dSopenharmony_ci<locale en@lb=normal> 16882e5b6d6dSopenharmony_ci<line> 16892e5b6d6dSopenharmony_ci# •brk OK before 3063 •no brk before 301C •no brk btw 2026 •no brk before FF01• 16902e5b6d6dSopenharmony_ci<data>•\u3084•\u3063•\u3071•\u308A\u0020•\u0031\u301C\u0020•\u2026\u2026\u0020•\u30A2\uFF01\u0020•</data> 16912e5b6d6dSopenharmony_ci 16922e5b6d6dSopenharmony_ci<locale en@lb=loose> 16932e5b6d6dSopenharmony_ci<line> 16942e5b6d6dSopenharmony_ci# •brk OK before 3063 •no brk before 301C •brk OK btw 2026 •no brk before FF01• 16952e5b6d6dSopenharmony_ci<data>•\u3084•\u3063•\u3071•\u308A\u0020•\u0031\u301C\u0020•\u2026•\u2026\u0020•u30A2\uFF01\u0020•</data> 16962e5b6d6dSopenharmony_ci 16972e5b6d6dSopenharmony_ci#################################################################################### 16982e5b6d6dSopenharmony_ci# 16992e5b6d6dSopenharmony_ci# Test breaks for emoji clusters (same for all locales, mostly the same for all break types) 17002e5b6d6dSopenharmony_ci# 17012e5b6d6dSopenharmony_ci#################################################################################### 17022e5b6d6dSopenharmony_ci 17032e5b6d6dSopenharmony_ci<locale root> 17042e5b6d6dSopenharmony_ci 17052e5b6d6dSopenharmony_ci<char> 17062e5b6d6dSopenharmony_ci# woman zwj woman zwj girl zwj girl, woman/fitz-1-2 zwj woman/fitz-4 zwj boy/fitz-6 17072e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F469\u200D\U0001F467\u200D\U0001F467•\U0001F469\U0001F3FB\u200D\U0001F469\U0001F3FD\u200D\U0001F466\U0001F3FF•</data> 17082e5b6d6dSopenharmony_ci# woman zwj baby/fitz-3, older_woman/fitz-5, runner/fitz-4, raised_fist/fitz-3, fuel_pump, fitz-3 17092e5b6d6dSopenharmony_ci#TODO: <data>•\U0001F469\u200D\U0001F476\U0001F3FC•\U0001F475\U0001F3FE•\U0001F3C3\U0001F3FD•\u270A\U0001F3FC•\u26FD•\U0001F3FC•</data> 17102e5b6d6dSopenharmony_ci# man zwj hvy_blk_heart zwj man, woman, man zwj hvy_blk_heart/evar zwj man, woman 17112e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2764\u200D\U0001F468•\U0001F469•\U0001F468\u200D\u2764\uFE0F\u200D\U0001F468•\U0001F469•</data> 17122e5b6d6dSopenharmony_ci# woman zwj hvy_blk_heart/evar zwj kiss_mark zwj woman, sleuth/fitz-4, horse_racing/fitz-5 17132e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\u2764\uFE0F\u200D\U0001F48B\u200D\U0001F469•\U0001F575\U0001F3FD•\U0001F3C7\U0001F3FE•</data> 17142e5b6d6dSopenharmony_ci# victory_hand/evar, victory_hand/evar/fitz-1-2, victory_hand/fitz-1-2, rowboat/fitz-4, vulcan_salute/fitz-5, space, 17152e5b6d6dSopenharmony_ci<data>•\u270C\uFE0F•\u270C\uFE0F\U0001F3FB•\u270C\U0001F3FB•\U0001F6A3\U0001F3FD•\U0001F596\U0001F3FE•\u0020•</data> 17162e5b6d6dSopenharmony_ci# writing_hand fitz-1-2, splayed_hand/fitz-3, middle_finger/fitz-4, sign_of_horns/fitz-5, eye zwj left_speech_bubble, space 17172e5b6d6dSopenharmony_ci<data>•\u270D\U0001F3FB•\U0001F590\U0001F3FC•\U0001F595\U0001F3FD•\U0001F918\U0001F3FE•\U0001F441\u200D\U0001F5E8•\u0020•</data> 17182e5b6d6dSopenharmony_ci# family (man, boy, boy), family (woman, girl), space 17192e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\U0001F466\u200D\U0001F466•\U0001F469\u200D\U0001F467•\u0020•</data> 17202e5b6d6dSopenharmony_ci# rainbow flag using evar, rainbow flag no evar, space 17212e5b6d6dSopenharmony_ci<data>•\U0001F3F3\uFE0F\u200D\U0001F308•\U0001F3F3\u200D\U0001F308•\u0020•</data> 17222e5b6d6dSopenharmony_ci# woman with ball using evar, woman with ball no evar, woman with ball fitz 4 using evar, woman with ball fitz 4 no evar, space 17232e5b6d6dSopenharmony_ci<data>•\u26F9\uFE0F\u200D\u2640\uFE0F•\u26F9\u200D\u2640•\u26F9\U0001F3FD\u200D\u2640\uFE0F•\u26F9\U0001F3FD\u200D\u2640•\u0020•</data> 17242e5b6d6dSopenharmony_ci# woman runner using evar, woman runner no evar, woman runner fitz 4 using evar, woman runner fitz 4 no evar, space 17252e5b6d6dSopenharmony_ci<data>•\U0001F3C3\u200D\u2640\uFE0F•\U0001F3C3\u200D\u2640•\U0001F3C3\U0001F3FD\u200D\u2640\uFE0F•\U0001F3C3\U0001F3FD\u200D\u2640•\u0020•</data> 17262e5b6d6dSopenharmony_ci# 9.0 + professions 17272e5b6d6dSopenharmony_ci# black heart, fitz 4, squid, fitz4, man dancing /fitz4, mother xmas /fitz4 17282e5b6d6dSopenharmony_ci#TODO: <data>•\U0001F5A4•\U0001F3FD•\U0001F991•\U0001F3FD•\U0001F57A\U0001F3FD•\U0001F936\U0001F3FD•\u0020•</data> 17292e5b6d6dSopenharmony_ci# facepalm, facepalm / fitz4, facepalm + woman +var, facepalm + woman -var, facepalm/fitz4 + woman +var, facepalm/fitz4 + woman -var 17302e5b6d6dSopenharmony_ci<data>•\U0001F926•\U0001F926\U0001F3FD•\U0001F926\u200D\u2640\uFE0F•\U0001F926\u200D\u2640•\U0001F926\U0001F3FD\u200D\u2640\uFE0F•\U0001F926\U0001F3FD\u200D\u2640•\u0020•</data> 17312e5b6d6dSopenharmony_ci# handball, handball / fitz4, handball + man +var, handball + man -var, handball/fitz4 + man +var, handball/fitz4 + man -var 17322e5b6d6dSopenharmony_ci<data>•\U0001F93E•\U0001F93E\U0001F3FD•\U0001F93E\u200D\u2642\uFE0F•\U0001F93E\u200D\u2642•\U0001F93E\U0001F3FD\u200D\u2642\uFE0F•\U0001F93E\U0001F3FD\u200D\u2642•\u0020•</data> 17332e5b6d6dSopenharmony_ci# man health worker +var, man health worker -var, man health worker / fitz4 +var, man health worker / fitz4 -var 17342e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2695\uFE0F•\U0001F468\u200D\u2695•\U0001F468\U0001F3FD\u200D\u2695\uFE0F•\U0001F468\U0001F3FD\u200D\u2695•\u0020•</data> 17352e5b6d6dSopenharmony_ci# woman astronaut, woman astronaut / fitz4 17362e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F680•\U0001F469\U0001F3FD\u200D\U0001F680•\u0020•</data> 17372e5b6d6dSopenharmony_ci# modifier bases added in emoji 4.0 17382e5b6d6dSopenharmony_ci# SNOWBOARDER/fitz-3, HORSE RACING/fitz-4, GOLFER/fitz-5, MAN IN BUSINESS SUIT LEVITATING/fitz-6, SLEEPING ACCOMMODATION/fitz-12, space 17392e5b6d6dSopenharmony_ci<data>•\U0001F3C2\U0001F3FC•\U0001F3C7\U0001F3FD•\U0001F3CC\U0001F3FE•\U0001F574\U0001F3FF•\U0001F6CC\U0001F3FB•\u0020•</data> 17402e5b6d6dSopenharmony_ci# test kiss: man, man with variation selector 17412e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2764\uFE0F\u200D\U0001F48B\u200D\U0001F468•\u0020•</data> 17422e5b6d6dSopenharmony_ci 17432e5b6d6dSopenharmony_ci# flags1 AE AF AL AM AO AR AT 17442e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1EA•\U0001F1E6\U0001F1EB•\U0001F1E6\U0001F1F1•\U0001F1E6\U0001F1F2•\U0001F1E6\U0001F1F4•\U0001F1E6\U0001F1F7•\U0001F1E6\U0001F1F9•</data> 17452e5b6d6dSopenharmony_ci# flags2 AU AZ BA BD BE BF BG 17462e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1FA•\U0001F1E6\U0001F1FF•\U0001F1E7\U0001F1E6•\U0001F1E7\U0001F1E9•\U0001F1E7\U0001F1EA•\U0001F1E7\U0001F1EB•\U0001F1E7\U0001F1EC•</data> 17472e5b6d6dSopenharmony_ci# flags3 BH BJ BN BO BR BS BT 17482e5b6d6dSopenharmony_ci<data>•\U0001F1E7\U0001F1ED•\U0001F1E7\U0001F1EF•\U0001F1E7\U0001F1F3•\U0001F1E7\U0001F1F4•\U0001F1E7\U0001F1F7•\U0001F1E7\U0001F1F8•\U0001F1E7\U0001F1F9•</data> 17492e5b6d6dSopenharmony_ci# flags4 BW BY BZ CA CD CF CG 17502e5b6d6dSopenharmony_ci<data>•\U0001F1E7\U0001F1FC•\U0001F1E7\U0001F1FE•\U0001F1E7\U0001F1FF•\U0001F1E8\U0001F1E6•\U0001F1E8\U0001F1E9•\U0001F1E8\U0001F1EB•\U0001F1E8\U0001F1EC•</data> 17512e5b6d6dSopenharmony_ci# flags5 CH CI CL CM CN CO CR 17522e5b6d6dSopenharmony_ci<data>•\U0001F1E8\U0001F1ED•\U0001F1E8\U0001F1EE•\U0001F1E8\U0001F1F1•\U0001F1E8\U0001F1F2•\U0001F1E8\U0001F1F3•\U0001F1E8\U0001F1F4•\U0001F1E8\U0001F1F7•</data> 17532e5b6d6dSopenharmony_ci# flags6 CU CV CY CZ DE DJ DK 17542e5b6d6dSopenharmony_ci<data>•\U0001F1E8\U0001F1FA•\U0001F1E8\U0001F1FB•\U0001F1E8\U0001F1FE•\U0001F1E8\U0001F1FF•\U0001F1E9\U0001F1EA•\U0001F1E9\U0001F1EF•\U0001F1E9\U0001F1F0•</data> 17552e5b6d6dSopenharmony_ci# flags7 DM DO DZ EC EE EG ER 17562e5b6d6dSopenharmony_ci<data>•\U0001F1E9\U0001F1F2•\U0001F1E9\U0001F1F4•\U0001F1E9\U0001F1FF•\U0001F1EA\U0001F1E8•\U0001F1EA\U0001F1EA•\U0001F1EA\U0001F1EC•\U0001F1EA\U0001F1F7•</data> 17572e5b6d6dSopenharmony_ci# flags8 ES ET FI FJ FR GA GB 17582e5b6d6dSopenharmony_ci<data>•\U0001F1EA\U0001F1F8•\U0001F1EA\U0001F1F9•\U0001F1EB\U0001F1EE•\U0001F1EB\U0001F1EF•\U0001F1EB\U0001F1F7•\U0001F1EC\U0001F1E6•\U0001F1EC\U0001F1E7•</data> 17592e5b6d6dSopenharmony_ci# flags9 GE GH GM GN GR GT GW 17602e5b6d6dSopenharmony_ci<data>•\U0001F1EC\U0001F1EA•\U0001F1EC\U0001F1ED•\U0001F1EC\U0001F1F2•\U0001F1EC\U0001F1F3•\U0001F1EC\U0001F1F7•\U0001F1EC\U0001F1F9•\U0001F1EC\U0001F1FC•</data> 17612e5b6d6dSopenharmony_ci# flags10 GY HK HN HR HT HU ID 17622e5b6d6dSopenharmony_ci<data>•\U0001F1EC\U0001F1FE•\U0001F1ED\U0001F1F0•\U0001F1ED\U0001F1F3•\U0001F1ED\U0001F1F7•\U0001F1ED\U0001F1F9•\U0001F1ED\U0001F1FA•\U0001F1EE\U0001F1E9•</data> 17632e5b6d6dSopenharmony_ci# flags11 IE IL IN IQ IR IS IT 17642e5b6d6dSopenharmony_ci<data>•\U0001F1EE\U0001F1EA•\U0001F1EE\U0001F1F1•\U0001F1EE\U0001F1F3•\U0001F1EE\U0001F1F6•\U0001F1EE\U0001F1F7•\U0001F1EE\U0001F1F8•\U0001F1EE\U0001F1F9•</data> 17652e5b6d6dSopenharmony_ci# flags12 JM JO JP KE KG KH KR 17662e5b6d6dSopenharmony_ci<data>•\U0001F1EF\U0001F1F2•\U0001F1EF\U0001F1F4•\U0001F1EF\U0001F1F5•\U0001F1F0\U0001F1EA•\U0001F1F0\U0001F1EC•\U0001F1F0\U0001F1ED•\U0001F1F0\U0001F1F7•</data> 17672e5b6d6dSopenharmony_ci# flags13 MX MY NL NO PL PT 17682e5b6d6dSopenharmony_ci<data>•\U0001F1F2\U0001F1FD•\U0001F1F2\U0001F1FE•\U0001F1F3\U0001F1F1•\U0001F1F3\U0001F1F4•\U0001F1F5\U0001F1F1•\U0001F1F5\U0001F1F9•</data> 17692e5b6d6dSopenharmony_ci# flags14 RO RU SA SE SK TH TR 17702e5b6d6dSopenharmony_ci<data>•\U0001F1F7\U0001F1F4•\U0001F1F7\U0001F1FA•\U0001F1F8\U0001F1E6•\U0001F1F8\U0001F1EA•\U0001F1F8\U0001F1F0•\U0001F1F9\U0001F1ED•\U0001F1F9\U0001F1F7•</data> 17712e5b6d6dSopenharmony_ci# flags15 UA US VN XK ZW 17722e5b6d6dSopenharmony_ci<data>•\U0001F1FA\U0001F1E6•\U0001F1FA\U0001F1F8•\U0001F1FB\U0001F1F3•\U0001F1FD\U0001F1F0•\U0001F1FF\U0001F1FC•</data> 17732e5b6d6dSopenharmony_ci# flagsX1 ES ES ES SE SE SE 17742e5b6d6dSopenharmony_ci<data>•\U0001F1EA\U0001F1F8•\U0001F1EA\U0001F1F8•\U0001F1EA\U0001F1F8•\U0001F1F8\U0001F1EA•\U0001F1F8\U0001F1EA•\U0001F1F8\U0001F1EA•</data> 17752e5b6d6dSopenharmony_ci# flagsX2 GB GB GB BG BG BG 17762e5b6d6dSopenharmony_ci<data>•\U0001F1EC\U0001F1E7•\U0001F1EC\U0001F1E7•\U0001F1EC\U0001F1E7•\U0001F1E7\U0001F1EC•\U0001F1E7\U0001F1EC•\U0001F1E7\U0001F1EC•</data> 17772e5b6d6dSopenharmony_ci# flagsXtnd AE AF AL AM AO AR 17782e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1EA\u200C•\U0001F1E6\U0001F1EB\u200C•\U0001F1E6\U0001F1F1\u200C•\U0001F1E6\U0001F1F2\u0300•\U0001F1E6\U0001F1F4\u20DE•\U0001F1E6\U0001F1F7\u200C•</data> 17792e5b6d6dSopenharmony_ci 17802e5b6d6dSopenharmony_ci<word> 17812e5b6d6dSopenharmony_ci# woman zwj woman zwj girl zwj girl, woman/fitz-1-2 zwj woman/fitz-4 zwj boy/fitz-6 17822e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F469\u200D\U0001F467\u200D\U0001F467•\U0001F469\U0001F3FB\u200D\U0001F469\U0001F3FD\u200D\U0001F466\U0001F3FF•</data> 17832e5b6d6dSopenharmony_ci# woman zwj baby/fitz-3, older_woman/fitz-5, runner/fitz-4, raised_fist/fitz-3, fuel_pump, fitz-3 17842e5b6d6dSopenharmony_ci#TODO: <data>•\U0001F469\u200D\U0001F476\U0001F3FC•\U0001F475\U0001F3FE•\U0001F3C3\U0001F3FD•\u270A\U0001F3FC•\u26FD•\U0001F3FC•</data> 17852e5b6d6dSopenharmony_ci# man zwj hvy_blk_heart zwj man, woman, man zwj hvy_blk_heart/evar zwj man, woman 17862e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2764\u200D\U0001F468•\U0001F469•\U0001F468\u200D\u2764\uFE0F\u200D\U0001F468•\U0001F469•</data> 17872e5b6d6dSopenharmony_ci# woman zwj hvy_blk_heart/evar zwj kiss_mark zwj woman, sleuth/fitz-4, horse_racing/fitz-5 17882e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\u2764\uFE0F\u200D\U0001F48B\u200D\U0001F469•\U0001F575\U0001F3FD•\U0001F3C7\U0001F3FE•</data> 17892e5b6d6dSopenharmony_ci# victory_hand/evar, victory_hand/evar/fitz-1-2, victory_hand/fitz-1-2, rowboat/fitz-4, vulcan_salute/fitz-5, space, 17902e5b6d6dSopenharmony_ci<data>•\u270C\uFE0F•\u270C\uFE0F\U0001F3FB•\u270C\U0001F3FB•\U0001F6A3\U0001F3FD•\U0001F596\U0001F3FE•\u0020•</data> 17912e5b6d6dSopenharmony_ci# writing_hand fitz-1-2, splayed_hand/fitz-3, middle_finger/fitz-4, sign_of_horns/fitz-5, eye zwj left_speech_bubble, space 17922e5b6d6dSopenharmony_ci<data>•\u270D\U0001F3FB•\U0001F590\U0001F3FC•\U0001F595\U0001F3FD•\U0001F918\U0001F3FE•\U0001F441\u200D\U0001F5E8•\u0020•</data> 17932e5b6d6dSopenharmony_ci# family (man, boy, boy), family (woman, girl), space 17942e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\U0001F466\u200D\U0001F466•\U0001F469\u200D\U0001F467•\u0020•</data> 17952e5b6d6dSopenharmony_ci# rainbow flag using evar, rainbow flag no evar, space 17962e5b6d6dSopenharmony_ci<data>•\U0001F3F3\uFE0F\u200D\U0001F308•\U0001F3F3\u200D\U0001F308•\u0020•</data> 17972e5b6d6dSopenharmony_ci# woman with ball using evar, woman with ball no evar, woman with ball fitz 4 using evar, woman with ball fitz 4 no evar, space 17982e5b6d6dSopenharmony_ci<data>•\u26F9\uFE0F\u200D\u2640\uFE0F•\u26F9\u200D\u2640•\u26F9\U0001F3FD\u200D\u2640\uFE0F•\u26F9\U0001F3FD\u200D\u2640•\u0020•</data> 17992e5b6d6dSopenharmony_ci# woman runner using evar, woman runner no evar, woman runner fitz 4 using evar, woman runner fitz 4 no evar, space 18002e5b6d6dSopenharmony_ci<data>•\U0001F3C3\u200D\u2640\uFE0F•\U0001F3C3\u200D\u2640•\U0001F3C3\U0001F3FD\u200D\u2640\uFE0F•\U0001F3C3\U0001F3FD\u200D\u2640•\u0020•</data> 18012e5b6d6dSopenharmony_ci# 9.0 + professions 18022e5b6d6dSopenharmony_ci# black heart, fitz 4, squid, fitz4, man dancing /fitz4, mother xmas /fitz4 18032e5b6d6dSopenharmony_ci#TODO: <data>•\U0001F5A4•\U0001F3FD•\U0001F991•\U0001F3FD•\U0001F57A\U0001F3FD•\U0001F936\U0001F3FD•\u0020•</data> 18042e5b6d6dSopenharmony_ci# facepalm, facepalm / fitz4, facepalm + woman +evar, facepalm + woman -evar, facepalm/fitz4 + woman +evar, facepalm/fitz4 + woman -evar 18052e5b6d6dSopenharmony_ci<data>•\U0001F926•\U0001F926\U0001F3FD•\U0001F926\u200D\u2640\uFE0F•\U0001F926\u200D\u2640•\U0001F926\U0001F3FD\u200D\u2640\uFE0F•\U0001F926\U0001F3FD\u200D\u2640•\u0020•</data> 18062e5b6d6dSopenharmony_ci# handball, handball / fitz4, handball + man +evar, handball + man -evar, handball/fitz4 + man +evar, handball/fitz4 + man -evar 18072e5b6d6dSopenharmony_ci<data>•\U0001F93E•\U0001F93E\U0001F3FD•\U0001F93E\u200D\u2642\uFE0F•\U0001F93E\u200D\u2642•\U0001F93E\U0001F3FD\u200D\u2642\uFE0F•\U0001F93E\U0001F3FD\u200D\u2642•\u0020•</data> 18082e5b6d6dSopenharmony_ci# man health worker +evar, man health worker -evar, man health worker / fitz4 +evar, man health worker / fitz4 -evar 18092e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2695\uFE0F•\U0001F468\u200D\u2695•\U0001F468\U0001F3FD\u200D\u2695\uFE0F•\U0001F468\U0001F3FD\u200D\u2695•\u0020•</data> 18102e5b6d6dSopenharmony_ci# woman astronaut, woman astronaut / fitz4 18112e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F680•\U0001F469\U0001F3FD\u200D\U0001F680•\u0020•</data> 18122e5b6d6dSopenharmony_ci# modifier bases added in emoji 4.0 18132e5b6d6dSopenharmony_ci# SNOWBOARDER/fitz-3, HORSE RACING/fitz-4, GOLFER/fitz-5, MAN IN BUSINESS SUIT LEVITATING/fitz-6, SLEEPING ACCOMMODATION/fitz-12, space 18142e5b6d6dSopenharmony_ci<data>•\U0001F3C2\U0001F3FC•\U0001F3C7\U0001F3FD•\U0001F3CC\U0001F3FE•\U0001F574\U0001F3FF•\U0001F6CC\U0001F3FB•\u0020•</data> 18152e5b6d6dSopenharmony_ci 18162e5b6d6dSopenharmony_ci# flags1 AE AF AL AM AO AR AT 18172e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1EA•\U0001F1E6\U0001F1EB•\U0001F1E6\U0001F1F1•\U0001F1E6\U0001F1F2•\U0001F1E6\U0001F1F4•\U0001F1E6\U0001F1F7•\U0001F1E6\U0001F1F9•</data> 18182e5b6d6dSopenharmony_ci# flags2 AU AZ BA BD BE BF BG 18192e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1FA•\U0001F1E6\U0001F1FF•\U0001F1E7\U0001F1E6•\U0001F1E7\U0001F1E9•\U0001F1E7\U0001F1EA•\U0001F1E7\U0001F1EB•\U0001F1E7\U0001F1EC•</data> 18202e5b6d6dSopenharmony_ci# flags3 BH BJ BN BO BR BS BT 18212e5b6d6dSopenharmony_ci<data>•\U0001F1E7\U0001F1ED•\U0001F1E7\U0001F1EF•\U0001F1E7\U0001F1F3•\U0001F1E7\U0001F1F4•\U0001F1E7\U0001F1F7•\U0001F1E7\U0001F1F8•\U0001F1E7\U0001F1F9•</data> 18222e5b6d6dSopenharmony_ci# flags4 BW BY BZ CA CD CF CG 18232e5b6d6dSopenharmony_ci<data>•\U0001F1E7\U0001F1FC•\U0001F1E7\U0001F1FE•\U0001F1E7\U0001F1FF•\U0001F1E8\U0001F1E6•\U0001F1E8\U0001F1E9•\U0001F1E8\U0001F1EB•\U0001F1E8\U0001F1EC•</data> 18242e5b6d6dSopenharmony_ci# flags5 CH CI CL CM CN CO CR 18252e5b6d6dSopenharmony_ci<data>•\U0001F1E8\U0001F1ED•\U0001F1E8\U0001F1EE•\U0001F1E8\U0001F1F1•\U0001F1E8\U0001F1F2•\U0001F1E8\U0001F1F3•\U0001F1E8\U0001F1F4•\U0001F1E8\U0001F1F7•</data> 18262e5b6d6dSopenharmony_ci# flags6 CU CV CY CZ DE DJ DK 18272e5b6d6dSopenharmony_ci<data>•\U0001F1E8\U0001F1FA•\U0001F1E8\U0001F1FB•\U0001F1E8\U0001F1FE•\U0001F1E8\U0001F1FF•\U0001F1E9\U0001F1EA•\U0001F1E9\U0001F1EF•\U0001F1E9\U0001F1F0•</data> 18282e5b6d6dSopenharmony_ci# flags7 DM DO DZ EC EE EG ER 18292e5b6d6dSopenharmony_ci<data>•\U0001F1E9\U0001F1F2•\U0001F1E9\U0001F1F4•\U0001F1E9\U0001F1FF•\U0001F1EA\U0001F1E8•\U0001F1EA\U0001F1EA•\U0001F1EA\U0001F1EC•\U0001F1EA\U0001F1F7•</data> 18302e5b6d6dSopenharmony_ci# flags8 ES ET FI FJ FR GA GB 18312e5b6d6dSopenharmony_ci<data>•\U0001F1EA\U0001F1F8•\U0001F1EA\U0001F1F9•\U0001F1EB\U0001F1EE•\U0001F1EB\U0001F1EF•\U0001F1EB\U0001F1F7•\U0001F1EC\U0001F1E6•\U0001F1EC\U0001F1E7•</data> 18322e5b6d6dSopenharmony_ci# flags9 GE GH GM GN GR GT GW 18332e5b6d6dSopenharmony_ci<data>•\U0001F1EC\U0001F1EA•\U0001F1EC\U0001F1ED•\U0001F1EC\U0001F1F2•\U0001F1EC\U0001F1F3•\U0001F1EC\U0001F1F7•\U0001F1EC\U0001F1F9•\U0001F1EC\U0001F1FC•</data> 18342e5b6d6dSopenharmony_ci# flags10 GY HK HN HR HT HU ID 18352e5b6d6dSopenharmony_ci<data>•\U0001F1EC\U0001F1FE•\U0001F1ED\U0001F1F0•\U0001F1ED\U0001F1F3•\U0001F1ED\U0001F1F7•\U0001F1ED\U0001F1F9•\U0001F1ED\U0001F1FA•\U0001F1EE\U0001F1E9•</data> 18362e5b6d6dSopenharmony_ci# flags11 IE IL IN IQ IR IS IT 18372e5b6d6dSopenharmony_ci<data>•\U0001F1EE\U0001F1EA•\U0001F1EE\U0001F1F1•\U0001F1EE\U0001F1F3•\U0001F1EE\U0001F1F6•\U0001F1EE\U0001F1F7•\U0001F1EE\U0001F1F8•\U0001F1EE\U0001F1F9•</data> 18382e5b6d6dSopenharmony_ci# flags12 JM JO JP KE KG KH KR 18392e5b6d6dSopenharmony_ci<data>•\U0001F1EF\U0001F1F2•\U0001F1EF\U0001F1F4•\U0001F1EF\U0001F1F5•\U0001F1F0\U0001F1EA•\U0001F1F0\U0001F1EC•\U0001F1F0\U0001F1ED•\U0001F1F0\U0001F1F7•</data> 18402e5b6d6dSopenharmony_ci# flags13 MX MY NL NO PL PT 18412e5b6d6dSopenharmony_ci<data>•\U0001F1F2\U0001F1FD•\U0001F1F2\U0001F1FE•\U0001F1F3\U0001F1F1•\U0001F1F3\U0001F1F4•\U0001F1F5\U0001F1F1•\U0001F1F5\U0001F1F9•</data> 18422e5b6d6dSopenharmony_ci# flags14 RO RU SA SE SK TH TR 18432e5b6d6dSopenharmony_ci<data>•\U0001F1F7\U0001F1F4•\U0001F1F7\U0001F1FA•\U0001F1F8\U0001F1E6•\U0001F1F8\U0001F1EA•\U0001F1F8\U0001F1F0•\U0001F1F9\U0001F1ED•\U0001F1F9\U0001F1F7•</data> 18442e5b6d6dSopenharmony_ci# flags15 UA US VN XK ZW 18452e5b6d6dSopenharmony_ci<data>•\U0001F1FA\U0001F1E6•\U0001F1FA\U0001F1F8•\U0001F1FB\U0001F1F3•\U0001F1FD\U0001F1F0•\U0001F1FF\U0001F1FC•</data> 18462e5b6d6dSopenharmony_ci# flagsX1 ES ES ES SE SE SE 18472e5b6d6dSopenharmony_ci<data>•\U0001F1EA\U0001F1F8•\U0001F1EA\U0001F1F8•\U0001F1EA\U0001F1F8•\U0001F1F8\U0001F1EA•\U0001F1F8\U0001F1EA•\U0001F1F8\U0001F1EA•</data> 18482e5b6d6dSopenharmony_ci# flagsX2 GB GB GB BG BG BG 18492e5b6d6dSopenharmony_ci<data>•\U0001F1EC\U0001F1E7•\U0001F1EC\U0001F1E7•\U0001F1EC\U0001F1E7•\U0001F1E7\U0001F1EC•\U0001F1E7\U0001F1EC•\U0001F1E7\U0001F1EC•</data> 18502e5b6d6dSopenharmony_ci# flagsXtnd AE AF AL AM AO AR 18512e5b6d6dSopenharmony_ci<data>•\U0001F1E6\U0001F1EA\u200C•\U0001F1E6\U0001F1EB\u200C•\U0001F1E6\U0001F1F1\u200C•\U0001F1E6\U0001F1F2\u0300•\U0001F1E6\U0001F1F4\u20DE•\U0001F1E6\U0001F1F7\u200C•</data> 18522e5b6d6dSopenharmony_ci 18532e5b6d6dSopenharmony_ci<line> 18542e5b6d6dSopenharmony_ci# woman zwj woman zwj girl zwj girl 18552e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F469\u200D\U0001F467\u200D\U0001F467•</data> 18562e5b6d6dSopenharmony_ci# woman zwj baby/fitz-3, older_woman/fitz-5, runner/fitz-4, raised_fist/fitz-3, fuel_pump, fitz-3 18572e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F476\U0001F3FC•\U0001F475\U0001F3FE•\U0001F3C3\U0001F3FD•\u270A\U0001F3FC•\u26FD•\U0001F3FC•</data> 18582e5b6d6dSopenharmony_ci# man zwj hvy_blk_heart zwj man, woman, man zwj hvy_blk_heart/evar zwj man, woman 18592e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2764\u200D\U0001F468•\U0001F469•\U0001F468\u200D\u2764\uFE0F\u200D\U0001F468•\U0001F469•</data> 18602e5b6d6dSopenharmony_ci# woman zwj hvy_blk_heart/evar zwj kiss_mark zwj woman, sleuth/fitz-4, horse_racing/fitz-5 18612e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\u2764\uFE0F\u200D\U0001F48B\u200D\U0001F469•\U0001F575\U0001F3FD•\U0001F3C7\U0001F3FE•</data> 18622e5b6d6dSopenharmony_ci# victory_hand/evar, victory_hand/evar/fitz-1-2, victory_hand/fitz-1-2, rowboat/fitz-4, vulcan_salute/fitz-5 space, 18632e5b6d6dSopenharmony_ci<data>•\u270C\uFE0F•\u270C\uFE0F\U0001F3FB•\u270C\U0001F3FB•\U0001F6A3\U0001F3FD•\U0001F596\U0001F3FE\u0020•</data> 18642e5b6d6dSopenharmony_ci# writing_hand fitz-1-2, splayed_hand/fitz-3, middle_finger/fitz-4, sign_of_horns/fitz-5, eye zwj left_speech_bubble, space 18652e5b6d6dSopenharmony_ci<data>•\u270D\U0001F3FB•\U0001F590\U0001F3FC•\U0001F595\U0001F3FD•\U0001F918\U0001F3FE•\U0001F441\u200D\U0001F5E8\u0020•</data> 18662e5b6d6dSopenharmony_ci# family (man, boy, boy), family (woman, girl), space 18672e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\U0001F466\u200D\U0001F466•\U0001F469\u200D\U0001F467\u0020•</data> 18682e5b6d6dSopenharmony_ci# rainbow flag using evar, rainbow flag no evar, space 18692e5b6d6dSopenharmony_ci<data>•\U0001F3F3\uFE0F\u200D\U0001F308•\U0001F3F3\u200D\U0001F308\u0020•</data> 18702e5b6d6dSopenharmony_ci# woman with ball using evar, woman with ball no evar, woman with ball fitz 4 using evar, woman with ball fitz 4 no evar, space 18712e5b6d6dSopenharmony_ci<data>•\u26F9\uFE0F\u200D\u2640\uFE0F•\u26F9\u200D\u2640•\u26F9\U0001F3FD\u200D\u2640\uFE0F•\u26F9\U0001F3FD\u200D\u2640\u0020•</data> 18722e5b6d6dSopenharmony_ci# woman runner using evar, woman runner no evar, woman runner fitz 4 using evar, woman runner fitz 4 no evar, space 18732e5b6d6dSopenharmony_ci<data>•\U0001F3C3\u200D\u2640\uFE0F•\U0001F3C3\u200D\u2640•\U0001F3C3\U0001F3FD\u200D\u2640\uFE0F•\U0001F3C3\U0001F3FD\u200D\u2640\u0020•</data> 18742e5b6d6dSopenharmony_ci# 9.0 + professions 18752e5b6d6dSopenharmony_ci# black heart, fitz 4, squid, fitz4, man dancing /fitz4, mother xmas /fitz4 18762e5b6d6dSopenharmony_ci<data>•\U0001F5A4•\U0001F3FD•\U0001F991•\U0001F3FD•\U0001F57A\U0001F3FD•\U0001F936\U0001F3FD\u0020•</data> 18772e5b6d6dSopenharmony_ci# facepalm, facepalm / fitz4, facepalm + woman +evar, facepalm + woman -evar, facepalm/fitz4 + woman +evar, facepalm/fitz4 + woman -evar 18782e5b6d6dSopenharmony_ci<data>•\U0001F926•\U0001F926\U0001F3FD•\U0001F926\u200D\u2640\uFE0F•\U0001F926\u200D\u2640•\U0001F926\U0001F3FD\u200D\u2640\uFE0F•\U0001F926\U0001F3FD\u200D\u2640\u0020•</data> 18792e5b6d6dSopenharmony_ci# handball, handball / fitz4, handball + man +evar, handball + man -evar, handball/fitz4 + man +evar, handball/fitz4 + man -evar 18802e5b6d6dSopenharmony_ci<data>•\U0001F93E•\U0001F93E\U0001F3FD•\U0001F93E\u200D\u2642\uFE0F•\U0001F93E\u200D\u2642•\U0001F93E\U0001F3FD\u200D\u2642\uFE0F•\U0001F93E\U0001F3FD\u200D\u2642\u0020•</data> 18812e5b6d6dSopenharmony_ci# man health worker +evar, man health worker -evar, man health worker / fitz4 +evar, man health worker / fitz4 -evar 18822e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2695\uFE0F•\U0001F468\u200D\u2695•\U0001F468\U0001F3FD\u200D\u2695\uFE0F•\U0001F468\U0001F3FD\u200D\u2695\u0020•</data> 18832e5b6d6dSopenharmony_ci# woman astronaut, woman astronaut / fitz4 18842e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F680•\U0001F469\U0001F3FD\u200D\U0001F680\u0020•</data> 18852e5b6d6dSopenharmony_ci# modifier bases added in emoji 4.0 18862e5b6d6dSopenharmony_ci# SNOWBOARDER/fitz-3, HORSE RACING/fitz-4, GOLFER/fitz-5, MAN IN BUSINESS SUIT LEVITATING/fitz-6, SLEEPING ACCOMMODATION/fitz-12 space 18872e5b6d6dSopenharmony_ci<data>•\U0001F3C2\U0001F3FC•\U0001F3C7\U0001F3FD•\U0001F3CC\U0001F3FE•\U0001F574\U0001F3FF•\U0001F6CC\U0001F3FB\u0020•</data> 18882e5b6d6dSopenharmony_ci 18892e5b6d6dSopenharmony_ci<locale ja@lb=loose> 18902e5b6d6dSopenharmony_ci<line> 18912e5b6d6dSopenharmony_ci# woman zwj woman zwj girl zwj girl 18922e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F469\u200D\U0001F467\u200D\U0001F467•</data> 18932e5b6d6dSopenharmony_ci# woman zwj baby/fitz-3, older_woman/fitz-5, runner/fitz-4, raised_fist/fitz-3, fuel_pump, fitz-3 18942e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F476\U0001F3FC•\U0001F475\U0001F3FE•\U0001F3C3\U0001F3FD•\u270A\U0001F3FC•\u26FD•\U0001F3FC•</data> 18952e5b6d6dSopenharmony_ci# man zwj hvy_blk_heart zwj man, woman, man zwj hvy_blk_heart/evar zwj man, woman 18962e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2764\u200D\U0001F468•\U0001F469•\U0001F468\u200D\u2764\uFE0F\u200D\U0001F468•\U0001F469•</data> 18972e5b6d6dSopenharmony_ci# woman zwj hvy_blk_heart/evar zwj kiss_mark zwj woman, sleuth/fitz-4, horseRacing/fitz-5 18982e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\u2764\uFE0F\u200D\U0001F48B\u200D\U0001F469•\U0001F575\U0001F3FD•\U0001F3C7\U0001F3FE•</data> 18992e5b6d6dSopenharmony_ci# victory_hand/evar, victory_hand/evar/fitz-1-2, victory_hand/fitz-1-2, rowboat/fitz-4, vulcan_salute/fitz-5 space, 19002e5b6d6dSopenharmony_ci<data>•\u270C\uFE0F•\u270C\uFE0F\U0001F3FB•\u270C\U0001F3FB•\U0001F6A3\U0001F3FD•\U0001F596\U0001F3FE\u0020•</data> 19012e5b6d6dSopenharmony_ci# writing_hand fitz-1-2, splayed_hand/fitz-3, middle_finger/fitz-4, sign_of_horns/fitz-5, eye zwj left_speech_bubble, space 19022e5b6d6dSopenharmony_ci<data>•\u270D\U0001F3FB•\U0001F590\U0001F3FC•\U0001F595\U0001F3FD•\U0001F918\U0001F3FE•\U0001F441\u200D\U0001F5E8\u0020•</data> 19032e5b6d6dSopenharmony_ci# 9.0 + professions 19042e5b6d6dSopenharmony_ci# black heart, fitz 4, squid, fitz4, man dancing /fitz4, mother xmas /fitz4 19052e5b6d6dSopenharmony_ci<data>•\U0001F5A4•\U0001F3FD•\U0001F991•\U0001F3FD•\U0001F57A\U0001F3FD•\U0001F936\U0001F3FD\u0020•</data> 19062e5b6d6dSopenharmony_ci# facepalm, facepalm / fitz4, facepalm + woman +evar, facepalm + woman -evar, facepalm/fitz4 + woman +evar, facepalm/fitz4 + woman -evar 19072e5b6d6dSopenharmony_ci<data>•\U0001F926•\U0001F926\U0001F3FD•\U0001F926\u200D\u2640\uFE0F•\U0001F926\u200D\u2640•\U0001F926\U0001F3FD\u200D\u2640\uFE0F•\U0001F926\U0001F3FD\u200D\u2640\u0020•</data> 19082e5b6d6dSopenharmony_ci# handball, handball / fitz4, handball + man +evar, handball + man -evar, handball/fitz4 + man +evar, handball/fitz4 + man -evar 19092e5b6d6dSopenharmony_ci<data>•\U0001F93E•\U0001F93E\U0001F3FD•\U0001F93E\u200D\u2642\uFE0F•\U0001F93E\u200D\u2642•\U0001F93E\U0001F3FD\u200D\u2642\uFE0F•\U0001F93E\U0001F3FD\u200D\u2642\u0020•</data> 19102e5b6d6dSopenharmony_ci# man health worker +evar, man health worker -evar, man health worker / fitz4 +evar, man health worker / fitz4 -evar 19112e5b6d6dSopenharmony_ci<data>•\U0001F468\u200D\u2695\uFE0F•\U0001F468\u200D\u2695•\U0001F468\U0001F3FD\u200D\u2695\uFE0F•\U0001F468\U0001F3FD\u200D\u2695\u0020•</data> 19122e5b6d6dSopenharmony_ci# woman astronaut, woman astronaut / fitz4 19132e5b6d6dSopenharmony_ci<data>•\U0001F469\u200D\U0001F680•\U0001F469\U0001F3FD\u200D\U0001F680\u0020•</data> 19142e5b6d6dSopenharmony_ci 19152e5b6d6dSopenharmony_ci<locale ja@lw=phrase> 19162e5b6d6dSopenharmony_ci<line> 19172e5b6d6dSopenharmony_ci#[京都観光]時雨殿に行った。-> [京都•観光]•時雨•殿に•行った。• 19182e5b6d6dSopenharmony_ci<data>•\uff3b\u4eac\u90fd•\u89b3\u5149\uff3d•\u6642\u96e8•\u6bbf\u306b•\u884c\u3063\u305f\u3002•</data> 19192e5b6d6dSopenharmony_ci#9月に東京から友達が遊びに来た -> 9月に•東京から•友達が•遊びに•来た• 19202e5b6d6dSopenharmony_ci<data>•\uff19\u6708\u306b•\u6771\u4eac\u304b\u3089•\u53cb\u9054\u304c•\u904a\u3073\u306b•\u6765\u305f•</data> 19212e5b6d6dSopenharmony_ci#る文字「そうだ、京都」-> る•文字•「そうだ、•京都」• 19222e5b6d6dSopenharmony_ci<data>•\u308b•\u6587\u5b57•\u300c\u305d\u3046\u3060\u3001•\u4eac\u90fd\u300d•</data> 19232e5b6d6dSopenharmony_ci#乗車率90%程度だろうか。 -> 乗車•率•90%•程度だろうか。• 19242e5b6d6dSopenharmony_ci<data>•\u4e57\u8eca•\u7387•\uff19\uff10\uff05•\u7a0b\u5ea6\u3060\u308d\u3046\u304b\u3002•</data> 19252e5b6d6dSopenharmony_ci#[携帯電話]正しい選択 -> [携帯•電話]•正しい•選択• 19262e5b6d6dSopenharmony_ci<data>•\uff3b\u643a\u5e2f•\u96fb\u8a71\uff3d•\u6b63\u3057\u3044•\u9078\u629e•</data> 19272e5b6d6dSopenharmony_ci#純金製百人一首にサッカーボール -> 純金•製•百人一首に•サッカーボール 19282e5b6d6dSopenharmony_ci<data>•\u7D14\u91D1•\u88FD•\u767E\u4EBA\u4E00\u9996\u306B•\u30B5\u30C3\u30AB\u30FC\u30DC\u30FC\u30EB•</data> 19292e5b6d6dSopenharmony_ci#Kana supplement: (U+1B048) -> \uD82C\uDC48, (U+1B038) -> \uD82C\uDC38, (U+1B019)-> \uD82C\uDC19</data> 19302e5b6d6dSopenharmony_ci#る(しるこ)、あよろし(あかよろし) -> る•(しるこ)、•あよろし•(あ•かよろし) 19312e5b6d6dSopenharmony_ci<data>•\uD82C\uDC48\u308B\uD82C\uDC38•\uFF08\u3057\u308B\u3053\uFF09\u3001•\u3042\uD82C\uDC19\u3088\u308D\u3057•\uFF08\u3042•\u304B\u3088\u308D\u3057\uFF09•</data> 19322e5b6d6dSopenharmony_ci#最初に目に入るのは、「許諾なき写真禁止」のサインである。 -> 最初に▁目に▁入るのは、▁「許諾なき▁写真▁禁止」▁の▁サインで▁ある。 19332e5b6d6dSopenharmony_ci<data>•\u6700\u521D\u306B•\u76EE\u306B•\u5165\u308B\u306E\u306F\u3001•\u300C\u8A31\u8AFE\u306A\u304D•\u5199\u771F•\u7981\u6B62\u300D•\u306E•\u30B5\u30A4\u30F3\u3067•\u3042\u308B\u3002•</data> 19342e5b6d6dSopenharmony_ci#docomoのサイト情報によると、78000パケット以上▁使うならパケ放題がいいとか -> docomoの▁サイト▁情報によると、▁78000パケット▁以上▁使うなら▁パケ▁放題が▁いい▁とか 19352e5b6d6dSopenharmony_ci<data>•\uFF44\uFF4F\uFF43\uFF4F\uFF4D\uFF4F\u306E•\u30B5\u30A4\u30C8•\u60C5\u5831\u306B\u3088\u308B\u3068\u3001•\uFF17\uFF18\uFF10\uFF10\uFF10\u30D1\u30B1\u30C3\u30C8•\u4EE5\u4E0A•\u4F7F\u3046\u306A\u3089•\u30D1\u30B1•\u653E\u984C\u304C•\u3044\u3044•\u3068\u304B•</data> 19362e5b6d6dSopenharmony_ci#日本の携帯はCDMAの形だといわれています -> 日本の▁携帯は▁CDMAの▁形だと▁いわれ▁ています 19372e5b6d6dSopenharmony_ci<data>•\u65E5\u672C\u306E•\u643A\u5E2F\u306F•\uFF23\uFF24\uFF2D\uFF21\u306E•\u5F62\u3060\u3068•\u3044\u308F\u308C•\u3066\u3044\u307E\u3059•</data> 19382e5b6d6dSopenharmony_ci#中国の携帯は約500元から5000元です -> 中国の▁携帯は▁約▁500元から▁5000元です 19392e5b6d6dSopenharmony_ci<data>•\u4E2D\u56FD\u306E•\u643A\u5E2F\u306F•\u7D04•\uFF15\uFF10\uFF10\u5143\u304B\u3089•\uFF15\uFF10\uFF10\uFF10\u5143\u3067\u3059•</data> 19402e5b6d6dSopenharmony_ci#プリペイドカード携帯布教 -> プリペイドカード▁携帯▁布教 19412e5b6d6dSopenharmony_ci<data>•\u30D7\u30EA\u30DA\u30A4\u30C9\u30AB\u30FC\u30C9•\u643A\u5E2F•\u5E03\u6559•</data> 19422e5b6d6dSopenharmony_ci#しかもロゴがUnicode!! -> しかも▁ロゴが▁Unicode!! 19432e5b6d6dSopenharmony_ci<data>•\u3057\u304B\u3082•\u30ED\u30B4\u304C•\uFF35\uFF4E\uFF49\uFF43\uFF4F\uFF44\uFF45\uFF01\uFF01•</data> 19442e5b6d6dSopenharmony_ci#しかし、これらのサービスは6年間使ってこなかった自分にとっては特に必要でないものであり、これからこの機能が加わったからといって特別ハッピーなわけでもない。 19452e5b6d6dSopenharmony_ci#-> しかし、▁これらの▁サービスは▁6年間▁使ってこなかった▁自分にとっては▁特に▁必要でない▁もので▁あり、▁これから▁この▁機能が▁加わったから▁といって▁特別▁ハッピーな▁わけでもない。 19462e5b6d6dSopenharmony_ci<data>•\u3057\u304B\u3057\u3001•\u3053\u308C\u3089\u306E•\u30B5\u30FC\u30D3\u30B9\u306F•\uFF16\u5E74\u9593•\u4F7F\u3063\u3066\u3053\u306A\u304B\u3063\u305F•\u81EA\u5206\u306B\u3068\u3063\u3066\u306F•\u7279\u306B•\u5FC5\u8981\u3067\u306A\u3044•\u3082\u306E\u3067•\u3042\u308A\u3001•\u3053\u308C\u304B\u3089•\u3053\u306E•\u6A5F\u80FD\u304C•\u52A0\u308F\u3063\u305F\u304B\u3089•\u3068\u3044\u3063\u3066•\u7279\u5225•\u30CF\u30C3\u30D4\u30FC\u306A•\u308F\u3051\u3067\u3082\u306A\u3044\u3002•</data> 19472e5b6d6dSopenharmony_ci#自由が丘での三ヶ月の生活を通して得られる経験 -> 自由が丘での▁三ヶ月の▁生活を通して▁得られる▁経験 19482e5b6d6dSopenharmony_ci<data>•\u81EA\u7531\u304C\u4E18\u3067\u306E•\u4E09\u30F6\u6708\u306E•\u751F\u6D3B\u3092\u901A\u3057\u3066•\u5F97\u3089\u308C\u308B•\u7D4C\u9A13•</data> 19492e5b6d6dSopenharmony_ci#幕の内弁当がよく買われるらしいです。 -> 幕の内▁弁当が▁よく▁買われるらしいです。 19502e5b6d6dSopenharmony_ci<data>•\u5E55\u306E\u5185•\u5F01\u5F53\u304C•\u3088\u304F•\u8CB7\u308F\u308C\u308B\u3089\u3057\u3044\u3067\u3059\u3002•</data> 19512e5b6d6dSopenharmony_ci#その名の通りに伝われと望まれています。 -> その▁名の通りに▁伝われと▁望まれ▁ています。 19522e5b6d6dSopenharmony_ci<data>•\u305D\u306E•\u540D\u306E\u901A\u308A\u306B•\u4F1D\u308F\u308C\u3068•\u671B\u307E\u308C•\u3066\u3044\u307E\u3059\u3002•</data> 19532e5b6d6dSopenharmony_ci#下鴨神社の糺の森は、静かだし、ちょっとした空き時間の散歩にいいですよ♪ -> 下鴨▁神社の▁糺の森は、▁静か▁だし、▁ちょっとした▁空き▁時間の▁散歩に▁いいですよ♪ 19542e5b6d6dSopenharmony_ci<data>•\u4E0B\u9D28•\u795E\u793E\u306E•\u7CFA\u306E\u68EE\u306F\u3001•\u9759\u304B•\u3060\u3057\u3001•\u3061\u3087\u3063\u3068\u3057\u305F•\u7A7A\u304D•\u6642\u9593\u306E•\u6563\u6B69\u306B•\u3044\u3044\u3067\u3059\u3088\u266A•</data> 19552e5b6d6dSopenharmony_ci#この論理は、別の弱点から逃避するためによく使われるので注意が必要ですが。 -> この▁論理は、▁別の▁弱点から▁逃避▁する▁ために▁よく▁使われるので▁注意が▁必要▁ですが。 19562e5b6d6dSopenharmony_ci<data>•\u3053\u306E•\u8AD6\u7406\u306F\u3001•\u5225\u306E•\u5F31\u70B9\u304B\u3089•\u9003\u907F•\u3059\u308B•\u305F\u3081\u306B•\u3088\u304F•\u4F7F\u308F\u308C\u308B\u306E\u3067•\u6CE8\u610F\u304C•\u5FC5\u8981•\u3067\u3059\u304C\u3002•</data> 19572e5b6d6dSopenharmony_ci#バッテリーを長持ちさせ、充電を最適化します -> バッテリーを▁長持ちさせ、▁充電を▁最適化します 19582e5b6d6dSopenharmony_ci<data>•\u30D0\u30C3\u30C6\u30EA\u30FC\u3092•\u9577\u6301\u3061\u3055\u305B\u3001•\u5145\u96FB\u3092•\u6700\u9069\u5316\u3057\u307E\u3059•</data> 19592e5b6d6dSopenharmony_ci#開発者サービス -> 開発者▁サービス 19602e5b6d6dSopenharmony_ci<data>•\u958B\u767A\u8005•\u30B5\u30FC\u30D3\u30B9•</data> 19612e5b6d6dSopenharmony_ci#長さを選択して感度を調整し、電源ボタンを長押しします。 -> 長さを▁選択して▁感度を▁調整し、▁電源▁ボタンを▁長押しします。 19622e5b6d6dSopenharmony_ci<data>•\u9577\u3055\u3092•\u9078\u629E\u3057\u3066•\u611F\u5EA6\u3092•\u8ABF\u6574\u3057\u3001•\u96FB\u6E90•\u30DC\u30BF\u30F3\u3092•\u9577\u62BC\u3057\u3057\u307E\u3059\u3002•</data> 19632e5b6d6dSopenharmony_ci#データのコピー、スマートフォンでのお支払いなど -> データの▁コピー、▁スマートフォンでの▁お支払いなど 19642e5b6d6dSopenharmony_ci<data>•\u30C7\u30FC\u30BF\u306E•\u30B3\u30D4\u30FC\u3001•\u30B9\u30DE\u30FC\u30C8\u30D5\u30A9\u30F3\u3067\u306E•\u304A\u652F\u6255\u3044\u306A\u3069•</data> 19652e5b6d6dSopenharmony_ci#データ使用量を節約します -> データ▁使用量を▁節約します 19662e5b6d6dSopenharmony_ci<data>•\u30C7\u30FC\u30BF•\u4F7F\u7528\u91CF\u3092•\u7BC0\u7D04\u3057\u307E\u3059•</data> 19672e5b6d6dSopenharmony_ci#スマートフォンで設定できる最低限の明るさよりも画面を暗くします -> スマートフォンで▁設定▁できる▁最低限の▁明るさよりも▁画面を▁暗くします 19682e5b6d6dSopenharmony_ci<data>•\u30B9\u30DE\u30FC\u30C8\u30D5\u30A9\u30F3\u3067•\u8A2D\u5B9A•\u3067\u304D\u308B•\u6700\u4F4E\u9650\u306E•\u660E\u308B\u3055\u3088\u308A\u3082•\u753B\u9762\u3092•\u6697\u304F\u3057\u307E\u3059•</data> 19692e5b6d6dSopenharmony_ci#寝転んでいても -> 寝転んでいても 19702e5b6d6dSopenharmony_ci<data>•\u5BDD\u8EE2\u3093\u3067\u3044\u3066\u3082•</data> 19712e5b6d6dSopenharmony_ci#周囲の状況にもご注意ください -> 周囲の▁状況にも▁ご注意▁くだ▁さい 19722e5b6d6dSopenharmony_ci<data>•\u5468\u56F2\u306E•\u72B6\u6CC1\u306B\u3082•\u3054\u6CE8\u610F•\u304F\u3060•\u3055\u3044•</data> 19732e5b6d6dSopenharmony_ci#国や地域によってはご利用いただけない場合があります -> 国や▁地域によっては▁ご利用いただけない▁場合が▁あります 19742e5b6d6dSopenharmony_ci<data>•\u56FD\u3084•\u5730\u57DF\u306B\u3088\u3063\u3066\u306F•\u3054\u5229\u7528\u3044\u305F\u3060\u3051\u306A\u3044•\u5834\u5408\u304C•\u3042\u308A\u307E\u3059•</data> 19752e5b6d6dSopenharmony_ci#前回停止した場所からお楽しみいただけます -> 前回▁停止した▁場所から▁お楽しみいただけます 19762e5b6d6dSopenharmony_ci<data>•\u524D\u56DE•\u505C\u6B62\u3057\u305F•\u5834\u6240\u304B\u3089•\u304A\u697D\u3057\u307F\u3044\u305F\u3060\u3051\u307E\u3059•</data> 19772e5b6d6dSopenharmony_ci#聞き逃したくない音をスマートフォンで常時検出できます -> 聞き逃したくない▁音を▁スマートフォンで▁常時▁検出▁できます 19782e5b6d6dSopenharmony_ci<data>•\u805E\u304D\u9003\u3057\u305F\u304F\u306A\u3044•\u97F3\u3092•\u30B9\u30DE\u30FC\u30C8\u30D5\u30A9\u30F3\u3067•\u5E38\u6642•\u691C\u51FA•\u3067\u304D\u307E\u3059•</data> 19792e5b6d6dSopenharmony_ci#メニューのボタンを大きくする -> メニューの▁ボタンを▁大きく▁する 19802e5b6d6dSopenharmony_ci<data>•\u30E1\u30CB\u30E5\u30FC\u306E•\u30DC\u30BF\u30F3\u3092•\u5927\u304D\u304F•\u3059\u308B•</data> 19812e5b6d6dSopenharmony_ci#これらの連絡先はデバイスをロック解除しなくても表示され -> これらの▁連絡先は▁デバイスを▁ロック▁解除しなくても▁表示され 19822e5b6d6dSopenharmony_ci<data>•\u3053\u308C\u3089\u306E•\u9023\u7D61\u5148\u306F•\u30C7\u30D0\u30A4\u30B9\u3092•\u30ED\u30C3\u30AF•\u89E3\u9664\u3057\u306A\u304F\u3066\u3082•\u8868\u793A\u3055\u308C•</data> 19832e5b6d6dSopenharmony_ci 19842e5b6d6dSopenharmony_ci#################################################################################### 19852e5b6d6dSopenharmony_ci# 19862e5b6d6dSopenharmony_ci# Test rule status values 19872e5b6d6dSopenharmony_ci# 19882e5b6d6dSopenharmony_ci#################################################################################### 19892e5b6d6dSopenharmony_ci<rules> $Letters = [:L:]; 19902e5b6d6dSopenharmony_ci $Numbers = [:N:]; 19912e5b6d6dSopenharmony_ci $Letters+{1}; 19922e5b6d6dSopenharmony_ci $Numbers+{2}; 19932e5b6d6dSopenharmony_ci Help\ me\!{4}; 19942e5b6d6dSopenharmony_ci [^$Letters $Numbers]; 19952e5b6d6dSopenharmony_ci !.*; 19962e5b6d6dSopenharmony_ci</rules> 19972e5b6d6dSopenharmony_ci<data>•abc<1>123<2>.•.•abc<1> •Help<1> •me<1> •Help me!<4></data> 19982e5b6d6dSopenharmony_ci 19992e5b6d6dSopenharmony_ci# Test option to prohibit unquoted literals. 20002e5b6d6dSopenharmony_ci 20012e5b6d6dSopenharmony_ci<rules> 20022e5b6d6dSopenharmony_ci!!forward; 20032e5b6d6dSopenharmony_ci Hello\ World; 20042e5b6d6dSopenharmony_ci!!safe_reverse; 20052e5b6d6dSopenharmony_ci .*; 20062e5b6d6dSopenharmony_ci</rules> 20072e5b6d6dSopenharmony_ci<data>•Hello World•</data> 20082e5b6d6dSopenharmony_ci 20092e5b6d6dSopenharmony_ci<badrules> 20102e5b6d6dSopenharmony_ci!!quoted_literals_only; 20112e5b6d6dSopenharmony_ci!!forward; 20122e5b6d6dSopenharmony_ci Hello\ World; 20132e5b6d6dSopenharmony_ci!!safe_reverse; 20142e5b6d6dSopenharmony_ci .*; 20152e5b6d6dSopenharmony_ci</badrules> 20162e5b6d6dSopenharmony_ci 20172e5b6d6dSopenharmony_ci<rules> 20182e5b6d6dSopenharmony_ci!!quoted_literals_only; 20192e5b6d6dSopenharmony_ci!!forward; 20202e5b6d6dSopenharmony_ci 'Hello World'; 20212e5b6d6dSopenharmony_ci!!safe_reverse; 20222e5b6d6dSopenharmony_ci .*; 20232e5b6d6dSopenharmony_ci</rules> 20242e5b6d6dSopenharmony_ci<data>•Hello World•</data> 20252e5b6d6dSopenharmony_ci 20262e5b6d6dSopenharmony_ci 20272e5b6d6dSopenharmony_ci# Test for circular buffer overflow during reverse iteration with inefficient reverse rules, 20282e5b6d6dSopenharmony_ci# Too many boundaries between safe back up position and current position. 20292e5b6d6dSopenharmony_ci 20302e5b6d6dSopenharmony_ci<rules> 20312e5b6d6dSopenharmony_ci!!forward; 20322e5b6d6dSopenharmony_ci.; 20332e5b6d6dSopenharmony_ci!!safe_reverse; 20342e5b6d6dSopenharmony_ci.*; 20352e5b6d6dSopenharmony_ci</rules> 20362e5b6d6dSopenharmony_ci<data>•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•a•</data> 20372e5b6d6dSopenharmony_ci 20382e5b6d6dSopenharmony_ci# 20392e5b6d6dSopenharmony_ci# Dictionary regression check 20402e5b6d6dSopenharmony_ci# Intended to find unexpected behavior changes when changing dictionary implementation code, 20412e5b6d6dSopenharmony_ci# but may also be fragile, failing with intended improvements to dictionary breaking. 20422e5b6d6dSopenharmony_ci# 20432e5b6d6dSopenharmony_ci<locale en> 20442e5b6d6dSopenharmony_ci<word> 20452e5b6d6dSopenharmony_ci<data>•Unicode<200> •คือ<200>อะไร<200>?• 20462e5b6d6dSopenharmony_ci•Unicode<200> •กำหนด<200>หมายเลข<200>เฉพาะ<200>สำหรับ<200>ทุก<200>อักขระ<200> 20472e5b6d6dSopenharmony_ci•โดย<200>ไม่<200>สนใจ<200>ว่า<200>เป็น<200>แพ<200>ล็ต<200>ฟอร์ม<200>ใด<200> 20482e5b6d6dSopenharmony_ci•ไม่<200>ขึ้น<200>กับ<200>ว่า<200>จะ<200>เป็น<200>โปรแกรม<200>ใด<200> 20492e5b6d6dSopenharmony_ci•และ<200>ไม่<200>ว่า<200>จะ<200>เป็น<200>ภาษา<200>ใด<200> 20502e5b6d6dSopenharmony_ci• 20512e5b6d6dSopenharmony_ci•โดย<200>พื้น<200>ฐาน<200>แล้ว<200>,• •คอมพิวเตอร์<200>จะ<200>เกี่ยวข้อง<200>กับ<200>เรื่อง<200>ของ<200>ตัวเลข<200>.• •คอมพิวเตอร์<200>จัด<200>เก็บ<200>ตัว<200>อักษร<200>และ<200>อักข<200>ระ<200>อื่นๆ<200> •โดย<200>การ<200>กำหนด<200>หมายเลข<200>ให้<200>สำหรับ<200>แต่ละ<200>ตัว<200>.• •ก่อน<200>หน้า<200>ที่๊<200> •Unicode<200> •จะ<200>ถูก<200>สร้าง<200>ขึ้น<200>,• •ได้<200>มี<200>ระบบ<200> •encoding<200> •อยู่<200>หลาย<200>ร้อย<200>ระบบ<200>สำหรับ<200>การ<200>กำหนด<200>หมายเลข<200>เหล่า<200>นี้<200>.• •ไม่มี<200> •encoding<200> •ใด<200>ที่<200>มี<200>จำนวน<200>ตัว<200>อักขระ<200>มาก<200>เพียง<200>พอ<200>:• •ยก<200>ตัวอย่าง<200>เช่น<200>,• •เฉพาะ<200>ใน<200>กลุ่ม<200>สหภาพ<200>ยุโรป<200>เพียง<200>แห่ง<200>เดียว<200> •ก็<200>ต้องการ<200>หลาย<200> •encoding<200> •ใน<200>การ<200>ครอบคลุม<200>ทุก<200>ภาษา<200>ใน<200>กลุ่ม<200>.• •หรือ<200>แม้แต่<200>ใน<200>ภาษา<200>เดี่ยว<200> •เช่น<200> •ภาษา<200>อังกฤษ<200> •ก็<200>ไม่มี<200> •encoding<200> •ใด<200>ที่<200>เพียง<200>พอ<200>สำหรับ<200>ทุก<200>ตัว<200>อักษร<200>,• •เครื่องหมาย<200>วรรค<200>ตอน<200> •และ<200>สัญลักษณ์<200>ทาง<200>เทคนิค<200>ที่<200>ใช้<200>กัน<200>อยู่<200>ทั่วไป<200>.• 20522e5b6d6dSopenharmony_ci• 20532e5b6d6dSopenharmony_ci•ระบบ<200> •encoding<200> •เหล่า<200>นี้<200>ยัง<200>ขัด<200>แย้ง<200>ซึ่ง<200>กัน<200>และ<200>กัน<200>.• •นั่น<200>ก็<200>คือ<200>,• •ใน<200>สอง<200> •encoding<200> •สามารถ<200>ใช้<200>หมายเลข<200>เดียวกัน<200>สำหรับ<200>ตัว<200>อักขระ<200>สอง<200>ตัว<200>ที่<200>แตก<200>ต่าง<200>กัน<200>,•หรือ<200>ใช้<200>หมายเลข<200>ต่าง<200>กัน<200>สำหรับ<200>อักขระ<200>ตัว<200>เดียวกัน<200>.• •ใน<200>ระบบ<200>คอมพิวเตอร์<200> •(•โดย<200>เฉพาะ<200>เซิร์ฟเวอร์<200>)• •ต้อง<200>มี<200>การ<200>สนับสนุน<200>หลาย<200> •encoding<200>;• •และ<200>เมื่อ<200>ข้อมูล<200>ที่<200>ผ่าน<200>ไป<200>มาระ<200>หว่าง<200>การ<200>เข้า<200>รหัส<200>หรือ<200>แพ<200>ล็ต<200>ฟอร์ม<200>ที่<200>ต่าง<200>กัน<200>,• •ข้อมูล<200>นั้น<200>จะ<200>เสี่ยง<200>ต่อ<200>การ<200>ผิด<200>พลาด<200>เสีย<200>หาย<200>.• 20542e5b6d6dSopenharmony_ci• 20552e5b6d6dSopenharmony_ci•Unicode<200> •จะ<200>เปลี่ยนแปลง<200>สิ่ง<200>เหล่า<200>นั้น<200>ทั้งหมด<200>!• 20562e5b6d6dSopenharmony_ci• 20572e5b6d6dSopenharmony_ci•Unicode<200> •กำหนด<200>หมายเลข<200>เฉพาะ<200>สำหรับ<200>แต่ละ<200>อักขระ<200>,• •โดย<200>ไม่<200>สนใจ<200>ว่า<200>เป็น<200>แพ<200>ล็ต<200>ฟอร์ม<200>ใด<200>,• •ไม่<200>ขึ้น<200>กับ<200>ว่า<200>จะ<200>เป็น<200>โปรแกรม<200>ใด<200>และ<200>ไม่<200>ว่า<200>จะ<200>เป็น<200>ภาษา<200>ใด<200>.• •มาตรฐาน<200> •Unicode<200> •ได้<200>ถูก<200>นำ<200>ไป<200>ใช้<200>โดย<200>ผู้นำ<200>ใน<200>อุตสาหกรรม<200> •เช่น<200> •Apple<200>,• •HP<200>,• •IBM<200>,• •JustSystem<200>,• •Microsoft<200>,• •Oracle<200>,• •SAP<200>,• •Sun<200>,• •Sybase<200>,• •Unisys<200> •และ<200>อื่นๆ<200> •อีก<200>มาก<200>.• •Unicode<200> •เป็น<200>สิ่ง<200>ที่<200>จำเป็น<200>สำหรับ<200>มาตร<200>ฐาน<200>ใหม่ๆ<200> •เช่น<200> •XML<200>,• •Java<200>,• •ECMAScript<200> •(•JavaScript<200>)•,• •LDAP<200>,• •CORBA<200> •3.0<100>,• •WML<200> •ฯลฯ<200>.•,• •และ<200>เป็น<200>แนวทาง<200>อย่าง<200>เป็น<200>ทางการ<200>ใน<200>การ<200>ทำ<200> •ISO<200>/•IEC<200> •10646<100>.• •Unicode<200> •ได้<200>รับ<200>การ<200>สนับสนุน<200>ใน<200>ระบบ<200>ปฏิบัติ<200>การ<200>จำนวน<200>มาก<200>,• •บราวเซอร์<200>ใหม่ๆ<200> •ทก<200>ตัว<200>,• •และ<200>ผลิต<200>ภัณฑ์<200>อื่นๆ<200> •อีก<200>มาก<200>.• •การ<200>เกิด<200>ขึ้น<200>ของ<200> •Unicode<200> •Standard<200> •และ<200>ทูล<200>ส์<200>ต่างๆ<200> •ที่<200>มี<200>ใน<200>การ<200>สนับสนุน<200> •Unicode<200>,• •เป็น<200>หนึ่ง<200>ใน<200>แนว<200>โน้ม<200>ทาง<200>เทคโนโลยี<200>ซอฟต์แวร์<200>ระดับ<200>โลก<200>ที่<200>มี<200>ความ<200>สำคัญ<200>ที่สุด<200>.• 20582e5b6d6dSopenharmony_ci• 20592e5b6d6dSopenharmony_ci•การ<200>รวม<200> •Unicode<200> •เข้าไป<200>ใน<200>ระบบ<200>ไคลเอ็นต์<200>-•เซิร์ฟเวอร์<200> •หรือ<200>แอ็พ<200>พลิ<200>เค<200>ชัน<200>แบบ<200> •multi<200>-•tiered<200> •และ<200>เว็บไซต์<200> •จะ<200>ทำให้<200>เกิด<200>การ<200>ประหยัด<200>ค่า<200>ใช้<200>จ่าย<200>มากกว่า<200>การ<200>ใช้<200>ชุด<200>อักขระ<200>แบบ<200>เดิม<200>.• •Unicode<200> •ทำให้<200>ผลิตภัณฑ์<200>ซอฟต์แวร์<200>หนึ่ง<200>เดียว<200> •หรือ<200>เว็บไซต์<200>แห่ง<200>เดียว<200> •รองรับ<200>ได้<200>หลาย<200>แพ<200>ล็ต<200>ฟอร์ม<200>,• •หลาย<200>ภาษา<200>และ<200>หลาย<200>ประเทศ<200>โดย<200>ไม่<200>ต้อง<200>ทำการ<200>รื้อ<200>ปรับ<200>ระบบ<200>.• •Unicode<200> •ยัง<200>ทำให้<200>ข้อมูล<200>สามารถ<200>เคลื่อน<200>ย้าย<200>ไป<200>มา<200>ใน<200>หลายๆ<200> •ระบบ<200>โดย<200>ไม่<200>เกิด<200>ความ<200>ผิด<200>พลาด<200>เสีย<200>หาย<200>.• 20602e5b6d6dSopenharmony_ci• 20612e5b6d6dSopenharmony_ci•เกี่ยว<200>กับ<200> •Unicode<200> •Consortium<200> 20622e5b6d6dSopenharmony_ci• 20632e5b6d6dSopenharmony_ci•Unicode<200> •Consortium<200> •เป็น<200>องค์กร<200>ไม่<200>แสวงหา<200>กำไร<200>ที่<200>ก่อ<200>ตั้ง<200>ขึ้น<200>เพื่อ<200>พัฒนา<200>,• •ขยาย<200>และ<200>ส่ง<200>เสริม<200>การ<200>ใช้<200> •Unicode<200> •Standard<200>,• •ซึ่ง<200>กำหนด<200>รูป<200>แบบ<200>การ<200>แทน<200>ค่า<200>ของ<200>ข้อความ<200>ใน<200>ผลิตภัณฑ์<200>ซอฟต์แวร์<200>และ<200>มาตร<200>ฐาน<200>ใหม่ๆ<200>.• •สมาชิก<200>ของ<200>สมาคม<200>เป็น<200>ตัวแทน<200>จาก<200>บริษัท<200>และ<200>องค์กร<200>ใน<200>อุตสาหกรรม<200>คอมพิวเตอร์<200>และ<200>การ<200>ประมวล<200>ผล<200>สารสนเทศ<200>.• •สมาคม<200>ได้<200>รับ<200>การ<200>สนับสนุน<200>ทางการ<200>เงิน<200>ผ่าน<200>ทาง<200>ค่า<200>ธรรมเนียม<200>ของ<200>การ<200>เป็น<200>สมาชิก<200>เท่านั้น<200>.• •สมาชิก<200>ภาพ<200>ของ<200> •Unicode<200> •Consortium<200> •เปิด<200>กว้าง<200>สำหรับ<200>องค์กร<200>หรือ<200>บุคคล<200>ใดๆ<200> •ใน<200>โลก<200>ที่<200>ต้องการ<200>สนับสนุน<200> •Unicode<200> •Standard<200> •และ<200>ช่วย<200>เหลือ<200>การ<200>ขยาย<200>ตัว<200>และ<200>การนำ<200> •Unicode<200> •ไป<200>ใช้<200>งาน<200>.• 20642e5b6d6dSopenharmony_ci• 20652e5b6d6dSopenharmony_ci•สำหรับ<200>ข้อมูล<200>เพิ่ม<200>เติม<200>,• •ให้<200>ดู<200>ที่<200> •Glossary<200>,• •Sample<200> •Unicode<200>-•Enabled<200> •Products<200>,• •Technical<200> •Introduction<200> •และ<200> •Useful<200> •Resources<200>.•</data> 20662e5b6d6dSopenharmony_ci 20672e5b6d6dSopenharmony_ci<word> 20682e5b6d6dSopenharmony_ci# Burmese 20692e5b6d6dSopenharmony_ci<data>•အ<200>လော<200>င္<200>မ<200>င္<200>တရား<200> 20702e5b6d6dSopenharmony_ci• •မဟာ<200>ဓမ္မရာဇာ<200>မိ<200>ပတိ<200>လ<200>က္<200>ထ<200>က္<200>တ္<200>ဝ<200>င္<200> •အ<200>င္<200>ဝ<200>နေ<200>ပ္<200>ရ<200>ည္<200>တော္<200>က္<200>ရီး<200>သ<200>ည္<200> •မ<200>င္<200>ရိ<200>မ္<200>မ<200>သ<200>က္<200>ဖ္<200>ရ<200>စ္<200>နေ<200>သ<200>ည္<200>။• •မဏိ<200>ပူ<200>ရ<200> •က<200>သ<200>ည္<200>မ္<200>ယား<200>က<200> •အ<200>င္<200>ဝ<200>နေ<200>ပ္<200>ရ<200>ည္<200>တော္<200>၏• •မ္<200>ရော<200>က္<200>ဘ<200>က္<200>တ<200>လ္<200>ဝ္<200>ဟား<200>ကုိ<200> •တုိ<200>က္<200>ခုိ<200>က္<200>ဖ္<200>ယ<200>က္<200>ဆီး<200>သ<200>ည္<200>။• •အော<200>က္<200>မ္<200>ရ<200>န္<200>မာ<200>နုိ<200>င္<200>ငံ<200> •ဟံ<200>သာ<200>ဝ<200>တီ<200>သား<200>တုိ့<200>က<200>လ<200>ည္<200> •ပု<200>န္<200>က<200>န္<200>သ<200>ည္<200>။• •မတ္တ<200>ရာ<200>အု<200>တ္<200>ဖုိ<200>ရ္<200>ဟိ<200> •က္<200>ဝေ့<200>ရ္<200>ဟ<200>မ္<200>မ္<200>ယား<200>က<200>လ<200>ည္<200> •ထ<200>က္<200>ရ္<200>ဝ<200>သ<200>ည္<200>။• 20712e5b6d6dSopenharmony_ci• 20722e5b6d6dSopenharmony_ci• •ထုိ<200>အ<200>ခ္<200>ယိ<200>န္<200>တ္<200>ဝ<200>င္<200> •မု<200>ဆုိး<200>ဖုိ<200>ရ္<200>ဝာ<200>သူ<200>က္<200>ရီး<200> •အော<200>င္<200>ဇေ<200>ယ္<200>ယ<200>သ<200>ည္<200> •မိမိ<200>၏•ရ္<200>ဝာ<200>ကုိ<200> •လုံ<200>ခ္<200>ရုံ<200>အော<200>င္<200>ထ<200>န္<200>လုံး<200>တ<200>ပ္<200>မ္<200>ယား<200>ကာ<200>ရ<200>သ<200>ည္<200>။• •အနီး<200>အ<200>ပား<200> •က္<200>ယေး<200>ရ္<200>ဝာ<200> •လေး<200>ဆ<200>ယ့္<200>ခ္<200>ရော<200>က္<200>ရ္<200>ဝာ<200>ကုိ<200> •သိ<200>မ္း<200>သ္<200>ဝ<200>င္<200>ထား<200>သ<200>ည္<200>။• •မ<200>က္<200>ရာ<200>မီ<200>ပ<200>င္<200> •အ<200>င္<200>ဝ<200>နေ<200>ပ္<200>ရ<200>ည္<200>တော္<200>က္<200>ရီး<200>သ<200>ည္<200> •ဟံ<200>သာ<200>ဝ<200>တီ<200>တ<200>ပ္<200>မ္<200>ယား<200> •လ<200>က္<200>တ္<200>ဝ<200>င္<200>သ<200>က္<200>ဆ<200>င္<200>ရ<200>တော့<200>သ<200>ည္<200>။• 20732e5b6d6dSopenharmony_ci• 20742e5b6d6dSopenharmony_ci• •အ<200>င္<200>ဝ<200>ကုိ<200> •သိ<200>မ္<200>ပုိ<200>က္<200>ပ္<200>ရီး<200>သော<200> •ဟံ<200>သာ<200>ဝ<200>တီ<200>တ<200>ပ္<200>မ္<200>ယား<200>သ<200>ည္<200> •မ္<200>ရော<200>က္<200>ဘ<200>က္<200>တ<200>လ္<200>ဝ္<200>ဟား<200>က္<200>ယေး<200>ရ္<200>ဝာ<200>မ္<200>ယား<200>ကုိ<200> •သစ္စာ<200>ခံ<200>ခုိ<200>င္<200>ရ<200>န္<200> •လာ<200>က္<200>ရ<200>ရာ<200> •မု<200>ဆုိး<200>ဖုိ<200>ရ္<200>ဝာ<200>သုိ့<200> •ရော<200>က္<200>ရ္<200>ဟိ<200>လာ<200>သ<200>ည္<200>။• •တ<200>ခ္<200>ယိ<200>န္<200>တ<200>ည္<200>မ္<200>ဟာ<200>ပ<200>င္<200> •က္<200>ဝေ့<200>ရ္<200>ဟ<200>မ္<200>မ္<200>ယား<200>က<200>လ<200>ည္<200> •သစ္စာ<200>ခံ<200>ခုိ<200>င္<200>ရ<200>န္<200> •ရော<200>က္<200>ရ္<200>ဟိ<200>လာ<200>သ<200>ည္<200>။• •ဦး<200>အော<200>င္<200>ဇေ<200>ယ္<200>ယ<200>သ<200>ည္<200> •အ<200>ဖ္<200>ဝဲ့<200>န္<200>ဟ<200>စ္<200>ဖ္<200>ဝဲ့<200>ကုိ<200> •ခ္<200>ရေ<200>ငံ<200>စ္<200>ဝာ<200> •ဆ<200>က္<200>ဆံ<200>သ<200>ည္<200>။• •မ<200>ည္<200>သူ့<200>သ<200>စ္<200>စာ<200>ကုိ<200>မ္<200>ယ္<200>ဟ<200> •ခံ<200>ယူ<200>ခ္<200>ရ<200>င္<200>မ<200>ပ္<200>ရု<200>ပေ<200>။• •ဟံ<200>သာ<200>ဝ<200>တီ<200>တ<200>ပ္<200>ဖ္<200>ဝဲ့<200>ကုိ<200> •အ<200>ပ္<200>ရ<200>န္<200>ခ<200>ရီး<200>တ္<200>ဝ<200>င္<200> •လ<200>မ္<200>မ္<200>ဟ<200>ဖ္<200>ရ<200>တ္<200>၍• •တုိ<200>က္<200>ခုိ<200>က္<200>သ<200>ည္<200>။• •ဟံ<200>သာ<200>ဝ<200>တီ<200>တ<200>ပ္<200>ဖ္<200>ဝဲ့<200>မ္<200>ယား<200> •အထိ<200>အ<200>ခုိ<200>က္<200>အ<200>က္<200>ယ<200>အ<200>ဆုံး<200>မ္<200>ယား<200>စ္<200>ဝာ<200>ဖ္<200>ရ<200>င္<200> •ပ္<200>ရ<200>န္<200>ရ<200>သ<200>ည္<200>။• 20752e5b6d6dSopenharmony_ci• 20762e5b6d6dSopenharmony_ci• •ဟံ<200>သာ<200>ဝ<200>တီ<200>တ<200>ပ္<200>မ္<200>ယား<200>သ<200>ည္<200> •မု<200>ဆုိး<200>ဖုိ<200>ရ္<200>ဝာ<200>ကုိ<200> •လာ<200>ရော<200>က္<200>တုိ<200>က္<200>ခုိ<200>က္<200>က္<200>ရ<200>ပ္<200>ရ<200>န္<200>သ<200>ည္<200>။• •ဦး<200>အော<200>င္<200>ဇေ<200>ယ္<200>ယ<200>သ<200>ည္<200> •သ္<200>ဝေး<200>သော<200>က္<200>ရဲ<200>ဘော္<200> •ခ္<200>ရော<200>က္<200>က္<200>ယိ<200>ပ္<200>ရ္<200>ဟ<200>စ္<200>ယော<200>က္<200>န္<200>ဟ<200>င္<200>အတူ<200> •ဦးစီး<200>ကာ<200>အော<200>င္<200>မ္<200>ရ<200>င္<200>စ္<200>ဝာ<200>ခု<200>ခံ<200>တ္<200>ဝ<200>န္<200>လ္<200>ဟ<200>န္<200>နုိ<200>င္<200>ခဲ့<200>သ<200>ည္<200>။• •ထုိ့<200>နော<200>က္<200> •ဦး<200>အော<200>င္<200>ဇေ<200>ယ္<200>ယ<200>သ<200>ည္<200> •မ္<200>ရော<200>က္<200>ဘ<200>က္<200>တ<200>လ္<200>ဝ္<200>ဟား<200>ရ္<200>ဟိ<200> •ရ္<200>ဟ<200>မ္<200>မ္<200>ယား<200>န္<200>ဟ<200>င္<200> •မ္<200>ရ<200>န္<200>မာ<200>မ္<200>ယား<200>ကုိ<200>လ<200>ည္<200> •ဆ<200>က္<200>သ္<200>ဝ<200>ယ္<200>စ<200>ည္<200>ရုံး<200>နုိ<200>င္<200>ခဲ့<200>သ<200>ည္<200>။• •ဤ<200>သုိ့<200>ဖ္<200>ရ<200>င္<200> •ဦး<200>အော<200>င္<200>ဇေ<200>ယ္<200>ယ<200>၏• •အ<200>ရ္<200>ဟိ<200>န္<200>အ<200>ဝာ<200> •မ္<200>ရ<200>င္<200>မား<200>လာ<200>လေ<200>သ<200>ည္<200>။• 20772e5b6d6dSopenharmony_ci• 20782e5b6d6dSopenharmony_ci• •ဦး<200>အော<200>င္<200>ဇေ<200>ယ္<200>ယ<200>သ<200>ည္<200> •အ<200>လော<200>င္<200>မ<200>င္<200>တ<200>ရား<200>ဘ္<200>ဝဲ့<200>ကုိ<200> •ခံယူ<200>ကာ<200> •ကု<200>န္<200>ဘော<200>င္<200>မ<200>င္<200>ဆ<200>က္<200>ကုိ<200>စ<200>တ<200>င္<200>တ<200>ည္<200>ထော<200>င္<200>သ<200>ည္<200>။• •မု<200>ဆုိး<200>ဖုိ<200>ရ္<200>ဝာ<200>ကုိ<200> •ရ္<200>ဝ္<200>ဟ<200>ဝေ<200>ဘုိ<200>ဟု<200> •သ<200>မု<200>တ္<200>ကာ<200> •မ္<200>ရုိ့<200>န<200>န္<200>တ<200>ည္<200>သ<200>ည္<200>။• •န<200>န္<200>တ<200>ည္<200>သ<200>က္<200>က<200>ရာ<200>ဇ္<200>ဖ္<200>ရ<200>စ္<200>သော<200> •၁၁၁၅<100> •ခု<200>ကုိ<200> •ဥ<200>ဩ<200>အော္<200>မ္<200>ရ<200>ည္<200> •ကု<200>န္<200>ဘော<200>င္<200>တ<200>ည္<200>ဟု<200> •အ<200>မ္<200>ဟ<200>တ္<200>အ<200>သား<200>ပ္<200>ရု<200>က္<200>ရ<200>သ<200>ည္<200>။• 20792e5b6d6dSopenharmony_ci• 20802e5b6d6dSopenharmony_ci• •အ<200>လော<200>င္<200>မ<200>င္<200>တရား<200>သ<200>ည္<200> •ဧရာ<200>ဝ<200>တီ<200>န္<200>ဟ<200>င္<200>ခ္<200>ယ<200>င္<200>တ္<200>ဝ<200>င္<200> •မ္<200>ရ<200>စ္<200>န္<200>ဟ<200>စ္<200>သ္<200>ဝ<200>ယ္<200>အ<200>က္<200>ရား<200> •ဒေ<200>သ<200>မ္<200>ယား<200>ကုိ<200>အ<200>ခုိ<200>င္<200>အ<200>မာ<200> •စု<200>စ<200>ည္<200>ပ္<200>ရီး<200>နော<200>က္<200> •အ<200>င္<200>ဝ<200>ကုိ<200> •တုိ<200>က္<200>ခုိ<200>က္<200>အော<200>င္<200>မ္<200>ရ<200>င္<200>သ<200>ည္<200>။• •ထုိ<200>နော<200>က္<200>တ္<200>ဝ<200>င္<200>ပ္<200>ရ<200>ည္<200>၊• •လ္<200>ဝ<200>န္<200>ဆေး<200>၊• •ဒ<200>ဂုံ<200>မ္<200>ရုိ့<200>မ္<200>ယား<200>ကုိ<200> •သိ<200>မ္<200>ပုိ<200>က္<200>သ<200>ည္<200>။• •လ္<200>ဝ<200>န္<200>ဆေး<200> •ကုိ<200>မ္<200>ရ<200>န္<200>အော<200>င္<200>ဟူ<200>၍• •သ<200>မု<200>တ္<200>သ<200>ည္<200>။• •ဒ<200>ဂုံ<200>ကုိ<200>ရ<200>န္<200>ကု<200>န္<200>ဟူ<200>၍• •သ<200>မု<200>တ္<200>ထ<200>သ<200>ည္<200>။•</data> 20812e5b6d6dSopenharmony_ci 20822e5b6d6dSopenharmony_ci<word> 20832e5b6d6dSopenharmony_ci# japanese 20842e5b6d6dSopenharmony_ci<data>•ユニ<400>コード<400>と<400>は<400>何<400>か<400>?• 20852e5b6d6dSopenharmony_ci•ユニ<400>コード<400>は<400>、•すべて<400>の<400>文字<400>に<400>固有<400>の<400>番号<400>を<400>付与<400>し<400>ます<400> 20862e5b6d6dSopenharmony_ci•プラットフォーム<400>に<400>は<400>依存<400>しま<400>せん<400> 20872e5b6d6dSopenharmony_ci•プログラム<400>に<400>も<400>依存<400>しま<400>せん<400> 20882e5b6d6dSopenharmony_ci•言語<400>に<400>も<400>依存<400>しま<400>せん<400> 20892e5b6d6dSopenharmony_ci• 20902e5b6d6dSopenharmony_ci•コンピューター<400>は<400>、•本質<400>的<400>に<400>は<400>数字<400>しか<400>扱う<400>こと<400>が<400>でき<400>ま<400>せん<400>。•コンピューター<400>は<400>、•文字<400>や<400>記号<400>など<400>の<400>それぞれに<400>番号<400>を<400>割り振る<400>こと<400>によって<400>扱える<400>よう<400>にし<400>ます<400>。•ユニ<400>コード<400>が<400>出来る<400>まで<400>は<400>、•これらの<400>番号<400>を<400>割り振る<400>仕組み<400>が<400>何<400>百<400>種類<400>も<400>存在<400>しま<400>した<400>。•どの<400>一つ<400>を<400>とっても<400>、•十分<400>な<400>文字<400>を<400>含<400>んで<400>は<400>いま<400>せん<400>で<400>した<400>。•例えば<400>、•欧州<400>連合<400>一つ<400>を<400>見<400>て<400>も<400>、•その<400>すべて<400>の<400>言語<400>を<400>カバー<400>する<400>ため<400>に<400>は<400>、•いくつか<400>の<400>異なる<400>符号<400>化<400>の<400>仕組み<400>が<400>必要<400>で<400>した<400>。•英語<400>の<400>よう<400>な<400>一つ<400>の<400>言語<400>に<400>限<400>って<400>も<400>、•一つ<400>だけ<400>の<400>符号<400>化<400>の<400>仕組み<400>では<400>、•一般<400>的<400>に<400>使<400>われる<400>すべて<400>の<400>文字<400>、•句読点<400>、•技術<400>的<400>な<400>記号<400>など<400>を<400>扱う<400>に<400>は<400>不十分<400>で<400>した<400>。• 20912e5b6d6dSopenharmony_ci• 20922e5b6d6dSopenharmony_ci•これらの<400>符号<400>化<400>の<400>仕組み<400>は<400>、•相互<400>に<400>矛盾<400>する<400>もの<400>でも<400>ありま<400>した<400>。•二つ<400>の<400>異なる<400>符号<400>化<400>の<400>仕組み<400>が<400>、•二つ<400>の<400>異なる<400>文字<400>に<400>同一<400>の<400>番号<400>を<400>付ける<400>こと<400>も<400>できる<400>し<400>、•同じ<400>文字<400>に<400>異なる<400>番号<400>を<400>付ける<400>こと<400>も<400>できる<400>の<400>です<400>。•どの<400>よう<400>な<400>コンピューター<400>も<400>(•特に<400>サーバー<400>は<400>)•多く<400>の<400>異<400>な<400>っ<400>た<400>符号<400>化<400>の<400>仕組み<400>を<400>サポート<400>する<400>必要<400>が<400>あり<400>ます<400>。•たとえ<400>データ<400>が<400>異なる<400>符号<400>化<400>の<400>仕組み<400>や<400>プラットフォーム<400>を<400>通過<400>し<400>て<400>も<400>、•いつ<400>どこ<400>で<400>データ<400>が<400>乱れる<400>か<400>分<400>から<400>ない<400>危険<400>を<400>冒す<400>こと<400>の<400>なる<400>の<400>です<400>。• 20932e5b6d6dSopenharmony_ci• 20942e5b6d6dSopenharmony_ci•ユニ<400>コード<400>は<400>すべて<400>を<400>変<400>え<400>ます<400> 20952e5b6d6dSopenharmony_ci• 20962e5b6d6dSopenharmony_ci•ユニ<400>コード<400>は<400>、•プラットフォーム<400>に<400>係<400>わら<400>ず<400>、•プログラム<400>に<400>係<400>わら<400>ず<400>、•言語<400>に<400>係<400>わら<400>ず<400>、•すべて<400>の<400>文字<400>に<400>独立<400>した<400>番号<400>を<400>与<400>え<400>ます<400>。•ユニ<400>コード<400>標準<400>は<400>、•アップル<400>、•ヒュー<400>レット<400>パッ<400>カード<400>、•IBM<200>、•ジャスト<400>システム<400>、•マイクロ<400>ソフト<400>、•オラクル<400>、•SAP<200>、•サン<400>、•サイ<400>ベース<400>など<400>の<400>産業<400>界<400>の<400>主導<400>的<400>企業<400>と<400>他の<400>多く<400>の<400>企業<400>に<400>採用<400>さ<400>れ<400>てい<400>ます<400>。•ユニ<400>コード<400>は<400>、•XML<200>、•Java<200>、•ECMAScript<200>(•JavaScript<200>)•、•LDAP<200>、•CORBA<200> •3.0<100>など<400>の<400>最先端<400>の<400>標準<400>の<400>前提<400>と<400>な<400>って<400>おり<400>、•ユニ<400>コード<400>を<400>実装<400>す<400>れ<400>ば<400>、•ISO<200>/•IEC<200> •10646<100>に<400>適合<400>する<400>ことに<400>なり<400>ます<400>。•ユニ<400>コード<400>は<400>、•多く<400>の<400>オペレーティングシステム<400>と<400>すべて<400>の<400>最新<400>の<400>ブラウザー<400>と<400>他の<400>多く<400>の<400>製品<400>で<400>サポート<400>さ<400>れ<400>てい<400>ます<400>。•ユニ<400>コード<400>標準<400>の<400>出現<400>と<400>ユニ<400>コード<400>を<400>サポート<400>する<400>ツール<400>類<400>は<400>、•昨今<400>顕著<400>に<400>な<400>って<400>いる<400>ソフトウエア<400>技術<400>の<400>グローバル<400>化<400>の<400>流れ<400>に対して<400>、•特に<400>役<400>に<400>立<400>って<400>い<400>ます<400>。• 20972e5b6d6dSopenharmony_ci• 20982e5b6d6dSopenharmony_ci•ユニ<400>コード<400>を<400>ク<400>ライアン<400>ト<400>サーバー<400>型<400>の<400>アプリケーション<400>や<400>、•多層<400>構造<400>を<400>持つ<400>アプリケーション<400>、•ウェブサイト<400>など<400>に<400>組み込む<400>こと<400>で<400>、•従来<400>の<400>文字<400>コードセット<400>を<400>用いる<400>より<400>も<400>明らか<400>な<400>コスト<400>削減<400>が<400>可能<400>です<400>。•ユニ<400>コード<400>は<400>、•単一<400>の<400>ソフトウエア<400>製品<400>、•単一<400>の<400>ウェブサイト<400>に<400>、•何ら<400>手<400>を<400>加える<400>こと<400>なく<400>、•複数<400>の<400>プラットフォーム<400>、•複数<400>の<400>言語<400>、•複数<400>の<400>国<400>を<400>カバー<400>する<400>こと<400>が<400>出来る<400>の<400>です<400>。•ユニ<400>コード<400>は<400>、•データ<400>が<400>多く<400>の<400>異なる<400>システム<400>の<400>間<400>を<400>、•何<400>の<400>乱れ<400>も<400>なし<400>に<400>転送<400>する<400>こと<400>を<400>可能<400>と<400>する<400>の<400>です<400>。• 20992e5b6d6dSopenharmony_ci• 21002e5b6d6dSopenharmony_ci•ユニ<400>コード<400>コンソーシアム<400>について<400> 21012e5b6d6dSopenharmony_ci• 21022e5b6d6dSopenharmony_ci•ユニ<400>コード<400>コンソーシアム<400>は<400>、•最新<400>の<400>ソフトウエア<400>製品<400>と<400>標準<400>において<400>テキスト<400>を<400>表現<400>する<400>こと<400>を<400>意味<400>する<400>“•ユニ<400>コード<400>標準<400>”•の<400>構築<400>、•発展<400>、•普及<400>、•利用<400>促進<400>を<400>目的<400>として<400>設立<400>さ<400>れ<400>た<400>非<400>営利<400>組織<400>です<400>。•同<400>コンソーシアム<400>の<400>会員<400>は<400>、•コンピューター<400>と<400>情報処理<400>に<400>係わる<400>広汎<400>な<400>企業<400>や<400>組織<400>から<400>構成<400>さ<400>れ<400>てい<400>ます<400>。•同<400>コンソーシアム<400>は<400>、•財政<400>的<400>に<400>は<400>、•純粋<400>に<400>会費<400>のみ<400>によって<400>運営<400>さ<400>れ<400>てい<400>ます<400>。•ユニ<400>コード<400>標準<400>を<400>支持<400>し<400>、•その<400>拡張<400>と<400>実装<400>を<400>支援<400>する<400>世界中<400>の<400>組織<400>や<400>個人<400>は<400>、•だれ<400>も<400>が<400>ユニ<400>コード<400>コンソーシアム<400>の<400>会員<400>なる<400>こと<400>が<400>でき<400>ます<400>。• 21032e5b6d6dSopenharmony_ci• 21042e5b6d6dSopenharmony_ci•より<400>詳しい<400>こと<400>を<400>お<400>知<400>り<400>に<400>なり<400>たい<400>方<400>は<400>、•Glossary<200>,• •Technical<200> •Introduction<200> •および<400> •Useful<200> •Resources<200>を<400>ご<400>参照<400>くだ<400>さい<400>。• 21052e5b6d6dSopenharmony_ci•</data> 21062e5b6d6dSopenharmony_ci 21072e5b6d6dSopenharmony_ci 21082e5b6d6dSopenharmony_ci# 21092e5b6d6dSopenharmony_ci# Bug 20303 Multiple Look-ahead rules with similar contexts. 21102e5b6d6dSopenharmony_ci# Check that samples of such rules are being handled correctly. 21112e5b6d6dSopenharmony_ci# 21122e5b6d6dSopenharmony_ci 21132e5b6d6dSopenharmony_ci<rules> 21142e5b6d6dSopenharmony_ci!!forward; 21152e5b6d6dSopenharmony_ci!!quoted_literals_only; 21162e5b6d6dSopenharmony_ci!!chain; 21172e5b6d6dSopenharmony_ci[a] [b] / [c] [d]; 21182e5b6d6dSopenharmony_ci[a] [b] / [c] [d] {100}; 21192e5b6d6dSopenharmony_ci[a] [b] / [e] [f] {200}; 21202e5b6d6dSopenharmony_ci[a] [b] / [e] [g] {300}; 21212e5b6d6dSopenharmony_ci[a] [b] [c] [h] {400}; 21222e5b6d6dSopenharmony_ci[x] [a] [b] / [c] [d] {500}; 21232e5b6d6dSopenharmony_ci[y] [a] [b] [c] [d] {600}; 21242e5b6d6dSopenharmony_ci</rules> 21252e5b6d6dSopenharmony_ci<data>•ab<100>c•d•ab<200>e•f•ab<300>e•g•abch<400>xab<500>c•d•yabcd<600></data> 2126