.. SPDX-License-Identifier: GPL-2.0

.. include:: ../disclaimer-zh_TW.rst

:Original: Documentation/admin-guide/unicode.rst

:Translators:

 吳想成 Wu XiangCheng <bobwxc@email.cn>
 胡皓文 Hu Haowen <src.res.211@gmail.com>

Unicode Support
===============

	(English version) Last update: 2005-01-17, version 1.4

This document is maintained by H. Peter Anvin <unicode@lanana.org> as part of
the Linux Assigned Names And Numbers Authority (LANANA) project. The current
version can be found at:

	http://www.lanana.org/docs/unicode/admin-guide/unicode.rst

Introduction
------------

The Linux kernel code has been rewritten to use Unicode to map characters to
fonts. Once a single Unicode-to-font table has been downloaded, both the
eight-bit character sets and UTF-8 mode use that font for display.

This subtly changes the semantics of the eight-bit character tables. The four
character tables are now:

=============== =============================== ================
Map symbol      Map name                        Escape code (G0)
=============== =============================== ================
LAT1_MAP        Latin-1 (ISO 8859-1)            ESC ( B
GRAF_MAP        DEC VT100 pseudographics        ESC ( 0
IBMPC_MAP       IBM code page 437               ESC ( U
USER_MAP        User defined                    ESC ( K
=============== =============================== ================

In particular, ESC ( U is no longer "straight to font", since the font may be
completely different from the IBM character set. This permits, for example,
the use of block graphics even with a Latin-1 font loaded.

Note that although these codes are similar to ISO 2022, neither the codes nor
their uses match ISO 2022; Linux has two 8-bit codes (G0 and G1), whereas
ISO 2022 has four 7-bit codes (G0-G3).
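
As a minimal sketch of how the G0 escape codes in the table above could be
generated, the following Python helper builds the byte sequence for each map.
The names ``G0_SELECT`` and ``select_g0`` are hypothetical and exist only for
this illustration; the spaces shown in the table are notational and are not
part of the emitted sequence.

```python
# Illustrative sketch only: G0_SELECT and select_g0 are hypothetical names,
# not part of any kernel or library interface.
ESC = "\x1b"

# Final character of the "ESC (" sequence for each map, as listed in the table.
G0_SELECT = {
    "LAT1_MAP": "B",   # Latin-1 (ISO 8859-1)
    "GRAF_MAP": "0",   # DEC VT100 pseudographics
    "IBMPC_MAP": "U",  # IBM code page 437
    "USER_MAP": "K",   # User defined
}


def select_g0(map_symbol: str) -> str:
    """Return the escape sequence that selects map_symbol as the G0 set."""
    return ESC + "(" + G0_SELECT[map_symbol]
```

Writing ``select_g0("GRAF_MAP")`` to a Linux virtual console would be expected
to switch G0 to the VT100 pseudographics map, and ``select_g0("LAT1_MAP")``
to switch it back to Latin-1.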

In accordance with the Unicode standard/ISO 10646, the range U+F000 to U+F8FF
has been reserved for OS-wide allocation (the Unicode Standard refers to this
as a "Corporate Zone"; since this is inaccurate for Linux, we call it the
"Linux Zone"). U+F000 was picked as the starting point because it lets the
direct-mapping area start on a large power of two (in case 1024- or
2048-character fonts ever become necessary). This leaves U+E000 to U+EFFF as
the End User Zone.
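
The zone boundaries and the alignment of U+F000 can be checked with a few
lines of Python. This is only a sketch: ``END_USER_ZONE``, ``LINUX_ZONE`` and
``zone_of`` are hypothetical names introduced for the example.

```python
# Illustrative sketch only: the names below are hypothetical.
END_USER_ZONE = range(0xE000, 0xF000)  # U+E000..U+EFFF
LINUX_ZONE = range(0xF000, 0xF900)     # U+F000..U+F8FF


def zone_of(codepoint: int) -> str:
    """Classify a code point according to the zones described above."""
    if codepoint in END_USER_ZONE:
        return "End User Zone"
    if codepoint in LINUX_ZONE:
        return "Linux Zone"
    return "outside both zones"


# U+F000 is a multiple of 2048 (and of 4096), so a 1024- or 2048-character
# font could be direct-mapped starting on an aligned boundary.
ALIGNED_FOR_2048 = (0xF000 % 2048 == 0)
```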

[v1.2]: The Unicode range from U+F000 up to U+F7FF has been hard-coded to map
directly to the loaded font, bypassing the translation table. The user-defined
map now defaults to U+F000 to U+F0FF, emulating the previous behaviour. In
practice, this range may be shorter; for example, vgacon can only handle
256-character (U+F000..U+F0FF) or 512-character (U+F000..U+F1FF) fonts.
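
The direct mapping amounts to adding the glyph index to U+F000. The following
Python sketch models that arithmetic under the stated limits; it is not the
kernel implementation, and ``glyph_to_codepoint`` and ``codepoint_to_glyph``
are hypothetical helper names.

```python
# Illustrative sketch only: models the arithmetic, not the kernel code.
from typing import Optional

DIRECT_BASE = 0xF000  # first directly mapped code point
DIRECT_LAST = 0xF7FF  # last directly mapped code point


def glyph_to_codepoint(glyph_index: int, font_size: int = 256) -> int:
    """Return the direct-mapped code point for a glyph of the loaded font."""
    if not 0 <= glyph_index < font_size:
        raise ValueError("glyph index outside the loaded font")
    codepoint = DIRECT_BASE + glyph_index
    if codepoint > DIRECT_LAST:
        raise ValueError("glyph index beyond the direct-mapping area")
    return codepoint


def codepoint_to_glyph(codepoint: int, font_size: int = 256) -> Optional[int]:
    """Return the glyph index for a code point, or None if not direct-mapped."""
    if not DIRECT_BASE <= codepoint <= DIRECT_LAST:
        return None
    glyph_index = codepoint - DIRECT_BASE
    return glyph_index if glyph_index < font_size else None
```

With a 512-character font, the last glyph lands on U+F1FF, matching the
vgacon range given above; with a 256-character font, U+F100 has no glyph.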

Actual characters assigned in the Linux Zone
--------------------------------------------

In addition, the following characters not present in Unicode 1.1.4 have been
defined; these are used by the DEC VT graphics map.
[v1.2] This use is obsolete and should no longer be used; please see below.

====== ======================================
U+F800 DEC VT GRAPHICS HORIZONTAL LINE SCAN 1
U+F801 DEC VT GRAPHICS HORIZONTAL LINE SCAN 3
U+F803 DEC VT GRAPHICS HORIZONTAL LINE SCAN 7
U+F804 DEC VT GRAPHICS HORIZONTAL LINE SCAN 9
====== ======================================

The DEC VT220 uses a 6x10 character matrix, and these characters form a smooth
progression in the DEC VT graphics character set. I have omitted the scan 5
line, since it is also used as a block-graphics character and hence has been
coded as U+2500 FORMS LIGHT HORIZONTAL.

[v1.3]: These characters have been officially added to Unicode 3.2.0; they
were added at U+23BA, U+23BB, U+23BC and U+23BD. Linux now uses the new
values.
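
Text that still contains the obsolete Linux Zone code points can be converted
with a simple translation table. The sketch below assumes the four values
correspond in the order listed (scan 1, 3, 7, 9), which matches the Unicode
names HORIZONTAL SCAN LINE-1 through LINE-9; ``OBSOLETE_TO_UNICODE`` and
``modernize`` are hypothetical names.

```python
# Illustrative sketch only: pairing assumed from the order given in the text.
OBSOLETE_TO_UNICODE = {
    0xF800: 0x23BA,  # scan 1
    0xF801: 0x23BB,  # scan 3
    0xF803: 0x23BC,  # scan 7
    0xF804: 0x23BD,  # scan 9
}


def modernize(text: str) -> str:
    """Replace obsolete DEC VT scan-line code points with the Unicode 3.2 ones."""
    return text.translate(OBSOLETE_TO_UNICODE)
```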

[v1.2]: The following characters have been added to represent common keyboard
symbols that are unlikely ever to be added to Unicode proper, since they are
horribly vendor-specific. This, of course, is an excellent example of horrible
design.

====== ======================================
U+F810 KEYBOARD SYMBOL FLYING FLAG
U+F811 KEYBOARD SYMBOL PULLDOWN MENU
U+F812 KEYBOARD SYMBOL OPEN APPLE
U+F813 KEYBOARD SYMBOL SOLID APPLE
====== ======================================

Klingon language support
------------------------

In 1996, Linux was the first operating system in the world to add support for
the artificial language Klingon, created by Marc Okrand for the "Star Trek"
television series. This encoding was later adopted by the ConScript Unicode
Registry (CSUR) and proposed (but ultimately rejected) for inclusion in
Unicode Plane 1. It therefore remains a Linux/CSUR private assignment in the
Linux Zone.

This encoding has been endorsed by the Klingon Language Institute.
For more information, contact them at:

	http://www.kli.org/

Since the characters at the beginning of the Linux CZ have been more of the
dingbats/symbols/forms type and this is a language, I have located it at the
end, on a 16-cell boundary, in keeping with standard Unicode practice.

.. note::

  This range is now officially managed by the ConScript Unicode Registry.
  The normative reference is at:

	https://www.evertype.com/standards/csur/klingon.html

Klingon has an alphabet of 26 characters and a positional numeric writing
system with 10 digits, and is written left-to-right, top-to-bottom.

Several glyph forms for the Klingon alphabet have been proposed. However,
since the set of symbols appears to be consistent throughout, with only the
actual shapes being different, these differences are considered font variants,
in keeping with standard Unicode practice.

======	=======================================================
U+F8D0	KLINGON LETTER A
U+F8D1	KLINGON LETTER B
U+F8D2	KLINGON LETTER CH
U+F8D3	KLINGON LETTER D
U+F8D4	KLINGON LETTER E
U+F8D5	KLINGON LETTER GH
U+F8D6	KLINGON LETTER H
U+F8D7	KLINGON LETTER I
U+F8D8	KLINGON LETTER J
U+F8D9	KLINGON LETTER L
U+F8DA	KLINGON LETTER M
U+F8DB	KLINGON LETTER N
U+F8DC	KLINGON LETTER NG
U+F8DD	KLINGON LETTER O
U+F8DE	KLINGON LETTER P
U+F8DF	KLINGON LETTER Q
	- Written <q> in standard Okrand Latin transliteration
U+F8E0	KLINGON LETTER QH
	- Written <Q> in standard Okrand Latin transliteration
U+F8E1	KLINGON LETTER R
U+F8E2	KLINGON LETTER S
U+F8E3	KLINGON LETTER T
U+F8E4	KLINGON LETTER TLH
U+F8E5	KLINGON LETTER U
U+F8E6	KLINGON LETTER V
U+F8E7	KLINGON LETTER W
U+F8E8	KLINGON LETTER Y
U+F8E9	KLINGON LETTER GLOTTAL STOP

U+F8F0	KLINGON DIGIT ZERO
U+F8F1	KLINGON DIGIT ONE
U+F8F2	KLINGON DIGIT TWO
U+F8F3	KLINGON DIGIT THREE
U+F8F4	KLINGON DIGIT FOUR
U+F8F5	KLINGON DIGIT FIVE
U+F8F6	KLINGON DIGIT SIX
U+F8F7	KLINGON DIGIT SEVEN
U+F8F8	KLINGON DIGIT EIGHT
U+F8F9	KLINGON DIGIT NINE

U+F8FD	KLINGON COMMA
U+F8FE	KLINGON FULL STOP
U+F8FF	KLINGON SYMBOL FOR EMPIRE
======	=======================================================
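
Because the ten digits occupy consecutive code points starting at U+F8F0, a
decimal number can be rendered by offsetting each digit. The sketch below
assumes the most significant digit is written first, which the table itself
does not state; ``to_klingon_digits`` is a hypothetical helper name.

```python
# Illustrative sketch only: assumes most-significant-digit-first ordering.
KLINGON_DIGIT_ZERO = 0xF8F0  # U+F8F0 KLINGON DIGIT ZERO


def to_klingon_digits(number: int) -> str:
    """Render a non-negative integer using the Klingon digit code points."""
    if number < 0:
        raise ValueError("only non-negative integers are supported")
    return "".join(chr(KLINGON_DIGIT_ZERO + int(d)) for d in str(number))
```

Like every code point in this range, each of these characters takes three
bytes in UTF-8; for example, U+F8D0 KLINGON LETTER A encodes as EF A3 90.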

Other Fictional and Artificial Scripts
--------------------------------------

Since the assignment of the Klingon Linux Unicode block, a registry of
fictional and artificial scripts has been established by John Cowan
<jcowan@reutershealth.com> and Michael Everson <everson@evertype.com>.
The ConScript Unicode Registry is accessible at:

	https://www.evertype.com/standards/csur/
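
The ranges used fall at the low end of the End User Zone and hence cannot be
normatively assigned, but people who wish to encode fictional scripts are
encouraged to use these codes in the interest of interoperability. For
Klingon, CSUR has adopted the Linux encoding. The CSUR people are working to
add Tengwar and Cirth to Unicode Plane 1; the addition of Klingon to Unicode
Plane 1 has been rejected, so the above encoding remains official.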