Abstract is: The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS) (plus amendments to that standard), which is the basis of many character encodings, improving as characters from previously unrepresented typing systems are added. The UCS has over 1.1 million possible code points available for use/allocation, but only the first 65,536, which is the Basic Multilingual Plane (BMP), had entered into common use before 2000. This situation began changing when the People's Republic of China (PRC) ruled in 2006 that all software sold in its jurisdiction would have to support GB 18030. This required software intended for sale in the PRC to move beyond the BMP. The system deliberately leaves many code points not assigned to characters, even in the BMP. It does this to allow for future expansion or to minimise conflicts with other encoding forms. The original edition of the UCS defined UTF-16, an extension of UCS-2, to represent code points outside the BMP. A range of code points in the S (Special) Zone of the BMP remains unassigned to characters. UCS-2 disallows use of code values for these code points, but UTF-16 allows their use in pairs. Unicode also adopted UTF-16, but in Unicode terminology, the high-half zone elements become "high surrogates" and the low-half zone elements become "low surrogates". Another encoding, UTF-32 (previously named UCS-4), uses four bytes (total 32 bits) to encode a single character of the codespace. UTF-32 thereby permits a binary representation of every code point in the APIs, and software applications.
ISO standard | Q15087423 |
IEC standard | Q21705905 |
coded character set | Q29149990 |
P1343 | described by source | ISO/IEC 10646:2014: Information technology—Universal Coded Character Set (UCS) | Q26720086 |
P1382 | partially coincident with | Unicode | Q8819 |
Q718147 | C0 and C1 control codes |
Q796156 | CJK unified ideograph |
Q109615047 | Unicode code point |
Q10853148 | Unicode plane |
Q109613493 | Unicode range |
Q1416278 | Unicode transformation format |
Q30921757 | character encoding scheme |
Q14629298 | Han unification | facet of | P1269 |
Q8819 | Unicode | partially coincident with | P1382 |
Q424583 | Internationalized Resource Identifier | uses | P2283 |
Q897819 | Universal Character Set characters | main subject | P921 |
https://cs.wikipedia.org/wiki/ISO/IEC 10646 | wikipedia | |
Universal Coded Character Set | wikipedia | |
Universal Coded Character Set | wikipedia | |
https://es.wikipedia.org/wiki/ISO/IEC 10646 | wikipedia | |
Persian (fa / Q9168) | یوسیاس (کدبندی نویسه) | wikipedia |
https://fr.wikipedia.org/wiki/ISO/CEI 10646 | wikipedia | |
ISO 10646 | wikipedia | |
UCS | wikipedia | |
Universal Character Set | wikipedia | |
https://ja.wikipedia.org/wiki/ISO/IEC 10646 | wikipedia | |
국제 문자 세트 | wikipedia | |
ky | Юникод тамгалар системи | wikipedia |
nb | https://no.wikipedia.org/wiki/ISO/IEC 10646 | wikipedia |
ISO-10646 | wikipedia | |
Norwegian, Nynorsk (nn / Q25164) | https://nn.wikipedia.org/wiki/ISO/IEC 10646 | wikipedia |
ISO 10646 | wikipedia | |
https://pt.wikipedia.org/wiki/ISO/IEC 10646 | wikipedia | |
Универсальный набор символов | wikipedia | |
UCS | wikipedia | |
https://sv.wikipedia.org/wiki/ISO/IEC 10646 | wikipedia | |
Універсальний кодований набір символів | wikipedia | |
Yoruba (yo / Q34311) | ISO 10646 | wikipedia |
通用字符集 | wikipedia |
Search more.