Universal Character Set

standard set of coded characters defined by the ISO/IEC 10646 international standard

DBpedia resource is: http://dbpedia.org/resource/Universal_Coded_Character_Set

Abstract is: The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology – Universal Coded Character Set (UCS) (plus amendments to that standard). It is the basis of many character encodings and continues to grow as characters from previously unrepresented writing systems are added. The UCS has over 1.1 million code points available for allocation, but only the first 65,536, which make up the Basic Multilingual Plane (BMP), had entered into common use before 2000. This situation began changing when the People's Republic of China (PRC) ruled in 2006 that all software sold in its jurisdiction would have to support GB 18030, which required software intended for sale in the PRC to move beyond the BMP. The system deliberately leaves many code points unassigned to characters, even in the BMP, to allow for future expansion or to minimise conflicts with other encoding forms. The original edition of the UCS defined UTF-16, an extension of UCS-2, to represent code points outside the BMP. A range of code points in the S (Special) Zone of the BMP remains unassigned to characters. UCS-2 disallows the use of code values for these code points, but UTF-16 allows their use in pairs. Unicode also adopted UTF-16, but in Unicode terminology the high-half zone elements are called "high surrogates" and the low-half zone elements "low surrogates". Another encoding, UTF-32 (previously named UCS-4), uses four bytes (32 bits in total) to encode a single character of the codespace. UTF-32 thereby gives every code point a fixed-width binary representation for use in APIs and software applications.
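
The relationship between the BMP, UTF-16 surrogate pairs, and UTF-32 described in the abstract can be illustrated with a minimal Python sketch. The constant and function names (BMP_SIZE, MAX_CODE_POINT, to_utf16_units, to_utf32_bytes) are hypothetical and chosen for illustration only; they do not come from ISO/IEC 10646 or from any library. The sketch assumes the usual 17-plane codespace ending at U+10FFFF and big-endian byte order for UTF-32.

```python
from __future__ import annotations

# Illustrative names only; none of these identifiers come from the standard.
BMP_SIZE = 0x10000         # 65,536 code points in the Basic Multilingual Plane
MAX_CODE_POINT = 0x10FFFF  # assumed last code point (17 planes of 65,536 each)


def to_utf16_units(code_point: int) -> list[int]:
    """Return the UTF-16 code units (one or two) for a single code point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError("code point outside the codespace")
    if 0xD800 <= code_point <= 0xDFFF:
        # These BMP code points are reserved for use in pairs.
        raise ValueError("surrogate code points do not encode characters")
    if code_point < BMP_SIZE:
        return [code_point]              # BMP: a single 16-bit unit
    offset = code_point - BMP_SIZE       # 20-bit value for planes 1 to 16
    high = 0xD800 + (offset >> 10)       # high surrogate: top 10 bits
    low = 0xDC00 + (offset & 0x3FF)      # low surrogate: bottom 10 bits
    return [high, low]


def to_utf32_bytes(code_point: int) -> bytes:
    """Return the four-byte big-endian UTF-32 form of a code point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError("code point outside the codespace")
    return code_point.to_bytes(4, "big")


if __name__ == "__main__":
    print([hex(u) for u in to_utf16_units(0x1F600)])
    print(to_utf32_bytes(0x1F600).hex())
```

A code point inside the BMP maps to one 16-bit unit, while a code point beyond it is split into a high and a low surrogate. The UTF-32 form is always four bytes, which is why it can represent any code point at a fixed width.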

Universal Character Set is …
instance of (P31):
ISO standard (Q15087423)
IEC standard (Q21705905)

subclass of (P279):
coded character set (Q29149990)

follows:
ISO/IEC 2022
External links are
P11567 Dictionary of Archives Terminology ID: iso-iec-10646
P646 Freebase ID: /m/019m4l
P4197 IBM graphic character set global ID: 03001, 03004
P6366 Microsoft Academic ID: 17593920

P1343 described by source: ISO/IEC 10646:2014: Information technology – Universal Coded Character Set (UCS) (Q26720086)
P1382 partially coincident with: Unicode (Q8819)

Reverse relations

based on (P144)
Q1484877 GB 18030
Q11225834 JIS X 0221
Q8819 Unicode
Q125262501 Unicode encoding

followed by (P156)
Q1197730 ISO/IEC 2022
Q764925 ISO/IEC 646
Q221738 ISO/IEC 8859

part of (P361)
Q718147 C0 and C1 control codes
Q796156 CJK unified ideograph
Q109615047 Unicode code point
Q10853148 Unicode plane
Q109613493 Unicode range
Q1416278 Unicode transformation format
Q30921757 character encoding scheme

Q14629298 Han unification: facet of (P1269)
Q8819 Unicode: partially coincident with (P1382)
Q424583 Internationalized Resource Identifier: uses (P2283)
Q897819 Universal Character Set characters: main subject (P921)

The articles in Wikimedia projects and languages

      https://cs.wikipedia.org/wiki/ISO/IEC 10646 (wikipedia)
      Universal Coded Character Set (wikipedia)
      Universal Coded Character Set (wikipedia)
      https://es.wikipedia.org/wiki/ISO/IEC 10646 (wikipedia)
Persian (fa / Q9168): یوسی‌اس (کدبندی نویسه) (wikipedia)
      https://fr.wikipedia.org/wiki/ISO/CEI 10646 (wikipedia)
      ISO 10646 (wikipedia)
      UCS (wikipedia)
      Universal Character Set (wikipedia)
      https://ja.wikipedia.org/wiki/ISO/IEC 10646 (wikipedia)
      국제 문자 세트 (wikipedia)
ky: Юникод тамгалар системи (wikipedia)
nb: https://no.wikipedia.org/wiki/ISO/IEC 10646 (wikipedia)
      ISO-10646 (wikipedia)
Norwegian, Nynorsk (nn / Q25164): https://nn.wikipedia.org/wiki/ISO/IEC 10646 (wikipedia)
      ISO 10646 (wikipedia)
      https://pt.wikipedia.org/wiki/ISO/IEC 10646 (wikipedia)
      Универсальный набор символов (wikipedia)
      UCS (wikipedia)
      https://sv.wikipedia.org/wiki/ISO/IEC 10646 (wikipedia)
      Універсальний кодований набір символів (wikipedia)
Yoruba (yo / Q34311): ISO 10646 (wikipedia)
      通用字符集 (wikipedia)
