Unicode

From Free net encyclopedia

Unicode
Encodings UTF-7 UTF-8 CESU-8 UTF-16/UCS-2 UTF-32/UCS-4 UTF-EBCDIC SCSU Punycode GB 18030
UCS
Mapping
Bi-directional text
BOM
Han unification
Unicode and HTML
Unicode and e-mail

Template:SpecialChars

Unicode is an industry standard designed to allow text and symbols from all languages to be consistently represented and manipulated by computers.

Unicode characters can be encoded using any of several schemes termed Unicode Transformation Formats (UTF).

The Unicode Consortium has as its ambitious goal the eventual replacement of existing character encoding schemes with Unicode, as many of the existing schemes are limited in size and scope, and are incompatible with multilingual environments. Its success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including XML, the Java programming language, and modern operating systems.

1 Origin and development
2 Scripts covered
- 2.1 Trivia
3 Mapping and encodings
4 Unicode in use
5 See also
6 References
7 External links

Row	Cells	Range(s)
00	20–7E	Basic Latin (00–7F)
00	A0–FF	Latin-1 Supplement (80–FF)
01	00–13, 14–15, 16–2B, 2C–2D, 2E–4D, 4E–4F, 50–7E, 7F	Latin Extended-A (00–7F)
01	8F, 92, B7, DE-EF, FA–FF	Latin Extended-B (80–FF …)
02	18–1B, 1E–1F	Latin Extended-B (… 00–4F)
	59, 7C, 92	IPA Extensions (50–AF)
	BB–BD, *C6, C7,* C9, D6, D8–DB, DC, DD,** DF, EE	Spacing Modifier Letters (B0–FF)
03	74–75, 7A, 7E, 84–8A, 8C, 8E–A1, A3–CE, D7, DA–E1	Greek (70–FF)
04	00, 01–0C, 0D, 0E–4F, 50, 51–5C, 5D, 5E–5F, 90–91, 92–C4, C7–C8, CB–CC, D0–EB, EE–F5, F8–F9	Cyrillic (00–FF)
1E	02–03, 0A–0B, 1E–1F, 40–41, 56–57, 60–61, 6A–6B, 80–85, 9B, F2–F3	Latin Extended Additional (00–FF)
1F	00–15, 18–1D, 20–45, 48–4D, 50–57, 59, 5B, 5D, 5F–7D, 80–B4, B6–C4, C6–D3, D6–DB, DD–EF, F2–F4, F6–FE	Greek Extended (00–FF)
20	*13–14, 15,* 17, 18–19, 1A–1B, 1C–1D, 1E, 20–22, 26, 30, 32–33, 39–3A, 3C, 3E**	General Punctuation (00–6F)
	44, 4A, 7F, 82	Superscripts and Subscripts (70–9F)
	A3–A4, A7, AC, AF	Currency Symbols (A0–CF)
21	*05, 13, 16, 22, 26,* 2E**	Letterlike Symbols (00–4F)
	5B–5E	Number Forms (50–8F)
	90–93, 94–95, A8	Arrows (90–FF)
22	00, 02, 03, 06, 08-09, 0F, 11–12, 15, 19–1A, 1E–1F, 27-28, 29, 2A, 2B, 48, 59, 60–61, 64–65, 82–83, 95, 97	Mathematical Operators (00–FF)
23	02, 0A, 20–21, 29–2A	Miscellaneous Technical (00–FF)
25	00, 02, 0C, 10, 14, 18, 1C, 24, 2C, 34, 3C, 50–6C	Box Drawing (00–7F)
	80, 84, 88, 8C, 90–93	Block Elements (80–9F)
	A0–A1, AA–AC, B2, BA, BC, C4, CA–CB, CF, D8–D9, E6	Geometric Shapes (A0–FF)
26	*3A–3C, 40, 42, 60, 63, 65–66, 6A,* 6B**	Miscellaneous Symbols (00–FF)
F0	(01–02)	Private Use Area (00–FF …)
FB	01–02	Alphabetic Presentation Forms (00–4F)
FF	FD	Specials

Unicode

From Free net encyclopedia

Contents

Origin and development

Scripts covered

Trivia

Mapping and encodings

Standard

Unicode revision history

Storage, transfer, and processing

Ready-made versus composite characters

Issues

Unicode in use

Operating systems

E-mail

Web

Fonts

Multilingual text-rendering engines

Input methods

See also

References

External links

Views

Personal tools

Search

Partner sites