Unicode fonts

Unicode fonts (also known as UCS fonts or Unicode typefaces) contain a wide range of characters, letters, digits, glyphs and symbols drawn from many of the world's languages, all mapped into the Universal Character Set (UCS), which is defined by the international standard ISO/IEC 10646. A single font can therefore display a vast range of characters from different languages.


Current Unicode fonts do not include every UCS character defined in the present revision of the ISO 10646 standard. Their creators keep updating them: adding characters that were previously missing or incorrectly placed, adding characters of their own, and adding the new characters defined in each newer revision of ISO 10646.


The UCS has over 1.1 million code points, but only the first 65,536 (Plane 0, the Basic Multilingual Plane or BMP) had entered into common use before 2000. See the Mapping of Unicode characters article for more information on the other planes (Plane 1: SMP, Plane 2: SIP, Plane 14: SSP, Planes 15 and 16: reserved for PUA) and the scripts they include for different languages and dialects.
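The plane arithmetic above can be checked with a few lines of Python. This is only an illustrative sketch: the helper names plane_of and plane_label are made up for this example, and the label table covers only the planes named in the text.

```python
# Sketch: code-point arithmetic for the UCS planes mentioned above.
PLANE_SIZE = 0x10000        # 65,536 code points per plane
PLANE_COUNT = 17            # planes 0 through 16
MAX_CODE_POINT = 0x10FFFF   # last code point of plane 16

# Abbreviations as used in the text; other planes are left unnamed here.
PLANE_NAMES = {0: "BMP", 1: "SMP", 2: "SIP", 14: "SSP", 15: "PUA", 16: "PUA"}


def plane_of(code_point: int) -> int:
    """Return the plane number (0-16) that contains code_point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError(f"not a UCS code point: {code_point:#x}")
    return code_point // PLANE_SIZE


def plane_label(code_point: int) -> str:
    """Return the abbreviation used in the text, or 'Plane N' if unnamed."""
    plane = plane_of(code_point)
    return PLANE_NAMES.get(plane, f"Plane {plane}")


total = PLANE_SIZE * PLANE_COUNT
print(total)                    # 1114112
print(total == 2**20 + 2**16)   # True
print(plane_label(0x9AA8))      # BMP
print(plane_label(0x1D400))     # SMP
```

The total of 1,114,112 is what "over 1.1 million code points" refers to, and it equals 2^20 + 2^16.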


There are typographical ambiguities in Unicode: some of the unified Chinese characters are drawn differently in different regions. For example, code point U+9AA8 (骨) has different typographic forms in simplified Chinese and traditional Chinese. This has implications for the idea that a single typeface can satisfy the needs of all locales[1].
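As a small illustration of the point above, the following Python sketch (standard library only) shows that the text stores a single code point for 骨. Nothing in the encoded character records which regional form is meant, so that choice falls to the typeface.

```python
import unicodedata

ch = "\u9aa8"  # the character 骨 from the example above

# The same code point and the same bytes are stored whatever the locale.
print(f"U+{ord(ch):04X}")         # U+9AA8
print(unicodedata.name(ch))       # CJK UNIFIED IDEOGRAPH-9AA8
print(ch.encode("utf-8").hex())   # e9aaa8
```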


Utility software can be used to see exactly which characters a typeface includes. Examples are the Character Map applet included with Windows 2000/XP, MainType (by HighLogic; commercial, with a 40-day trial version), BabelMap (by Andrew West; free donation-ware), Unicode Font Viewer (by Mike Lischke; freeware) and Quick Key (by Nathanael Jones; free and open source).
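The per-block tally that such tools report, and that the tables in this article summarise, can be sketched in Python. This is a minimal illustration, not how any of the listed programs works. The function names and the sample list of code points are hypothetical, and only four block ranges from the tables are included. In practice the code points would be read from a font's character map with a font-parsing library.

```python
from collections import Counter

# A few block ranges from the tables in this article (inclusive bounds).
BLOCKS = [
    ("Basic Latin", 0x0000, 0x007F),
    ("Latin-1 Supplement", 0x0080, 0x00FF),
    ("Greek", 0x0370, 0x03FF),
    ("Cyrillic", 0x0400, 0x04FF),
]


def block_of(code_point):
    """Return the name of the listed block containing code_point, or None."""
    for name, first, last in BLOCKS:
        if first <= code_point <= last:
            return name
    return None


def coverage_by_block(code_points):
    """Count how many distinct code points fall in each listed block."""
    counts = Counter()
    for cp in sorted(set(code_points)):
        name = block_of(cp)
        if name is not None:
            counts[name] += 1
    return dict(counts)


# Hypothetical font: the 95 printable ASCII characters plus three Greek letters.
hypothetical_cmap = list(range(0x20, 0x7F)) + [0x03B1, 0x03B2, 0x03B3]
print(coverage_by_block(hypothetical_cmap))
# {'Basic Latin': 95, 'Greek': 3}
```

The count of 95 for Basic Latin matches the figure most fonts show in the first table, because the remaining 33 positions of that block are control characters.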


Unicode fonts may refer to any of the following: Arial Unicode MS, Bitstream Cyberbit, Cardo, Caslon, Code2000, Charis SIL, ClearlyU, DejaVu fonts, Doulos SIL, Everson Mono Unicode, Gentium, GNU Unifont, Junicode, Lucida Sans Unicode, New Gulim, TITUS Cyberbit Basic, Y.OzFontN.



0000–077F

Unicode fonts

Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Characters | 38,917 | - | 2,195 | - | 51,155 | 1,956 | - | 1,686 | 1,956 | 4,893 | 1,469 | - | 1,433 | - | - | - | 16,777
Glyphs | 50,377 | 29,934 | 2,882 | 3,551 | 63,888 | 3,084 | 9,538 | 1,701 | 3,083 | 4,899 | 1,699 | - | 1,435 | 1,776 | 49,284 | 9,779 | 59,678
Version 1.01 2.0b 0.098 2004 12/16/01 1.15 4.002 1.9 2.3 4.014 4.1.3[2] 1.0.1 2003 - 0.6.3 2.00 3.10 3.0, 2000 9.13
Basic Latin (0000–007F) | 95 | ✓ | 95 | ✓ | 95 | 95 | ✓ | 96 | 95 | 95 | 95 | - | ✓ | ✓ | ✓ | ✓ | 95
Latin-1 Supplement (0080–00FF) | 96 | ✓ | 96 | ✓ | 96 | 96 | ✓ | 96 | 96 | 96 | 96 | - | ✓ | ✓ | ✓ | ✓ | 96
Latin Extended-A (0100–017F) | 128 | ✓ | 128 | ✓ | 128 | 128 | X | 128 | 128 | 128 | 128 | - | ✓ | ✓ | ✓ | ✓ | 128
Latin Extended-B (0180–024F) | 148 | ✓ | 52 | ✓ | 194 | 194 | ✓ | 126 | 194 | 183 | 178 | - | ✓ | ✓ | X | ✓ | 28
IPA Extensions (0250–02AF) | 89 | ✓ | 96 | ✓ | 96 | 96 | ✓ | 96 | 96 | 96 | 94 | - | ✓ | ✓ | X | ✓ | 55
Spacing Modifier Letters (02B0–02FF) | 57 | ✓ | 80 | ✓ | 73 | 80 | ✓ | 41 | 80 | 80 | 56 | - | ✓ | ✓ | ✓ | ✓ | 16
Combining Diacritical Marks (0300–036F) | 72 | ✓ | 112 | ✓ | 97 | 104 | ✓ | 66 | 104 | 107 | 82 | - | ✓ | ✓ | X | ✓ | 32
Greek (0370–03FF) | 105 | ✓ | 124 | ✓ | 120 | 14 | ✓ | 110 | 14 | 118 | 82 | - | X | ✓ | ✓ | ✓ | 76
Cyrillic (0400–04FF) | 226 | ✓ | 2 | ✓ | 246 | 209 | ✓ | 193 | 209 | 246 | 80 | - | X | ✓ | ✓ | ✓ | 66
Cyrillic Supplement (0500–052F) | 0 | X | X | X | 16 | 16 | X | X | 16 | 16 | 1 | - | X | X | X | ✓ | X
Armenian (0530–058F) | 85 | X | X | ✓ | 86 | X | ✓ | 86 | X | 86 | X | - | X | X | X | ✓ | X
Hebrew (0590–05FF) | 82 | ✓ | 86 | ✓ | 82 | X | ✓ | X | X | 82 | X | - | X | ✓ | X | ✓ | X
Arabic (0600–06FF) | 194 | ✓ | 10 | X | 185 | X | X | X | X | 3 | X | - | X | X | X | ✓ | X
Syriac (0700–074F) | 0 | X | X | X | 50 | X | X | X | X | X | X | - | X | X | X | ✓ | X
Arabic Supplement (0750–077F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
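The counts in these tables are easier to judge against the size of each block. The sketch below is a worked example with hypothetical helper names, using the Latin Extended-B row, where Code2000 is listed with 194 characters. Note that a block's size counts all of its code-point positions, including ones to which no character is assigned, so 100% is not always reachable.

```python
def block_size(first: int, last: int) -> int:
    """Number of code-point positions in an inclusive range."""
    return last - first + 1


def coverage_percent(count: int, first: int, last: int) -> float:
    """Share of the block's positions that a font covers, in percent."""
    return 100.0 * count / block_size(first, last)


# Latin Extended-B spans 0180-024F.
print(block_size(0x0180, 0x024F))                        # 208
print(round(coverage_percent(194, 0x0180, 0x024F), 1))   # 93.3
```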


0780–139F

              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Thaana (0780–07BF) | 0 | X | X | X | 50 | X | ✓ | X | X | X | X | - | X | X | X | ✓ | X
Devanagari (0900–097F) | 104 | X | X | X | 106 | X | ✓ | X | X | X | X | - | X | X | X | ✓ | X
Bengali (0980–09FF) | 89 | X | X | X | 91 | X | X | X | X | X | X | - | X | X | X | X | X
Gurmukhi (0A00–0A7F) | 75 | X | X | X | 77 | X | X | X | X | X | X | - | X | X | X | X | X
Gujarati (0A80–0AFF) | 78 | X | X | X | 78 | X | X | X | X | X | X | - | X | X | X | X | X
Oriya (0B00–0B7F) | 79 | X | X | X | 79 | X | X | X | X | X | X | - | X | X | X | X | X
Tamil (0B80–0BFF) | 61 | X | X | X | 71 | X | X | X | X | X | X | - | X | X | X | X | X
Telugu (0C00–0C7F) | 80 | X | X | X | 80 | X | X | X | X | X | X | - | X | X | X | X | X
Kannada (0C80–0CFF) | 80 | X | X | X | 80 | X | X | X | X | X | X | - | X | X | X | X | X
Malayalam (0D00–0D7F) | 78 | X | X | X | 78 | X | X | X | X | X | X | - | X | X | X | X | X
Sinhala (0D80–0DFF) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Thai (0E00–0E7F) | 87 | ✓ | X | ✓ | 87 | X | ✓ | X | X | X | X | - | X | X | X | ✓ | X
Lao (0E80–0EFF) | 65 | X | X | X | 65 | X | ✓ | X | X | X | X | - | X | X | X | X | X
Tibetan (0F00–0FFF) | 168 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Myanmar (Burma) (1000–109F) | 0 | X | X | X | 78 | X | X | X | X | X | X | - | X | X | X | X | X
Georgian (10A0–10FF) | 78 | X | 1 | X | 81 | X | ✓ | X | X | 80 | X | - | X | X | X | ✓ | X
Hangul Jamo (1100–11FF) | 240 | X | X | X | 240 | X | X | X | X | X | X | - | X | X | ✓ | X | X
Ethiopic (Ge'ez) (1200–137F) | 0 | X | X | X | 356 | X | ✓ | X | X | X | X | - | X | X | X | ✓ | X
Ethiopic Supplement (1380–139F) | 0 | X | X | X | 26 | X | X | X | X | X | X | - | X | X | X | X | X


13A0–1DBF

              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Cherokee (13A0–13FF) | 0 | X | X | X | 85 | X | ✓ | X | X | 85 | X | - | X | X | X | X | X
Unified Canadian Aboriginal Syllabics (1400–167F) | 0 | X | X | X | 630 | X | ✓ | X | X | 630 | X | - | X | X | X | X | X
Ogham (1680–169F) | 0 | X | X | ✓ | 29 | X | ✓ | X | X | 29 | X | - | X | X | X | ✓ | X
Runic (16A0–16FF) | 0 | X | 81 | ✓ | 81 | X | ✓ | X | X | 81 | X | - | ✓ | X | X | ✓ | X
Tagalog (Baybayin) (1700–171F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Hanunoo (1720–173F) | 0 | X | X | X | 2 | X | X | X | X | X | X | - | X | X | X | X | X
Buhid (1740–175F) | 0 | X | X | X | 20 | X | X | X | X | X | X | - | X | X | X | X | X
Tagbanwa (1760–177F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Khmer (1780–17FF) | 0 | X | X | X | 103 | X | X | X | X | X | X | - | X | X | X | X | X
Mongolian (1800–18AF) | 0 | X | X | X | 155 | X | X | X | X | X | X | - | X | X | X | X | X
Limbu (1900–194F) | 0 | X | X | X | 66 | X | X | X | X | X | X | - | X | X | X | X | X
Tai Le (1950–197F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Tai Lue (1980–19DF) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Khmer Symbols (19E0–19FF) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Buginese (1A00–1A1F) | 0 | X | X | X | 30 | X | X | X | X | X | X | - | X | X | X | X | X
Phonetic Extensions (1D00–1D7F) | 0 | X | 17 | X | 109 | 128 | X | 36 | 128 | 107 | X | - | ✓ | X | X | ✓ | X
Phonetic Extensions Supplement (1D80–1DBF) | 0 | X | X | X | X | 64 | X | 37 | 64 | X | X | - | X | X | X | X | X


1DC0–257F

              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Combining Diacritical Marks Supplement (1DC0–1DFF) | 0 | X | 2 | X | X | 1 | X | X | 1 | X | X | - | X | X | X | X | X
Latin Extended Additional (1E00–1EFF) | 246 | ✓ | 88 | ✓ | 246 | 246 | X | 51 | 246 | 246 | 246 | - | X | X | X | ✓ | 8
Greek Extended (1F00–1FFF) | 233 | X | 233 | ✓ | 233 | X | X | 233 | X | 233 | 233 | - | X | X | X | ✓ | 4
General Punctuation (2000–206F) | 63 | ✓ | 65 | ✓ | 105 | 73 | X | 32 | 73 | 97 | 39 | - | ✓ | ✓ | ✓ | ✓ | 62
Superscripts and Subscripts (2070–209F) | 28 | ✓ | 9 | ✓ | 29 | 34 | X | 8 | 34 | 29 | 28 | - | ✓ | ✓ | X | ✓ | 29
Currency Symbols (20A0–20CF) | 13 | ✓ | 6 | ✓ | 22 | 22 | X | 3 | 22 | 18 | 14 | - | X | ✓ | X | ✓ | 18
Combining Diacritical Marks for Symbols (20D0–20FF) | 18 | ✓ | X | ✓ | 27 | X | X | X | X | 27 | X | - | X | X | X | X | 27
Letterlike Symbols (2100–214F) | 57 | ✓ | 13 | ✓ | 75 | 2 | ✓ | 8 | 2 | 74 | 1 | - | X | ✓ | ✓ | ✓ | 75
Number Forms (2150–218F) | 48 | ✓ | 4 | ✓ | 49 | 49 | ✓ | 34 | 49 | 49 | X | - | ✓ | X | ✓ | ✓ | 49
Arrows (2190–21FF) | 91 | ✓ | 14 | ✓ | 112 | 19 | ✓ | X | 19 | 112 | X | - | X | ✓ | ✓ | ✓ | 112
Mathematical Operators (2200–22FF) | 242 | ✓ | 24 | ✓ | 246 | 17 | X | 34 | 17 | 256 | 2 | - | ✓ | ✓ | ✓ | ✓ | 256
Miscellaneous Technical (2300–23FF) | 123 | ✓ | 36 | ✓ | 209 | 2 | X | 13 | 2 | 207 | X | - | X | X | X | X | 209
Control Pictures (2400–243F) | 37 | ✓ | X | ✓ | 39 | X | ✓ | 2 | X | 39 | X | - | X | ✓ | X | X | 4
Optical Character Recognition (2440–245F) | 11 | ✓ | X | X | 11 | X | X | X | X | 11 | X | - | X | X | X | X | 11
Enclosed Alphanumerics (2460–24FF) | 139 | ✓ | X | ✓ | 160 | X | X | X | X | 159 | X | - | ✓ | X | ✓ | ✓ | 160
Box Drawing (2500–257F) | 128 | ✓ | 1 | ✓ | 128 | X | X | X | X | 128 | X | - | X | ✓ | ✓ | ✓ | 128


2580–2DDF

              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Block Elements (2580–259F) | 22 | ✓ | X | ✓ | 32 | X | X | 32 | X | 32 | X | - | X | ✓ | X | ✓ | 32
Geometric Shapes (25A0–25FF) | 80 | ✓ | 8 | ✓ | 96 | 2 | ✓ | 96 | 2 | 96 | X | - | X | ✓ | ✓ | ✓ | 96
Miscellaneous Symbols (2600–26FF) | 106 | ✓ | 31 | ✓ | 146 | X | X | 7 | X | 125 | X | - | X | X | ✓ | ✓ | 146
Dingbats (2700–27BF) | 160 | X | 6 | X | 174 | 2 | X | X | 2 | 160 | X | - | ✓ | X | X | X | 174
Miscellaneous Mathematical Symbols-A (27C0–27EF) | 0 | X | 9 | X | 35 | 2 | X | 5 | 2 | X | X | - | X | X | X | X | 28
Supplemental Arrows-A (27F0–27FF) | 0 | X | 2 | X | 16 | X | X | X | X | X | X | - | X | X | X | X | 16
Braille Patterns (2800–28FF) | 0 | X | X | X | 256 | X | ✓ | X | X | X | X | - | X | X | X | X | 256
Supplemental Arrows-B (2900–297F) | 0 | X | 6 | X | 128 | X | X | X | X | 111 | X | - | X | X | X | X | 128
Miscellaneous Mathematical Symbols-B (2980–29FF) | 0 | X | 2 | X | 128 | X | X | 1 | X | 62 | X | - | X | X | X | X | 128
Supplemental Mathematical Operators (2A00–2AFF) | 0 | X | X | X | 256 | X | X | 3 | X | 21 | X | - | X | X | X | X | 256
Miscellaneous Symbols and Arrows (2B00–2BFF) | 0 | X | X | X | 20 | X | X | 2 | X | X | X | - | X | X | X | X | 14
Glagolitic (2C00–2C5F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Coptic (2C80–2CFF) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Georgian Supplement (2D00–2D2F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Tifinagh (2D30–2D7F) | 0 | X | X | X | 55 | X | X | X | X | X | X | - | X | X | X | X | X
Ethiopic Extended (2D80–2DDF) | 0 | X | X | X | 79 | X | X | X | X | X | X | - | X | X | X | X | X


2E00–4DBF

              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Supplemental Punctuation (2E00–2E7F) | 0 | X | 24 | X | 26 | X | X | X | X | X | X | - | X | X | X | X | X
CJK Radicals Supplement (2E80–2EFF) | 0 | X | X | X | 115 | X | X | X | X | X | X | - | X | X | X | X | 128
Kangxi Radicals (Kangxi) (2F00–2FDF) | 0 | X | X | X | 214 | X | X | X | X | X | X | - | X | X | X | X | 214
Ideographic Description Characters (2FF0–2FFF) | 0 | X | X | X | 12 | X | X | X | X | X | X | - | X | X | X | X | 12
CJK Symbols and Punctuation (3000–303F) | 57 | ✓ | 12 | X | 64 | X | X | X | X | X | X | - | X | X | ✓ | ✓ | 45
Hiragana (3040–309F) | 90 | ✓ | X | ✓ | 90 | X | X | X | X | 90 | X | - | X | X | ✓ | ✓ | 93
Katakana (30A0–30FF) | 94 | ✓ | X | ✓ | 94 | X | X | X | X | 94 | X | - | X | X | ✓ | ✓ | 96
Bopomofo (3100–312F) | 40 | ✓ | X | X | 40 | X | X | X | X | X | X | - | X | X | X | ✓ | X
Hangul Compatibility Jamo (3130–318F) | 94 | ✓ | X | X | 94 | X | X | X | X | X | X | - | X | X | ✓ | X | 1
Kanbun (3190–319F) | 16 | ✓ | X | X | 16 | X | X | X | X | X | X | - | X | X | X | X | 16
Bopomofo Extended (31A0–31BF) | 0 | X | X | X | 24 | X | X | X | X | X | X | - | X | X | X | X | X
CJK Strokes (31C0–31EF) | 0 | X | X | X | 16 | X | X | X | X | X | X | - | X | X | X | X | X
Katakana Phonetic Extensions (31F0–31FF) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | 16
Enclosed CJK Letters and Months (3200–32FF) | 202 | ✓ | X | X | 232 | X | X | X | X | X | X | - | X | X | ✓ | X | 174
CJK Compatibility (3300–33FF) | 249 | ✓ | X | X | 161 | X | X | X | X | X | X | - | X | X | ✓ | X | 85
CJK Unified Ideographs Extension A (3400–4DBF) | 20,902 | X | X | X | 6,582 | X | X | X | X | X | X | - | X | X | ✓ | X | 176


4DC0–FE2F

              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Yijing Hexagram Symbols (4DC0–4DFF) | 64 | X | X | X | 64 | X | X | X | X | X | X | - | X | X | X | X | X
CJK Unified Ideographs (Han Unification) (4E00–9FFF) | 20,924 | ✓ | X | X | 20,924 | X | X | X | X | X | X | - | X | X | ✓ | X | 9,829
Yi Syllables (A000–A48F) | 0 | X | X | X | 1,165 | X | X | X | X | X | X | - | X | X | X | X | X
Yi Radicals (A490–A4CF) | 0 | X | X | X | 55 | X | X | X | X | X | X | - | X | X | X | X | X
Modifier Tone Letters (A700–A71F) | 0 | X | X | X | X | 23 | X | X | 23 | X | X | - | X | X | X | X | X
Syloti Nagri (A800–A82F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Hangul Syllables (AC00–D7AF) | 11,172 | ✓ | X | X | 11,172 | X | X | X | X | X | X | - | X | X | ✓ | X | X
High Surrogates (D800–DB7F) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
High Private Use Surrogates (DB80–DBFF) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Low Surrogates (DC00–DFFF) | 0 | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Private Use Area (E000–F8FF) | 0 | ✓ | X | X | X | X | X | X | X | X | X | - | X | X | X | ✓ | X
CJK Compatibility Ideographs (F900–FAFF) | 302 | ✓ | X | X | 421 | X | X | X | X | X | X | - | X | X | ✓ | X | 98
Alphabetic Presentation Forms (FB00–FB4F) | 57 | ✓ | 53 | ✓ | 58 | 5 | X | 10 | 5 | 58 | 5 | - | X | X | X | ✓ | 2
Arabic Presentation Forms-A (FB50–FDFF) | 593 | ✓ | X | X | 155 | X | X | X | X | X | X | - | X | X | X | ✓ | 25
Variation Selectors (FE00–FE0F) | 0 | X | X | X | 16 | 16 | X | X | 16 | 1 | X | - | X | X | X | X | X
Vertical Forms (FE10–FE1F) | 0 | X | X | X | 0 | 0 | X | X | 0 | 0 | X | - | X | X | X | X | X
Combining Half Marks (FE20–FE2F) | 4 | X | X | X | 4 | X | X | X | X | 4 | X | - | X | X | X | X | X


FE30–FFFF

              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
CJK Compatibility Forms (FE30–FE4F) | 28 | X | X | X | 28 | X | X | X | X | X | X | - | X | X | X | ✓ | 32
Small Form Variants (FE50–FE6F) | 26 | ✓ | X | X | 26 | X | X | X | X | X | X | - | X | X | X | ✓ | X
Arabic Presentation Forms-B (FE70–FEFF) | 139 | ✓ | X | X | 140 | 1 | X | X | 1 | X | X | - | X | X | X | ✓ | X
Halfwidth and Fullwidth Forms (FF00–FFEF) | 223 | ✓ | X | ✓ | 124 | X | X | X | X | X | X | - | X | X | ✓ | ✓ | 172
Specials (FFF0–FFFF) | 2 | X | 1 | X | 3 | 5 | X | 1 | 5 | 5 | X | - | X | X | X | X | X

(end of the BMP)


10000–1D7FF

┌─ SMP 
           
              
Range ─┘
Font  ───>
Range ─┐
              
Range | Arial Unicode MS | Bitstream Cyberbit | Cardo | Caslon | Code2000 | Charis SIL | ClearlyU | DejaVu Fonts | Doulos SIL | Everson Mono Unicode | Gentium | GNU Unifont | Junicode | Lucida Sans Unicode | New Gulim | TITUS Cyberbit Basic | Y.OzFontN
Linear B Syllabary (10000–1007F) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Linear B Ideograms (10080–100FF) | X | X | 2 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Aegean Numbers (10100–1013F) | X | X | 2 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Ancient Greek Numbers (10140–1018F) | X | X | 75 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Old Italic (10300–1032F) | X | X | 35 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Gothic (10330–1034F) | X | X | 27 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Ugaritic (10380–1039F) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Old Persian (103A0–103DF) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Deseret (10400–1044F) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Shavian (10450–1047F) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Osmanya (10480–104AF) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Cypriot Syllabary (10800–1083F) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Kharoshthi (10A00–10A5F) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Byzantine Musical Symbols (1D000–1D0FF) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Musical Symbols (1D100–1D1FF) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Ancient Greek Musical Notation (1D200–1D24F) | X | X | 70 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Tai Xuan Jing Symbols (1D300–1D35F) | X | X | 0 | X | X | X | X | X | X | X | X | - | X | X | X | X | X
Mathematical Alphanumeric Symbols (1D400–1D7FF) | X | X | 13 | X | X | 2 | X | X | 2 | X | X | - | X | X | X | X | X
CJK Unified Ideographs Extension B (20000–2A6DF) | X | X | X | X | X | X | X | X | X | X | X | - | X | X | X | X | 339
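The numeric cells in the table read as per-block character counts. As a rough sketch of how such a tally could be produced, the snippet below counts how many code points from a font's character set fall inside each block. The block ranges are copied from the table; the `sample_font` set and the function name `coverage_by_block` are hypothetical, and nothing here is claimed to be how the table was actually compiled.

```python
# Block ranges (inclusive) copied from three rows of the table above.
BLOCKS = {
    "Old Italic": (0x10300, 0x1032F),
    "Gothic": (0x10330, 0x1034F),
    "Mathematical Alphanumeric Symbols": (0x1D400, 0x1D7FF),
}


def coverage_by_block(supported, blocks=BLOCKS):
    """Count how many supported code points fall inside each named block."""
    counts = {name: 0 for name in blocks}
    for cp in supported:
        for name, (low, high) in blocks.items():
            if low <= cp <= high:
                counts[name] += 1
                break  # blocks do not overlap, so stop at the first match
    return counts


# Hypothetical character set for an imaginary font (not real font data).
sample_font = {0x0041, 0x10330, 0x10331, 0x1D400}

for block, count in coverage_by_block(sample_font).items():
    print(f"{block}: {count if count else 'X'}")
```

In practice the set of supported code points would come from the font's character map, read with a font-inspection tool of your choice.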

Linear B is a script that was used for writing Mycenaean, an early form of Greek. ...

Besides its original meaning, of or relating to the Goths (Gothos, Getas), a Germanic tribe and thus the Gothic language and the Gothic alphabet, the word Gothic has been used to refer to distinctly different things: From a Renaissance perspective (originally Italian, gotico, with connotations of rough, barbarous), it conveyed...

The Ugaritic language is known to us only in the form of writings found in the lost city of Ugarit since its discovery by French archaeologists in 1928. ...

Posthumously funded by and named for Irish playwright George Bernard Shaw, the Shavian alphabet (also known as Shaw alphabet) was conceived as a way to provide simple, phonetic orthography for the English language to replace the difficulties of the conventional spelling. ...

Osmanya is an invented script for the Somali language, now replaced both officially and in practice by the Latin alphabet. ...

The Kharoṣṭhī script, also known as the Gāndhārī script, is an ancient alphabetic script used by the Gandhara culture of historic northwest India to write the Gandhari and Sanskrit languages (the Gandhara kingdom was located along the present-day border between Afghanistan and Pakistan between the Indus River and the...

Mathematical alphanumeric symbols are modifications of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles (one example is blackboard bold, or double-struck in Unicode terminology). Unicode now includes many such symbols (in the range U+1D400 ...
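To make the idea of styled mathematical letters concrete, the short sketch below looks up two code points from the Mathematical Alphanumeric Symbols block with Python's standard `unicodedata` module and shows the plain letter each one is a styled variant of. The helper name `describe` is invented for this example.

```python
import unicodedata


def describe(code_point: int):
    """Return (character name, plain compatibility equivalent) for a code point."""
    ch = chr(code_point)
    # NFKC normalization folds the styled letter back to its plain counterpart.
    return unicodedata.name(ch), unicodedata.normalize("NFKC", ch)


# U+1D400 is the first code point of the block; U+1D538 is a double-struck letter.
for cp in (0x1D400, 0x1D538):
    name, plain = describe(cp)
    print(f"U+{cp:04X}  {name}  (plain form: {plain})")
```

Whether the styled glyph itself is displayed correctly still depends on the font in use, which is what the coverage table records.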

See also

The Unicode BMP Fallback Font is a Unicode font containing a glyph for every character in the Basic Multilingual Plane. ...

This page compares Unicode encodings. ...

This is a list of typefaces. ...

Typographic units are the units of measurement used in typography or typesetting. ...

A type foundry is a company that produces and/or distributes typefaces. ...

In computing, HyperText Markup Language (HTML) is a markup language designed for the creation of web pages with hypertext and other information to be displayed in a web browser. ...

Font-management programs are a class of computer utility software that allows the user to activate or deactivate fonts on a computer. ...

In computing, Unicode is the international standard whose goal is to provide the means to encode the text of every document people want to store in computers. ...

The term Alt codes is used to refer to a number of Unicode input methods that allow characters to be entered by typing a character's code point in concert with the Alt key. ...

An input method editor (IME) is a program or operating system component that allows computer users to enter complex characters and symbols (such as Chinese, Japanese, Korean and Tibetan characters), using a standard Western keyboard. ...

Since the Chinese language uses a logographic script (that is, a script where one or two characters correspond roughly to one word or meaning), there are vastly more characters, or glyphs, than there are keys on a standard computer keyboard. ...

Japanese input methods are the methods used to input Japanese characters on a computer. ...

This is intended as a non-exhaustive list of input methods for UNIX platforms. ...

Computers and other typing devices offer many different keyboard layouts, for people to be able to input data in different languages. ...

Code page is the traditional IBM term used for a specific character encoding table: a mapping in which a sequence of bits, usually a single octet representing integer values 0 through 255, is associated with a specific character. ...

References

  1. Ken Lunde, CJKV Information Processing, O'Reilly Inc, 1999. Page 128, "CJKV character form differences".
  2. Version info for font in Macromedia Fontographer v4.1.3 2003-02-13.

The Wikipedia article included on this page is licensed under the GFDL.
Images may be subject to relevant owners' copyright.
All other elements are (c) copyright NationMaster.com 2003-5. All Rights Reserved.