UTF-8 is capable of encoding all 1,112,064 valid Unicode code points using up to four code bytes.
- starting bytes are
11xx xxxx
- continuation bytes are
10xx xxxx
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | CAN | SUB | ESC | FS | GS | RS | US |
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | DEL |
8x | +0 | +1 | +2 | +3 | +4 | +5 | +6 | +7 | +8 | +9 | +A | +B | +C | +D | +E | +F |
9x | +10 | +11 | +12 | +13 | +14 | +15 | +16 | +17 | +18 | +19 | +1A | +1B | +1C | +1D | +1E | +1F |
Ax | +20 | +21 | +22 | +23 | +24 | +25 | +26 | +27 | +28 | +29 | +2A | +2B | +2C | +2D | +2E | +2F |
Bx | +30 | +31 | +32 | +33 | +34 | +35 | +36 | +37 | +38 | +39 | +3A | +3B | +3C | +3D | +3E | +3F |
Cx | [2] | [2] | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
Dx | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
Ex | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 |
Fx | 4 | 4 | 4 | 4 | 4 | [4] | [4] | [4] | [5] | [5] | [5] | [5] | [6] | [6] |
This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.
¢ = c2 a2
0xC2 Controls and Latin-1 Supplement | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+008x | XXX | XXX | BPH | NBH | IND | NEL | SSA | ESA | HTS | HTJ | VTS | PLD | PLU | RI | SS2 | SS3 |
U+009x | DCS | PU1 | PU2 | STS | CCH | MW | SPA | EPA | SOS | XXX | SCI | CSI | ST | OSC | PM | APC |
U+00Ax | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
U+00Bx | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
U+00Cx | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
U+00Dx | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
U+00Ex | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
U+00Fx | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
λ : ce bb
Greek and Coptic | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+037x | Ͱ | ͱ | Ͳ | ͳ | ʹ | ͵ | Ͷ | ͷ | ͺ | ͻ | ͼ | ͽ | ; | Ϳ | ||
U+038x | ΄ | ΅ | Ά | · | Έ | Ή | Ί | Ό | Ύ | Ώ | ||||||
U+039x | ΐ | Α | Β | Γ | Δ | Ε | Ζ | Η | Θ | Ι | Κ | Λ | Μ | Ν | Ξ | Ο |
U+03Ax | Π | Ρ | Σ | Τ | Υ | Φ | Χ | Ψ | Ω | Ϊ | Ϋ | ά | έ | ή | ί | |
U+03Bx | ΰ | α | β | γ | δ | ε | ζ | η | θ | ι | κ | λ | μ | ν | ξ | ο |
U+03Cx | π | ρ | ς | σ | τ | υ | φ | χ | ψ | ω | ϊ | ϋ | ό | ύ | ώ | Ϗ |
U+03Dx | ϐ | ϑ | ϒ | ϓ | ϔ | ϕ | ϖ | ϗ | Ϙ | ϙ | Ϛ | ϛ | Ϝ | ϝ | Ϟ | ϟ |
U+03Ex | Ϡ | ϡ | Ϣ | ϣ | Ϥ | ϥ | Ϧ | ϧ | Ϩ | ϩ | Ϫ | ϫ | Ϭ | ϭ | Ϯ | ϯ |
U+03Fx | ϰ | ϱ | ϲ | ϳ | ϴ | ϵ | ϶ | Ϸ | ϸ | Ϲ | Ϻ | ϻ | ϼ | Ͻ | Ͼ | Ͽ |
Accents
Arial | Arial Unicode MS | Segoe UI Symbol | Dec | Hex | Entity |
---|---|---|---|---|---|
̀ | ̀ | ̀ | 768 | 0300 | GRAVE ACCENT |
́ | ́ | ́ | 769 | 0301 | ACUTE ACCENT |
̂ | ̂ | ̂ | 770 | 0302 | CIRCUMFLEX ACCENT |
̃ | ̃ | ̃ | 771 | 0303 | TILDE |
̄ | ̄ | ̄ | 772 | 0304 | MACRON |
̅ | ̅ | ̅ | 773 | 0305 | OVERLINE |
̆ | ̆ | ̆ | 774 | 0306 | BREVE |
̇ | ̇ | ̇ | 775 | 0307 | DOT ABOVE |
̈ | ̈ | ̈ | 776 | 0308 | DIAERESIS |
̉ | ̉ | ̉ | 777 | 0309 | HOOK ABOVE |
̊ | ̊ | ̊ | 778 | 030A | RING ABOVE |
̋ | ̋ | ̋ | 779 | 030B | DOUBLE ACUTE ACCENT |
̌ | ̌ | ̌ | 780 | 030C | CARON |
̍ | ̍ | ̍ | 781 | 030D | VERTICAL LINE ABOVE |
̎ | ̎ | ̎ | 782 | 030E | DOUBLE VERTICAL LINE ABOVE |
̏ | ̏ | ̏ | 783 | 030F | DOUBLE GRAVE ACCENT |
̐ | ̐ | ̐ | 784 | 0310 | CANDRABINDU |
̑ | ̑ | ̑ | 785 | 0311 | INVERTED BREVE |
̒ | ̒ | ̒ | 786 | 0312 | TURNED COMMA ABOVE |
̓ | ̓ | ̓ | 787 | 0313 | COMMA ABOVE |
̔ | ̔ | ̔ | 788 | 0314 | REVERSED COMMA ABOVE |
̕ | ̕ | ̕ | 789 | 0315 | COMMA ABOVE RIGHT |
̖ | ̖ | ̖ | 790 | 0316 | GRAVE ACCENT BELOW |
̗ | ̗ | ̗ | 791 | 0317 | ACUTE ACCENT BELOW |
̘ | ̘ | ̘ | 792 | 0318 | LEFT TACK BELOW |
̙ | ̙ | ̙ | 793 | 0319 | RIGHT TACK BELOW |
̚ | ̚ | ̚ | 794 | 031A | LEFT ANGLE ABOVE |
̛ | ̛ | ̛ | 795 | 031B | HORN |
̜ | ̜ | ̜ | 796 | 031C | LEFT HALF RING BELOW |
̝ | ̝ | ̝ | 797 | 031D | UP TACK BELOW |
̞ | ̞ | ̞ | 798 | 031E | DOWN TACK BELOW |
̟ | ̟ | ̟ | 799 | 031F | PLUS SIGN BELOW |
̠ | ̠ | ̠ | 800 | 0320 | MINUS SIGN BELOW |
̡ | ̡ | ̡ | 801 | 0321 | PALATALIZED HOOK BELOW |
̢ | ̢ | ̢ | 802 | 0322 | RETROFLEX HOOK BELOW |
̣ | ̣ | ̣ | 803 | 0323 | DOT BELOW |
̤ | ̤ | ̤ | 804 | 0324 | DIAERESIS BELOW |
̥ | ̥ | ̥ | 805 | 0325 | RING BELOW |
̦ | ̦ | ̦ | 806 | 0326 | COMMA BELOW |
̧ | ̧ | ̧ | 807 | 0327 | CEDILLA |
̨ | ̨ | ̨ | 808 | 0328 | OGONEK |
̩ | ̩ | ̩ | 809 | 0329 | VERTICAL LINE BELOW |
̪ | ̪ | ̪ | 810 | 032A | BRIDGE BELOW |
̫ | ̫ | ̫ | 811 | 032B | INVERTED DOUBLE ARCH BELOW |
̬ | ̬ | ̬ | 812 | 032C | CARON BELOW |
̭ | ̭ | ̭ | 813 | 032D | CIRCUMFLEX ACCENT BELOW |
̮ | ̮ | ̮ | 814 | 032E | BREVE BELOW |
̯ | ̯ | ̯ | 815 | 032F | INVERTED BREVE BELOW |
̰ | ̰ | ̰ | 816 | 0330 | TILDE BELOW |
̱ | ̱ | ̱ | 817 | 0331 | MACRON BELOW |
̲ | ̲ | ̲ | 818 | 0332 | LOW LINE |
̳ | ̳ | ̳ | 819 | 0333 | DOUBLE LOW LINE |
̴ | ̴ | ̴ | 820 | 0334 | TILDE OVERLAY |
̵ | ̵ | ̵ | 821 | 0335 | SHORT STROKE OVERLAY |
̶ | ̶ | ̶ | 822 | 0336 | LONG STROKE OVERLAY |
̷ | ̷ | ̷ | 823 | 0337 | SHORT SOLIDUS OVERLAY |
̸ | ̸ | ̸ | 824 | 0338 | LONG SOLIDUS OVERLAY |
̹ | ̹ | ̹ | 825 | 0339 | RIGHT HALF RING BELOW |
̺ | ̺ | ̺ | 826 | 033A | INVERTED BRIDGE BELOW |
̻ | ̻ | ̻ | 827 | 033B | SQUARE BELOW |
̼ | ̼ | ̼ | 828 | 033C | SEAGULL BELOW |
̽ | ̽ | ̽ | 829 | 033D | X ABOVE |
̾ | ̾ | ̾ | 830 | 033E | VERTICAL TILDE |
̿ | ̿ | ̿ | 831 | 033F | DOUBLE OVERLINE |
̀ | ̀ | ̀ | 832 | 0340 | GRAVE TONE MARK |
́ | ́ | ́ | 833 | 0341 | ACUTE TONE MARK |
͂ | ͂ | ͂ | 834 | 0342 | GREEK PERISPOMENI (combined with theta) |
̓ | ̓ | ̓ | 835 | 0343 | GREEK KORONIS (combined with theta) |
̈́ | ̈́ | ̈́ | 836 | 0344 | GREEK DIALYTIKA TONOS (combined with theta) |
ͅ | ͅ | ͅ | 837 | 0345 | GREEK YPOGEGRAMMENI (combined with theta) |
͆ | ͆ | ͆ | 838 | 0346 | BRIDGE ABOVE |
͇ | ͇ | ͇ | 839 | 0347 | EQUALS SIGN BELOW |
͈ | ͈ | ͈ | 840 | 0348 | DOUBLE VERTICAL LINE BELOW |
͉ | ͉ | ͉ | 841 | 0349 | LEFT ANGLE BELOW |
͊ | ͊ | ͊ | 842 | 034A | NOT TILDE ABOVE |
͋ | ͋ | ͋ | 843 | 034B | HOMOTHETIC ABOVE |
͌ | ͌ | ͌ | 844 | 034C | ALMOST EQUAL TO ABOVE |
͍ | ͍ | ͍ | 845 | 034D | LEFT RIGHT ARROW BELOW |
͎ | ͎ | ͎ | 846 | 034E | UPWARDS ARROW BELOW |
͏ | ͏ | ͏ | 847 | 034F | GRAPHEME JOINER |
͐ | ͐ | ͐ | 848 | 0350 | RIGHT ARROWHEAD ABOVE |
͑ | ͑ | ͑ | 849 | 0351 | LEFT HALF RING ABOVE |
͒ | ͒ | ͒ | 850 | 0352 | FERMATA |
͓ | ͓ | ͓ | 851 | 0353 | X BELOW |
͔ | ͔ | ͔ | 852 | 0354 | LEFT ARROWHEAD BELOW |
͕ | ͕ | ͕ | 853 | 0355 | RIGHT ARROWHEAD BELOW |
͖ | ͖ | ͖ | 854 | 0356 | RIGHT ARROWHEAD AND UP ARROWHEAD BELOW |
͗ | ͗ | ͗ | 855 | 0357 | RIGHT HALF RING ABOVE |
͘ | ͘ | ͘ | 856 | 0358 | DOT ABOVE RIGHT |
͙ | ͙ | ͙ | 857 | 0359 | ASTERISK BELOW |
͚ | ͚ | ͚ | 858 | 035A | DOUBLE RING BELOW |
͛ | ͛ | ͛ | 859 | 035B | ZIGZAG ABOVE |
͜ | ͜ | ͜ | 860 | 035C | DOUBLE BREVE BELOW |
͝ | ͝ | ͝ | 861 | 035D | DOUBLE BREVE |
͞ | ͞ | ͞ | 862 | 035E | DOUBLE MACRON |
͟ | ͟ | ͟ | 863 | 035F | DOUBLE MACRON BELOW |
͠ | ͠ | ͠ | 864 | 0360 | DOUBLE TILDE |
͡ | ͡ | ͡ | 865 | 0361 | DOUBLE INVERTED BREVE |
͢ | ͢ | ͢ | 866 | 0362 | DOUBLE RIGHTWARDS ARROW BELOW |
ͣ | ͣ | ͣ | 867 | 0363 | LATIN SMALL LETTER A |
ͤ | ͤ | ͤ | 868 | 0364 | LATIN SMALL LETTER E |
ͥ | ͥ | ͥ | 869 | 0365 | LATIN SMALL LETTER I |
ͦ | ͦ | ͦ | 870 | 0366 | LATIN SMALL LETTER O |
ͧ | ͧ | ͧ | 871 | 0367 | LATIN SMALL LETTER U |
ͨ | ͨ | ͨ | 872 | 0368 | LATIN SMALL LETTER C |
ͩ | ͩ | ͩ | 873 | 0369 | LATIN SMALL LETTER D |
ͪ | ͪ | ͪ | 874 | 036A | LATIN SMALL LETTER H |
ͫ | ͫ | ͫ | 875 | 036B | LATIN SMALL LETTER M |
ͬ | ͬ | ͬ | 876 | 036C | LATIN SMALL LETTER R |
ͭ | ͭ | ͭ | 877 | 036D | LATIN SMALL LETTER T |
ͮ | ͮ | ͮ | 878 | 036E | LATIN SMALL LETTER V |
ͯ | ͯ | ͯ | 879 | 036F | LATIN SMALL LETTER X |
三個和尚沒水å–
(Chinese Proverb)