Character Entities

As discussed earlier, you can specify special characters (tag delimiters and non-ASCII) either by number or by name (names are case-sensitive). For example, you can specify the ampersand character & as either &#38; (a numeric code may have up to 5 decimal digits) or &amp;. Newt's Cape generally requires the ; (or a space) delimiter; some browsers apparently do not. You can also use 4-digit hex notation for specific (unusual) Unicode characters; for example, on the Newton one such code appears as the Apple symbol in the Espy font. If a character code is embedded directly in the file (in the "unused" areas, in the extended (8-bit) ASCII range, or as 16-bit Unicode, e.g., Japanese), you may need an Encoding plugin, e.g., ISO-8859-1, to translate and display it properly. If you find "square boxes" appearing where there should be characters, let me know the URL and I will check it on the desktop and on the Newton; it's possible that the Newton does not have that character in its font, or that an encoding needs to be used (or fixed).
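The rules above (number or case-sensitive name, terminated by ; or a space) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Newt's Cape itself is implemented: the function name decode_entities is made up for this example, the name table is Python's standard HTML 4 list rather than the Newton's, the hex form is assumed to be written &#x...; as in standard HTML, and the sketch does not model which characters the Newton fonts actually contain.

```python
import re
from html.entities import name2codepoint  # HTML 4 names; lookup is case-sensitive

# Assumption: a reference is "&" + (decimal code | hex code | name),
# ended by ";" or followed by a space (the space itself is kept).
_ENTITY = re.compile(
    r"&(#\d{1,5}|#[xX][0-9A-Fa-f]{1,4}|[A-Za-z][A-Za-z0-9]*)(;|(?= ))"
)


def decode_entities(text: str) -> str:
    """Replace character references with characters; leave unknown ones untouched."""

    def replace(match: re.Match) -> str:
        body = match.group(1)
        if body[:2].lower() == "#x":      # hex notation, up to 4 hex digits
            return chr(int(body[2:], 16))
        if body.startswith("#"):          # decimal notation, up to 5 digits
            return chr(int(body[1:]))
        code = name2codepoint.get(body)   # exact-case name lookup
        return chr(code) if code is not None else match.group(0)

    return _ENTITY.sub(replace, text)
```

With this sketch, decode_entities("&#38;") and decode_entities("&amp;") both give "&", while an unknown or wrongly-cased name is passed through unchanged.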

HTML Characters (ISO-8859-1)

Following is a complete list of the HTML Coded Character Set based on the ISO-8859-1 standard (as we understand it; still checking). First is the character code (item number), followed by the character itself, specified either directly or by numeric code (?? if unused). If the character has a symbolic name, it is then specified symbolically, followed by its [name]. A character description may be followed by "??" if there seems to be no Newton equivalent. The most interesting codes/names are 34, 38, 60, 62, and 160-255.

We have included a few browser-specific/de facto standard characters: 134, 135, 136, 137, 140, 153, 156, 159. At the end, we have included a few supported HTML 4.0 characters for which there are Newton equivalents.
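As a rough desktop cross-check of the browser-specific codes, the sketch below decodes each one both as ISO-8859-1 and as Windows-1252. It assumes that the "de facto standard" in question is Windows-1252 (cp1252), which this document does not state; the helper name describe is hypothetical.

```python
# Codes listed above as browser-specific/de facto standard characters.
DE_FACTO_CODES = [134, 135, 136, 137, 140, 153, 156, 159]


def describe(code):
    """Return (ISO-8859-1 decoding, Windows-1252 decoding) of a single byte."""
    raw = bytes([code])
    # Assumption: cp1252 is the de facto mapping browsers apply to 128-159.
    return raw.decode("iso-8859-1"), raw.decode("cp1252")


for code in DE_FACTO_CODES:
    latin1, cp1252 = describe(code)
    # In ISO-8859-1 these bytes are control characters, so show their repr.
    print(code, repr(latin1), cp1252)
```

Under that assumption, the ISO-8859-1 column shows only control characters, while the cp1252 column shows the printable characters given for these codes in the list.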

  0-8: unused
  9. horizontal tab
  10. line feed
  11-12: unused
  13. carriage return or <BR>
  14-31: unused
  32. space
  33. ! exclamation mark
  34. " " [quot] quotation mark
  35. # number sign
  36. $ dollar sign
  37. % percent
  38. & & [amp] ampersand
  39. ' apostrophe
  40. ( left parenthesis
  41. ) right parenthesis
  42. * asterisk
  43. + plus sign
  44. , comma
  45. - hyphen
  46. . period
  47. / slash
  48-57: 0-9
  58. : colon
  59. ; semicolon
  60. < < [lt] less than
  61. = equals
  62. > > [gt] greater than
  63. ? question mark
  64. @ at sign
  65-90: A-Z
  91. [ left square bracket
  92. \ backslash
  93. ] right square bracket
  94. ^ caret
  95. _ underscore
  96. ` grave accent
  97-122: a-z
  123. { left curly brace
  124. | vertical bar
  125. } right curly brace
  126. ~ tilde
  127. ??
  128. € ??
  129. ??
  130. ‚ ‚ [sbquo]
  131. ƒ ƒ [fnof]
  132. „ „ [bdquo]
  133. … … [hellip] horiz ellipsis
  134. † † [dagger]
  135. ‡ ‡ [Dagger] double dagger
  136. ˆ ˆ [circumflex]
  137. ‰ per mille sign (salinity)
  138. Š ??
  139. ‹ ‹ [lsaquo]
  140. Œ Œ [OElig] OE diphthong
  141. ??
  142. Ž ??
  143. ??
  144. ??
  145. ‘ ‘ [lsquo]
  146. ’ ’ [rsquo]
  147. “ “ [ldquo]
  148. ” ” [rdquo]
  149. • • [bull] bullet
  150. – – [ndash]
  151. — — [mdash]
  152. ˜ small tilde (minutes??)
  153. ™ ™ [trade] trademark
  154. š ??
  155. › › [rsaquo]
  156. œ œ [oelig] oe diphthong
  157. ??
  158. ž ??
  159. Ÿ Ÿ [Yuml] Y umlaut ??
  160.     [nbsp] no-break space??
  161. ¡ ¡ [iexcl] inverted !
  162. ¢ ¢ [cent] cent sign
  163. £ £ [pound] pound sterling sign
  164. ¤ ¤ [curren] general currency sign
  165. ¥ ¥ [yen] yen sign
  166. ¦ ¦ [brvbar] broken (vertical) bar??
  167. § § [sect] section sign
  168. ¨ ¨ [uml] spacing dieresis (umlaut)
  169. © © [copy] copyright sign
  170. ª ª [ordf] ordinal indicator, feminine
  171. « « [laquo] angle quote mark, left
  172. ¬ ¬ [not] not sign
  173. ­ ­ [shy] soft hyphen??
  174. ® ® [reg] circled R registered sign
  175. ¯ ¯ [macr] spacing macron (hibar?)
  176. ° ° [deg] degree sign
  177. ± ± [plusmn] plus-or-minus sign
  178. ² ² [sup2] superscript 2??
  179. ³ ³ [sup3] superscript 3??
  180. ´ ´ [acute] acute accent
  181. µ µ [micro] micro sign
  182. ¶ ¶ [para] pilcrow (paragraph sign)
  183. · · [middot] middle dot
  184. ¸ ¸ [cedil] spacing cedilla
  185. ¹ ¹ [sup1] superscript 1??
  186. º º [ordm] ordinal indicator, masculine
  187. » » [raquo] angle quote mark, right
  188. ¼ ¼ [frac14] fraction one-quarter??
  189. ½ ½ [frac12] fraction one-half??
  190. ¾ ¾ [frac34] fraction three-quarters??
  191. ¿ ¿ [iquest] inverted question mark
  192. À À [Agrave] A, grave accent
  193. Á Á [Aacute] A, acute accent
  194. Â Â [Acirc] A, circumflex accent
  195. Ã Ã [Atilde] A, tilde
  196. Ä Ä [Auml] A, dieresis (umlaut) mark
  197. Å Å [Aring] A, ring
  198. Æ Æ [AElig] AE, diphthong (ligature)
  199. Ç Ç [Ccedil] C, cedilla
  200. È È [Egrave] E, grave accent
  201. É É [Eacute] E, acute
  202. Ê Ê [Ecirc] E, circumflex
  203. Ë Ë [Euml] E, umlaut
  204. Ì Ì [Igrave] I, grave
  205. Í Í [Iacute] I, acute
  206. Î Î [Icirc] I, circumflex
  207. Ï Ï [Iuml] I, umlaut
  208. Ð Ð [ETH] Eth, Icelandic??
  209. Ñ Ñ [Ntilde] N, tilde
  210. Ò Ò [Ograve] O, grave
  211. Ó Ó [Oacute] O, acute
  212. Ô Ô [Ocirc] O, circumflex
  213. Õ Õ [Otilde] O, tilde
  214. Ö Ö [Ouml] O, umlaut
  215. × × [times] multiply sign??
  216. Ø Ø [Oslash] O, slash
  217. Ù Ù [Ugrave] U, grave
  218. Ú Ú [Uacute] U, acute
  219. Û Û [Ucirc] U, circumflex
  220. Ü Ü [Uuml] U, umlaut
  221. Ý Ý [Yacute] Y, acute??
  222. Þ Þ [THORN] THORN, Icelandic??
  223. ß ß [szlig] sharp s, German (sz ligature)
  224. à à [agrave] a, grave
  225. á á [aacute] a, acute
  226. â â [acirc] a, circumflex
  227. ã ã [atilde] a, tilde
  228. ä ä [auml] a, umlaut
  229. å å [aring] a, ring
  230. æ æ [aelig] ae, diphthong (ligature)
  231. ç ç [ccedil] c, cedilla
  232. è è [egrave] e, grave
  233. é é [eacute] e, acute
  234. ê ê [ecirc] e, circumflex
  235. ë ë [euml] e, umlaut
  236. ì ì [igrave] i, grave
  237. í í [iacute] i, acute
  238. î î [icirc] i, circumflex
  239. ï ï [iuml] i, umlaut
  240. ð ð [eth] eth, Icelandic??
  241. ñ ñ [ntilde] n, tilde
  242. ò ò [ograve] o, grave
  243. ó ó [oacute] o, acute
  244. ô ô [ocirc] o, circumflex
  245. õ õ [otilde] o, tilde
  246. ö ö [ouml] o, umlaut
  247. ÷ ÷ [divide] divide sign
  248. ø ø [oslash] o, slash
  249. ù ù [ugrave] u, grave
  250. ú ú [uacute] u, acute
  251. û û [ucirc] u, circumflex
  252. ü ü [uuml] u, umlaut
  253. ý ý [yacute] y, acute??
  254. þ þ [thorn] thorn, Icelandic??
  255. ÿ ÿ [yuml] y, umlaut

For fun, here are a few characters that should NOT map: &#256; (out of range), &#999; (out of range), &foobar; (unknown), &AMP; (different case).

Supported HTML 4.0 Entities

For More Info

This document (in all its formats) is © 1995-2007. Steve Weyer, Greg Simon. All Rights Reserved Worldwide

Version 2.1. Last updated: Dec 2000