. * Interpreted as a no-break space within. The character entities and denote an en space and an em space respectively, where an en space is half the point size and an em space is equal to the point size of the current font. For fixed pitch fonts, the user agent can treat the en space as being equivalent to a single space character, and the em space as being equuivalent to two space characters. Non-breaking Space ( ) This should be treated in the same way as the space character (ASCII character code 32 decimal), except that the user agent should never break lines at this point. It is useful when you want to ensure that neigbouring words always stay together and don't get split across lines. Hyphen * Interpreted as a hyphen glyph in all contexts. * Interpreted as a potential word space by hyphenation engine. The character entities &endash; and &emdash; denote dash marks with the same widths as the and entities respectively. ------------------------------------------------------------------------------ Control Characters Control characters are non-printable characters that are typically used for communication and device control, as format effectors, and as information separators. In SGML applications, the use of control characters is limited in order to maximize the chance of sucessful interchange over heterogenous networks and operating systems. In HTML, there are only three control characters which are used. The remaining 55 control characters are shunned and should not appear in an HTML document. The valid control characters and their interpretation are: Horizontal Tab (HT - 9 dec) * Interpreted as a word space in all contexts except. Dave Raggett Page 143 HTML 3.0 28th March 1995 * Within, the tab should be interpreted to shift the horizontal column position to the next position which is a multiple of 8 on the same line; that is, col := (col+8) mod 8. Line Feed (LF - 10 dec) * Interpreted as a word space in all contexts except. * Within, the tab should be interpreted as a shift to the start of a new line; that is, col := 0; row := row+1 Carriage Return (CR - 13 dec) * Interpreted as a word space in all contexts except. * Within, the tab should be interpreted as a shift to the start of the line; that is, col := 0; ------------------------------------------------------------------------------ Numeric Character References Any printing character within the 8-bit character encoding of ISO 8859/1 (256 character positions) or the 7-bit character encoding of ISO 646 (128 character positions) may be represented within the text of an HTML document by a numeric character reference, e.g. é is a small e with an acute accent. It is recommended that character entity references such as é are used in preference to numberic character references. Dave Raggett Page 144 HTML 3.0 28th March 1995 Security Considerations Anchors, embedded images, and all other elements which contain URIs as parameters may cause the URI to be dereferenced. In this case, the security considerations of the URI specification apply. Documents may be constructed whose visible contents mislead the reader to follow a link to unsuitable or offensive material. The MD attribute is useful when authors are concerned that a linked object may be subsequently changed to something other than intended. This attribute is used to specify a cryptographic checksum for the linked object to provide a check on its integrity. Dave Raggett Page 145 HTML 3.0 28th March 1995 Dave Raggett Page 147 HTML 3.0 28th March 1995 Character Entity Set(s) This section is undergoing revision ... --In particular, we need to add a more complete list of character entities, e.g. for the characters below decimal 128 and missing codes such as currency signs. The following entity names are used in HTML, always prefixed by ampersand (&) and followed by a semicolon as shown. They represent particular graphic characters which have special meanings in places in the markup, or may not be part of the character set available to the writer. ------------------------------------------------------------------------------ Numeric and Special Graphic Entities The following table lists each of the supported characters specified in the Numeric and Special Graphic entity set, along with its name, syntax for use, and description. This list is derived from "ISO 8879:1986//ENTITIES Numeric and Special Graphic//EN" however HTML does not provide support for the entire entity set. Only the entities listed below are supported. Name Syntax Description lt < Less than sign gt > Greater than sign amp & Ampersand quot " Double quote sign ------------------------------------------------------------------------------ ISO Latin 1 Character Entities The following table lists each of the characters specified in the Added Latin 1 entity set, along with its name, syntax for use, and description. This list is derived from "ISO 8879:1986//ENTITIES Added Latin 1//EN", and HTML does provide support for the entire entity set. Name Syntax Description Aacute Á Capital A, acute accent Agrave À Capital A, grave accent Acirc  Capital A, circumflex accent Atilde à Capital A, tilde Aring Å Capital A, ring Auml Ä Capital A, dieresis or umlaut mark AElig Æ Capital AE dipthong (ligature) Dave Raggett Page 148 HTML 3.0 28th March 1995 Ccedil Ç Capital C, cedilla Eacute É Capital E, acute accent Egrave È Capital E, grave accent Ecirc Ê Capital E, circumflex accent Euml Ë Capital E, dieresis or umlaut mark Iacute Í Capital I, acute accent Igrave Ì Capital I, grave accent Icirc Î Capital I, circumflex accent Iuml Ï Capital I, dieresis or umlaut mark ETH Ð Capital Eth, Icelandic Ntilde Ñ Capital N, tilde Oacute Ó Capital O, acute accent Ograve Ò Capital O, grave accent Ocirc Ô Capital O, circumflex accent Otilde Õ Capital O, tilde Ouml Ö Capital O, dieresis or umlaut mark Oslash Ø Capital O, slash Uacute Ú Capital U, acute accent Ugrave Ù Capital U, grave accent Ucirc Û Capital U, circumflex accent Uuml Ü Capital U, dieresis or umlaut mark Yacute Ý Capital Y, acute accent THORN Þ Capital THORN, Icelandic szlig ß Small sharp s, German (sz ligature) aacute á Small a, acute accent agrave à Small a, grave accent acirc â Small a, circumflex accent atilde ã Small a, tilde atilde ã Small a, tilde auml ä Small a, dieresis or umlaut mark aelig æ Small ae dipthong (ligature) ccedil ç Small c, cedilla eacute é Small e, acute accent egrave è Small e, grave accent ecirc ê Small e, circumflex accent euml ë Small e, dieresis or umlaut mark iacute í Small i, acute accent igrave ì Small i, grave accent icirc î Small i, circumflex accent iuml ï Small i, dieresis or umlaut mark eth ð Small eth, Icelandic ntilde ñ Small n, tilde oacute ó Small o, acute accent ograve ò Small o, grave accent ocirc ô Small o, circumflex accent otilde õ Small o, tilde ouml ö Small o, dieresis or umlaut mark oslash ø Small o, slash uacute ú Small u, acute accent ugrave ù Small u, grave accent ucirc û Small u, circumflex accent Dave Raggett Page 149 HTML 3.0 28th March 1995 uuml ü Small u, dieresis or umlaut mark yacute ý Small y, acute accent thorn þ Small thorn, Icelandic yuml ÿ Small y, dieresis or umlaut mark ------------------------------------------------------------------------------ Numerical Character References This list, sorted numerically, is derived from the ISO 8859/1 8-bit single-byte coded graphic character set: Reference Description - Unused Horizontal tab Line feed - Unused &32; Space &33; Exclamation mark &34; Quotation mark &35; Number sign &36; Dollar sign &37; Percent sign &38; Ampersand &39; Apostrophe &40; Left parenthesis &41; Right parenthesis &42; Asterisk &43; Plus sign &44; Comma &45; Hyphen &46; Period (fullstop) &47; Solidus (slash) 0 - 9 Digits 0-9 &58; Colon &59; Semi-colon &60; Less than &61; Equals aign &62; Greater than &63; Question mark &64; Commercial at A-Z Letters A-Z &91; Left square bracket &92; Reverse solidus (backslash) &93; Right square bracket &95; Horizontal bar &96; Acute accent Dave Raggett Page 150 HTML 3.0 28th March 1995 a-z Letters a-z &123; Left curly brace &124; Vertical bar &125; Right curly brace &126; Tilde - Unused &161; Inverted exclamation &162; Cent sign &163; Pound sterling &164; General currency sign &165; Yen sign &166; Broken vertical bar &167; Section sign &168; Umlaut (dieresis) &169; Copyright &170; Feminine ordinal &171; Left angle quote, guillemotleft &172; Not sign &173; Soft hyphen &174; Registered trademark &175; Macron accent &176; Degree sign &177; Plus or minus &178; Superscript two &179; Superscript three &180; Acute accent &181; Micro sign &182; Paragraph sign &183; Middle dot &184; Cedilla &185; Superscript one &186; Masculine ordinal &187; Right angle quote, guillemotright &188; Fraction one-fourth &189; Fraction one-half &190; Fraction three-fourths &191; Inverted question mark &192; Capital A, acute accent &193; Capital A, grave accent &194; Capital A, circumflex accent &195; Capital A, tilde &196; Capital A, ring &197; Capital A, dieresis or umlaut mark &198; Capital AE dipthong (ligature) &199; Capital C, cedilla &200; Capital E, acute accent &201; Capital E, grave accent &202; Capital E, circumflex accent Dave Raggett Page 151 HTML 3.0 28th March 1995 &203; Capital E, dieresis or umlaut mark &204; Capital I, acute accent &205; Capital I, grave accent &206; Capital I, circumflex accent &207; Capital I, dieresis or umlaut mark &208; Capital Eth, Icelandic &209; Capital N, tilde &210; Capital O, acute accent &211; Capital O, grave accent &212; Capital O, circumflex accent &213; Capital O, tilde &214; Capital O, dieresis or umlaut mark &215; Multiply sign &216; Capital O, slash &217; Capital U, acute accent &218; Capital U, grave accent &219; Capital U, circumflex accent &220; Capital U, dieresis or umlaut mark &221; Capital Y, acute accent &222; Capital THORN, Icelandic &223; Small sharp s, German (sz ligature) &224; Small a, acute accent &225; Small a, grave accent &226; Small a, circumflex accent &227; Small a, tilde &228; Small a, tilde &229; Small a, dieresis or umlaut mark &230; Small ae dipthong (ligature) &231; Small c, cedilla &232; Small e, acute accent &233; Small e, grave accent &234; Small e, circumflex accent &235; Small e, dieresis or umlaut mark &236; Small i, acute accent &237; Small i, grave accent &238; Small i, circumflex accent &239; Small i, dieresis or umlaut mark &240; Small eth, Icelandic &241; Small n, tilde &242; Small o, acute accent &243; Small o, grave accent &244; Small o, circumflex accent &245; Small o, tilde &246; Small o, dieresis or umlaut mark &247; Division sign &248; Small o, slash &249; Small u, acute accent Dave Raggett Page 152 HTML 3.0 28th March 1995 &250; Small u, grave accent &251; Small u, circumflex accent &252; Small u, dieresis or umlaut mark &253; Small y, acute accent &254; Small thorn, Icelandic &255; Small y, dieresis or umlaut mark Dave Raggett Page 153 HTML 3.0 28th March 1995 Math Entities This list is in a very preliminary stage ... --I hope to use ISO names where practical, and want to ensure that names are meaningful, rather than cryptic. The character codes for common fonts will be included, although which fonts to include is still under review.-- The following sets out the range of math symbols supported by HTML math, giving the HTML entity name, the corresponding LaTeX command name and a short description. Character codes are given in hexadecimal when available for the Postscript symbol set and HP's math-8 symbol set. Continuation dots - ellipsis &ldots; \ldots three dots on the baseline &cdots; \cdots three dots on same level as a minus sign &vdots; \vdots three vertical dots &ddots; \ddots diagonal dots (top left to bottom right) &dotfill; \dotfill like cdots but fills column in an array Added Spacing \, thin space &sp; \: medium space \; thick space &quad; \quad huge space Lower case Greek Letters PS-Symbol Math-8 α \alpha alpha 61 61 β \beta beta 62 62 γ \gamma gamma 67 63 δ \delta delta 64 64 ε \epsilon epsilon -- 65 &vepsilon; \varepsilon var epsilon 65 3B ζ \zeta zeta 7A 66 η \eta eta 68 67 θ \theta theta 71 68 &vtheta; \vartheta var theta -- 79 ι \iota iota 69 69 κ \kappa kappa 6B 6A λ \lambda lambda 6C 6B μ \mu mu 6D 6C ν \nu nu 6E 6D ξ \xi xi 78 6E Dave Raggett Page 154 HTML 3.0 28th March 1995 ο .... omicron 6F 6F π \pi pi 70 70 ϖ \varpi var pi 76 7B ρ \rho rho 72 71 ϱ \varrho var rho -- -- σ \sigma sigma 73 72 &vsigma; \varsigma var sigma 56 5B τ \tau tau 74 73 υ \upsilon upsilon 75 74 φ \phi phi 66 75 ϕ \varphi var phi 6A 7A χ \chi chi 63 76 ψ \psi psi 79 77 ω \omega omega 77 78 Note: LaTeX uses the latin letter o for omicron. Dave Raggett Page 155 HTML 3.0 28th March 1995 Dave Raggett Page 156 HTML 3.0 28th March 1995 ... -- > ]]> Dave Raggett Page 160 HTML 3.0 28th March 1995 ]]> ]]> %HTMLlat1; %HTMLicons; %HTMLmath; ]]> ]]> Dave Raggett Page 167 HTML 3.0 28th March 1995 ]]> ]]> Dave Raggett Page 168 HTML 3.0 28th March 1995 ]]> HeadingText ... is preferred to
Heading Text ... --> ]]> Dave Raggett Page 170 HTML 3.0 28th March 1995 ]]> Dave Raggett Page 173 HTML 3.0 28th March 1995 Dave Raggett Page 175 HTML 3.0 28th March 1995 Dave Raggett Page 177 HTML 3.0 28th March 1995 Dave Raggett Page 178 HTML 3.0 28th March 1995 Dave Raggett Page 180 HTML 3.0 28th March 1995 Dave Raggett Page 181 HTML 3.0 28th March 1995 ]]> Dave Raggett Page 182 HTML 3.0 28th March 1995 ]]> Dave Raggett Page 183 HTML 3.0 28th March 1995 Dave Raggett Page 184 HTML 3.0 28th March 1995 Glossary of Terms Pasted from HTML 2.0 spec, this is now under revision ... ------------------------------------------------------------------------------ The HTML specification uses these words with precise meanings: attribute A characteristic quality of an element, other than type or content. browser A tool used to read electronic books. document For the purposes of this standard, an HTML instance. element A component of the hierarchical structure defined by the document type definition; it is identified in a document instance by descriptive markup, usually a start-tag and an end-tag. HTML HyperText Markup Language. HTTP A generic stateless object-oriented protocol, which may be used for many similar tasks by extending the commands, or "methods", used. For example, you might use HTTP for name servers and distributed object-oriented systems, With HTTP, the negotiation of data representation allows systems to be built independent of the development of new representations. For more information see: http://www.w3.org/hypertext/WWW/Protocols/Overview.html markup Text added to the data of a document to convey information about it. There are four different kinds of markup: descriptive markup (tags), references, markup declarations, and processing Dave Raggett Page 185 HTML 3.0 28th March 1995 instructions. MIME Multipurpose Internet Mail Extensions as defined in Mechanisms for Specifying and Describing the Format of Internet Message Bodies, 09/23/1993. (Pages=81) (Format=.txt, .ps) (Obsoletes RFC1341) (Updated by RFC1590). representation The encoding of information for interchange. For example, HTML is a representation of hypertext. rendering Formatting and presenting information to human readers. SGML Standard Generalized Markup Language as defined in ISO 8879:1986, Information Processing Text and Office Systems. SGMLS An SGML parser by James Clark, jjc@jclark.com, derived from the ARCSGML parser materials which were written by Charles F. Goldfarb. The source is available at ftp.ifi.uio.no/pub/SGML/SGMLS. tag Descriptive markup. There are two kinds of tags; start-tags and end-tags. URI Universal Resource Identifiers. Available by anonymous FTP as Postscript (www.w3.org/pub/www/doc/url.ps) or text (www.w3.org/pub/www/doc/url.txt) W3 The World-Wide Web, a global information initiative. For bootstrap information, telnet www.w3.org or find documents at Dave Raggett Page 186 HTML 3.0 28th March 1995 ftp://www.w3.org/pub/www/doc Dave Raggett Page 187 HTML 3.0 28th March 1995 References Under revision .. ------------------------------------------------------------------------------ The HTML specification cites these works: HTTP HTTP: A Protocol for Networked Information. This document is available at http://www.w3.org/WWW/Protocols/HTTP/HTTP2.html. MIME N. Borenstein, N. Freed, MIME (Multipurpose Internet Mail Extensions) Part One: Mechanisms for Specifying and Describing the Format of Internet Message Bodies, 09/23/1993. (Pages=81) (Format=.txt, .ps) (Obsoletes RFC1341) (Updated by RFC1590). SGML ISO 8879:1986, Information Processing Text and Office Systems Standard Generalized Markup Language (SGML). SGMLS An SGML parser by James Clark, jjc@jclark.com, derived from the ARCSGML parser materials which were written by Charles F. Goldfarb. The source is available at ftp.ifi.uio.no/pub/SGML/SGMLS. URI Universal Resource Identifiers. RFCxxx. Available by anonymous FTP as Postscript (info.cern.ch/pub/www/doc/url.ps) or text (info.cern.ch/pub/www/doc/url.txt) W3 The World-Wide Web , a global information initiative. For bootstrap information, telnet info.cern.ch or find documents by ftp://info.cern.ch/pub/www/doc. Dave Raggett Page 188 HTML 3.0 28th March 1995 Acknowledgments Pasted from HTML 2.0 spec, this section is under revision ... ------------------------------------------------------------------------------ The HTML document type was designed by Tim Berners-Lee at CERN as part of the 1990 World-Wide Web project. In 1992, Dan Connolly wrote the HTML Document Type Definition (DTD) and a brief HTML specification. Since 1993, a wide variety of Internet participants have contributed to the evolution of HTML. NCSA Mosaic played a particularly important role in establishing HTML. Mosaic pioneered the addition of in-line images, image maps, nested lists and fill-out forms (derived from work on HTML+). Minor variations in the way extensions were supported by different browsers eventually led to the setting up of the HTML working group. The HTML 2.0 specification sets out a definitive standard for HTML, formalizing the de facto situation during 1994. HTML+ was the result of my work on possible directions for extending HTML to meet the needs of information providers, e.g. to support forms, tables, text flow around figures and math. This work has now culminated in the current HTML 3.0 specification, which adds a range of important new features to HTML while preserving simplicity and backwards compatibility with existing documents. I would like to express my special thanks to members of the Internet community on the www-talk, www-html and html-wg mailing lists; to people who have written to me in person, and to members of the SGML-Open who have been very supportive of the Web initiative. Thanks also to Hewlett Packard for funding my work on HTML. Particular thanks are due to: * Terry Allen; O'Reilly & Associates; terry@ora.com * Marc Andreessen; Netscape Communications Corp; marca@netscape.com * Eric Bina; Netscape Communications Corp; ebina@netscape.com * Paul Burchard; The Geometry Center, University of Minnesota; burchard@geom.umn.edu * James Clark; jjc@jclark.com * Daniel W. Connolly; HaL Computer Systems; connolly@hal.com * Stephen DeRose; EBT; ??? steve@ebt.com * Roy Fielding; University of California, Irvine; fielding@ics.uci.edu Dave Raggett Page 189 HTML 3.0 28th March 1995 * Jay Glicksman; Enterprise Integration Technology; jay@eit.com * Eduardo Gutentag; Sun Microsystems; eduardo@Eng.Sun.com * Bill Hefley; Software Engineering Institute, Carnegie Mellon University; weh@sei.cmu.edu * Chung-Jen Ho; Xerox Corporation; cho@xsoft.xerox.com * Mike Knezovich; Spyglass, Inc.; mike@spyglass.com * Tim Berners-Lee; CERN; timbl@info.cern.ch * Tom Magliery; NCSA; mag@ncsa.uiuc.edu * Murray Maloney; SCO Canada; murray@sco.com * Larry Masinter; Xerox Palo Alto Research Center; masinter@parc.xerox.com * Karen Olson Muldrow; HaL Computer Systems; karen@hal.com * Bill Perry, Spry, Inc., wmperry@spry.com * E. Corprew Reed; Cold Spring Harbor Laboratory; corp@cshl.org * Yuri Rubinsky; SoftQuad, Inc.; yuri@sq.com * Eric Schieler; Spyglass, Inc.; eschieler@spyglass.com * Eric Severson; Avalanche, Inc.; ??? severson@avalanche.com * Eric W. Sink; Spyglass, Inc.; eric@spyglass.com * Stuart Weibel; OCLC Office of Research; weibel@oclc.org * Chris Wilson; Spry, Inc.; cwilson@spry.com Dave Raggett , February 1995. Dave Raggett Page 190
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.3