Home > Articles > XHTML 1.0 Special Character References

XHTML 1.0 Special Character References

skip to navigation

This section of the site features articles published between 2002 and 2004. They remain here for reference purposes and may contain information that is out of date.

Article Index


This article details the special character references and corresponding entity references available to your XHTML 1.0 documents. This is the second of a series of three articles detailing each of the three sets laid out in the XHTML DTDs.

An explanation of character references will be given in part two of my article XHTML Web Design for Beginners. I decided to release these tables as separate articles in order to make them easily accessible to those who don't want to read the series.

What the columns mean

The first column gives the named version of the entity, the second is the named entity itself. This allows you to see if that character reference works in the browser you are using to view this document via the named entity.

The third column gives the numeric entity, the fourth is the numeric entity itself. This allows you to see if that character reference works in the browser you are using to view this document via the numeric entity.

The fifth column is the description of the entity as found in the XHTML 1.0 DTDs at http://www.w3.org/TR/xhtml1/dtds.html#a_dtd_Special_characters.

Entity Reference Test Character Reference Test Description
C0 Controls and Basic Latin
" " " " quotation mark, U+0022 ISOnum
& & & & ampersand, U+0026 ISOnum
< < < < less-than sign, U+003C ISOnum
> > > > greater-than sign, U+003E ISOnum
' ' ' ' apostrophe = APL quote, U+0027 ISOnum
Latin Extended-A
ΠΠΠΠlatin capital ligature OE, U+0152 ISOlat2
œ œ œ œ latin small ligature oe, U+0153 ISOlat2
note: ligature is a misnomer, this is a separate character in some languages
Š Š Š Š latin capital letter S with caron, U+0160 ISOlat2
š š š š latin small letter s with caron, U+0161 ISOlat2
Ÿ Ÿ Ÿ Ÿ latin capital letter Y with diaeresis, U+0178 ISOlat2
Spacing Modifier Letters
ˆ ˆ ˆ ˆ modifier letter circumflex accent, U+02C6 ISOpub
˜ ˜ ˜ ˜ small tilde, U+02DC ISOdia
General Punctuation
en space, U+2002 ISOpub
em space, U+2003 ISOpub
thin space, U+2009 ISOpub
zero width non-joiner, U+200C NEW RFC 2070
zero width joiner, U+200D NEW RFC 2070
left-to-right mark, U+200E NEW RFC 2070
right-to-left mark, U+200F NEW RFC 2070
- - en dash, U+2013 ISOpub
em dash, U+2014 ISOpub
left single quotation mark, U+2018 ISOnum
right single quotation mark, U+2019 ISOnum
single low-9 quotation mark, U+201A NEW
left double quotation mark, U+201C ISOnum
right double quotation mark, U+201D ISOnum
double low-9 quotation mark, U+201E NEW
dagger, U+2020 ISOpub
double dagger, U+2021 ISOpub
per mille sign, U+2030 ISOtech
single left-pointing angle quotation mark, U+2039 ISO proposed
note:lsaquo is proposed but not yet ISO standardized
single right-pointing angle quotation mark, U+203A ISO proposed
note:rsaquo is proposed but not yet ISO standardized
Currency Symbols
euro sign, U+20AC NEW

Article Index