HTML/entity: Difference between revisions

from HTYP, the free directory anyone can edit if they can prove to me that they're not a spambot
m Woozle moved page HTML entity to HTML/entity: reorganizing
No edit summary
 
Line 7: Line 7:
==About==
==About==
An [[HTML entity]] is a combination of characters which will be displayed within an [[HTML]] browser as a single character. All HTML entities begin with "&amp;" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&amp;amp;", which displays an ampersand); any possible character can be displayed as an entity by using the format "&amp;#<u>number</u>;" where <u>number</u> is the output character's [[ASCII]] value in decimal.
An [[HTML entity]] is a combination of characters which will be displayed within an [[HTML]] browser as a single character. All HTML entities begin with "&amp;" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&amp;amp;", which displays an ampersand); any possible character can be displayed as an entity by using the format "&amp;#<u>number</u>;" where <u>number</u> is the output character's [[ASCII]] value in decimal.
 
==Pages==
''see also: [[wikipedia:Percent-encoding]]''
* [[/list]]: list of some of the most useful entities
==Reference==
==Reference==
* {{wikipedia|Character entity reference}}
* {{wikipedia|Character entity reference}}
** see also: {{l/wp|Percent-encoding}}
* [http://htmlhelp.com/reference/html40/entities/ HTML 4.0 entities]
* [http://htmlhelp.com/reference/html40/entities/ HTML 4.0 entities]
* [https://developers.whatwg.org/named-character-references.html#named-character-references Named character references], in alphabetical order, including newer ones not supported by most browsers
* [https://developers.whatwg.org/named-character-references.html#named-character-references Named character references], in alphabetical order, including newer ones not supported by most browsers
* [https://www.freeformatter.com/html-entities.html HTML Entity List]: "Complete list of HTML entities with their numbers and names. Also included is a full list of ASCII characters that can be represented in HTML (i.e. printable characters)."
* [https://www.freeformatter.com/html-entities.html HTML Entity List]: "Complete list of HTML entities with their numbers and names. Also included is a full list of ASCII characters that can be represented in HTML (i.e. printable characters)."
==Questions==
==Questions==
* Does [[Google]] resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page using google by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: '''&#70;&#82;&#69;&#76;&#71;&#75;&#76;&#79;&#84;&#90;''')
* Do search engines resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: '''&#70;&#82;&#69;&#76;&#71;&#75;&#76;&#79;&#84;&#90;''')
** '''2007-07-15 answer''': Yes, it does find it (though it took several months at least before this page got indexed).
** '''2007-07-15 answer''': Yes, it does find it (though it took several months at least before this page got indexed).
** '''2016-11-28 answer''': Searching for the word in question no longer produces any results. Neither does searching for "schmerglefrotz".
** '''2016-11-28 answer''': Searching for the word in question no longer produces any results. Neither does searching for "schmerglefrotz".
==Note==
==Note==
* '''2017-11-11''' MediaWiki 1.28.0 apparently no longer supports the "&amp;#d;" style of entity; you now have to use named entities.
* '''2017-11-11''' MediaWiki 1.28.0 apparently no longer supports the "&amp;#d;" style of entity; you now have to use named entities.
* '''2020-05-23''' This seems to have been fixed?
* '''2020-05-23''' This seems to have been fixed?
==Entities by Number==
* 0-8 are not translated
* 9 is probably TAB
* 10 is probably [[linefeed|LF]]
* 13 is probably [[carriage return|CR]]
* 14-31 are not translated
* 32 is a standard space
{| width=100%
! colspan=3 style="border-bottom: 1px solid blue;" | lower ASCII || colspan=4 style="border-bottom: 1px solid green;" | upper ASCII
|-
| style="border:#0d0 1px solid; padding: 2px;" |
{|
{{show/entity/row|#33}}
{{show/entity/row|#34}}
{{show/entity/row|#35}}
{{show/entity/row|#36}}
{{show/entity/row|#37}}
{{show/entity/row|#38|{{show/entity|amp}}}}
{{show/entity/row|#39}}
{{show/entity/row|#40}}
{{show/entity/row|#41}}
{{show/entity/row|#42}}
{{show/entity/row|#43}}
{{show/entity/row|#44}}
{{show/entity/row|#45}}
{{show/entity/row|#46}}
{{show/entity/row|#47}}
{{show/entity/row|#48}}
{{show/entity/row|#49}}
{{show/entity/row|#50}}
{{show/entity/row|#51}}
{{show/entity/row|#52}}
{{show/entity/row|#53}}
{{show/entity/row|#54}}
{{show/entity/row|#55}}
{{show/entity/row|#56}}
{{show/entity/row|#57}}
{{show/entity/row|#58}}
{{show/entity/row|#59}}
{{show/entity/row|#60}}
{{show/entity/row|#61}}
{{show/entity/row|#62}}
{{show/entity/row|#63}}
{{show/entity/row|#64}}
|}
| style="border:#0d0 1px solid; padding: 2px;" |
{|
{{show/entity/row|#65}}
{{show/entity/row|#66}}
{{show/entity/row|#67}}
{{show/entity/row|#68}}
{{show/entity/row|#69}}
{{show/entity/row|#70}}
{{show/entity/row|#71}}
{{show/entity/row|#72}}
{{show/entity/row|#73}}
{{show/entity/row|#74}}
{{show/entity/row|#75}}
{{show/entity/row|#76}}
{{show/entity/row|#77}}
{{show/entity/row|#78}}
{{show/entity/row|#79}}
{{show/entity/row|#80}}
{{show/entity/row|#81}}
{{show/entity/row|#82}}
{{show/entity/row|#83}}
{{show/entity/row|#84}}
{{show/entity/row|#85}}
{{show/entity/row|#86}}
{{show/entity/row|#87}}
{{show/entity/row|#88}}
{{show/entity/row|#89}}
{{show/entity/row|#90}}
{{show/entity/row|#91}}
{{show/entity/row|#92}}
{{show/entity/row|#93}}
{{show/entity/row|#94}}
{{show/entity/row|#95}}
{{show/entity/row|#96}}
|}
| style="border:#0d0 1px solid; padding: 2px;" |
{|
{{show/entity/row|#97}}
{{show/entity/row|#98}}
{{show/entity/row|#99}}
{{show/entity/row|#100}}
{{show/entity/row|#101}}
{{show/entity/row|#102}}
{{show/entity/row|#103}}
{{show/entity/row|#104}}
{{show/entity/row|#105}}
{{show/entity/row|#106}}
{{show/entity/row|#107}}
{{show/entity/row|#108}}
{{show/entity/row|#109}}
{{show/entity/row|#110}}
{{show/entity/row|#111}}
{{show/entity/row|#112}}
{{show/entity/row|#113}}
{{show/entity/row|#114}}
{{show/entity/row|#115}}
{{show/entity/row|#116}}
{{show/entity/row|#117}}
{{show/entity/row|#118}}
{{show/entity/row|#119}}
{{show/entity/row|#120}}
{{show/entity/row|#121}}
{{show/entity/row|#122}}
{{show/entity/row|#123}}
{{show/entity/row|#124}}
{{show/entity/row|#125}}
{{show/entity/row|#126}}
{{show/entity/row|#127}}
{{show/entity/row|#128}}
|}
| style="border:#0d0 1px solid; padding: 2px;" |
{|
{{show/entity/row|#129}}
{{show/entity/row|#130}}
{{show/entity/row|#131}}
{{show/entity/row|#132}}
{{show/entity/row|#133}}
{{show/entity/row|#134}}
{{show/entity/row|#135}}
{{show/entity/row|#136}}
{{show/entity/row|#137}}
{{show/entity/row|#138}}
{{show/entity/row|#139}}
{{show/entity/row|#140}}
{{show/entity/row|#141}}
{{show/entity/row|#142}}
{{show/entity/row|#143}}
{{show/entity/row|#144}}
{{show/entity/row|#145}}
{{show/entity/row|#146}}
{{show/entity/row|#147}}
{{show/entity/row|#148}}
{{show/entity/row|#149}}
{{show/entity/row|#150}}
{{show/entity/row|#151}}
{{show/entity/row|#152}}
{{show/entity/row|#153}}
{{show/entity/row|#154}}
{{show/entity/row|#155}}
{{show/entity/row|#156}}
{{show/entity/row|#157}}
{{show/entity/row|#158}}
{{show/entity/row|#159}}
{{show/entity/row|#160}}
|}
| style="border:#0d0 1px solid; padding: 2px;" |
{|
{{show/entity/row|#161}}
{{show/entity/row|#162}}
{{show/entity/row|#163|{{show/entity|pound}}}}
{{show/entity/row|#164}}
{{show/entity/row|#165}}
{{show/entity/row|#166}}
{{show/entity/row|#167}}
{{show/entity/row|#168}}
{{show/entity/row|#169}}
{{show/entity/row|#170}}
{{show/entity/row|#171}}
{{show/entity/row|#172}}
{{show/entity/row|#173}}
{{show/entity/row|#174}}
{{show/entity/row|#175}}
{{show/entity/row|#176}}
{{show/entity/row|#177}}
{{show/entity/row|#178}}
{{show/entity/row|#179}}
{{show/entity/row|#180}}
{{show/entity/row|#181}}
{{show/entity/row|#182}}
{{show/entity/row|#183}}
{{show/entity/row|#184}}
{{show/entity/row|#185}}
{{show/entity/row|#186}}
{{show/entity/row|#187}}
{{show/entity/row|#188}}
{{show/entity/row|#189}}
{{show/entity/row|#190}}
{{show/entity/row|#191}}
{{show/entity/row|#192|{{show/entity|Agrave}}}}
|}
| style="border:#0d0 1px solid; padding: 2px;" |
{|
{{show/entity/row|#193|{{show/entity|Aacute}}}}
{{show/entity/row|#194}}
{{show/entity/row|#195|{{show/entity|Atilde}}}}
{{show/entity/row|#196|{{show/entity|Auml}}}}
{{show/entity/row|#197}}
{{show/entity/row|#198|{{show/entity|AElig}}}}
{{show/entity/row|#199|{{show/entity|Ccedil}}}}
{{show/entity/row|#200}}
{{show/entity/row|#201}}
{{show/entity/row|#202}}
{{show/entity/row|#203}}
{{show/entity/row|#204}}
{{show/entity/row|#205}}
{{show/entity/row|#206}}
{{show/entity/row|#207}}
{{show/entity/row|#208}}
{{show/entity/row|#209}}
{{show/entity/row|#210}}
{{show/entity/row|#211}}
{{show/entity/row|#212}}
{{show/entity/row|#213}}
{{show/entity/row|#214}}
{{show/entity/row|#215}}
{{show/entity/row|#216}}
{{show/entity/row|#217}}
{{show/entity/row|#218}}
{{show/entity/row|#219}}
{{show/entity/row|#220}}
{{show/entity/row|#221}}
{{show/entity/row|#222}}
{{show/entity/row|#223}}
{{show/entity/row|#224}}
|}
| style="border:#0d0 1px solid; padding: 2px;" |
{|
{{show/entity/row|#225}}
{{show/entity/row|#226}}
{{show/entity/row|#227}}
{{show/entity/row|#228}}
{{show/entity/row|#229}}
{{show/entity/row|#230}}
{{show/entity/row|#231|{{show/entity|ccedil}}}}
{{show/entity/row|#232}}
{{show/entity/row|#233}}
{{show/entity/row|#234}}
{{show/entity/row|#235}}
{{show/entity/row|#236}}
{{show/entity/row|#237}}
{{show/entity/row|#238}}
{{show/entity/row|#239}}
{{show/entity/row|#240}}
{{show/entity/row|#241}}
{{show/entity/row|#242}}
{{show/entity/row|#243}}
{{show/entity/row|#244}}
{{show/entity/row|#245}}
{{show/entity/row|#246}}
{{show/entity/row|#247}}
{{show/entity/row|#248}}
{{show/entity/row|#249}}
{{show/entity/row|#250}}
{{show/entity/row|#251}}
{{show/entity/row|#252}}
{{show/entity/row|#253}}
{{show/entity/row|#254|{{show/entity|thorn}}}}
{{show/entity/row|#255}}
{{show/entity/row|#256}}
|}
|}
===Notes===
* '''2024-08-31''' <code>&#153;</code> used to map to {{show/entity|trade}}. That is either browser-specific or is no longer the case.
==Entities by Appearance==
{| class="wikitable sortable"
! entity || shows || description
{{show/entity/row|gt|greater than}}
{{show/entity/row|lt|less than}}
{{show/entity/row|ge|greater than or equal to}}
{{show/entity/row|le|less than or equal to}}
{{show/entity/row|ne|not equal to}}
{{show/entity/row|larr|left arrow}}
{{show/entity/row|rarr|right arrow}}
{{show/entity/row|lsquo|left single-quote}}
{{show/entity/row|rsquo|right single-quote}}
{{show/entity/row|trade|trademark symbol (superscripted)}}
{{show/entity/row|copy|copyright-circle symbol}}
{{show/entity/row|reg|registered-circle symbol}}
{{show/entity/row|Alpha|Greek letter alpha (upper)}}
{{show/entity/row|Beta|Greek letter beta (upper)}}
{{show/entity/row|Gamma|Greek letter gamma (upper)}}
{{show/entity/row|Delta|Greek letter delta (upper)}}
{{show/entity/row|alpha|Greek letter alpha (lower)}}
{{show/entity/row|beta|Greek letter beta (lower)}}
{{show/entity/row|gamma|Greek letter gamma (lower)}}
{{show/entity/row|delta|Greek letter delta (lower)}}
|}
===missing===
* Is there no entity for "equals"? It's not &eq;, &equ;, &equal;, or &equals;. The only thing that seems to work is the ASCII numeric, {{show/entity|#61}}.
===Roman alphabet===
{| class="wikitable sortable"
|-
! upper || lower || mnemonics
|-
| {{show/entity|#193}} - '''A''' with acute accent
| {{show/entity|#225}} - '''a''' with acute accent
| {{show/entity|Aacute}} {{show/entity|aacute}}
|-
|
| {{show/entity|#257}} - '''a''' with {{l/wp|Macron (diacritic)|macron}}
|-
|
| {{show/entity|#259}} - '''a''' with {{l/wp|caron}}
|-
| {{show/entity|#268}} - '''C''' with {{l/wp|caron}}
| {{show/entity|#269}} - '''c''' with {{l/wp|caron}}
|}
===missing===
* '''2024-08-31''' {{show/entity|Ccaron}} and {{show/entity|ccaron}} are apparently no longer supported, if they ever existed and were not just a figment of my imagination.
==Links==
* {{wikipedia|List of XML and HTML character entity references}} - a more complete list

Latest revision as of 16:14, 6 February 2025

<hide> page type::article page type::reference thing type::character format </hide>

About

An HTML entity is a combination of characters which will be displayed within an HTML browser as a single character. All HTML entities begin with "&" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&amp;", which displays an ampersand); any possible character can be displayed as an entity by using the format "&#number;" where number is the output character's ASCII value in decimal.

Pages

  • /list: list of some of the most useful entities

Reference

Questions

  • Do search engines resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: FRELGKLOTZ)
    • 2007-07-15 answer: Yes, it does find it (though it took several months at least before this page got indexed).
    • 2016-11-28 answer: Searching for the word in question no longer produces any results. Neither does searching for "schmerglefrotz".

Note

  • 2017-11-11 MediaWiki 1.28.0 apparently no longer supports the "&#d;" style of entity; you now have to use named entities.
  • 2020-05-23 This seems to have been fixed?