Difference between revisions of "HTML entity"
Jump to navigation
Jump to search
(→Questions: answer) |
(→Entities by Number: update: MW compatibility) |
||
(12 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
− | + | <hide> | |
− | [[category:computer terminology]]An [[HTML entity]] is a combination of characters which will be displayed within an [[HTML]] browser as a single character. All HTML entities begin with "&" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&amp;", which displays an ampersand); any possible character can be displayed as an entity by using the format "&#<u>number</u>;" where <u>number</u> is the output character's [[ASCII]] value in decimal. | + | [[page type::article]] |
+ | [[page type::reference]] | ||
+ | [[thing type::character format]] | ||
+ | [[category:computer terminology]] | ||
+ | </hide> | ||
+ | ==About== | ||
+ | An [[HTML entity]] is a combination of characters which will be displayed within an [[HTML]] browser as a single character. All HTML entities begin with "&" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&amp;", which displays an ampersand); any possible character can be displayed as an entity by using the format "&#<u>number</u>;" where <u>number</u> is the output character's [[ASCII]] value in decimal. | ||
+ | ''see also: [[wikipedia:Percent-encoding]]'' | ||
==Reference== | ==Reference== | ||
* {{wikipedia|Character entity reference}} | * {{wikipedia|Character entity reference}} | ||
* [http://htmlhelp.com/reference/html40/entities/ HTML 4.0 entities] | * [http://htmlhelp.com/reference/html40/entities/ HTML 4.0 entities] | ||
+ | * [https://developers.whatwg.org/named-character-references.html#named-character-references Named character references], in alphabetical order, including newer ones not supported by most browsers | ||
+ | * [https://www.freeformatter.com/html-entities.html HTML Entity List]: "Complete list of HTML entities with their numbers and names. Also included is a full list of ASCII characters that can be represented in HTML (i.e. printable characters)." | ||
+ | |||
==Questions== | ==Questions== | ||
* Does [[Google]] resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page using google by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: '''FRELGKLOTZ''') | * Does [[Google]] resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page using google by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: '''FRELGKLOTZ''') | ||
** '''2007-07-15 answer''': Yes, it does find it (though it took several months at least before this page got indexed). | ** '''2007-07-15 answer''': Yes, it does find it (though it took several months at least before this page got indexed). | ||
+ | ** '''2016-11-28 answer''': Searching for the word in question no longer produces any results. Neither does searching for "schmerglefrotz". | ||
− | == | + | ==Note== |
+ | '''2017-11-11''' MediaWiki 1.28.0 apparently no longer supports the "&#d;" style of entity; you now have to use named entities. | ||
+ | ==Entities by Number== | ||
+ | * 0-8 are not translated | ||
+ | * 9 is probably TAB | ||
+ | * 10 is probably [[linefeed|LF]] | ||
+ | * 13 is probably [[carriage return|CR]] | ||
+ | * 14-31 are not translated | ||
+ | * 32 is a standard space | ||
+ | {| width=100% | ||
+ | ! colspan=3 style="border-bottom: 1px solid blue;" | lower ASCII || colspan=4 style="border-bottom: 1px solid green;" | upper ASCII | ||
+ | |- | ||
+ | | | ||
{| | {| | ||
|- | |- | ||
− | + | | ! || &#33; | |
− | |||
− | | ! || &#33; | ||
|- | |- | ||
| " || &#34; | | " || &#34; | ||
Line 24: | Line 45: | ||
| % || &#37; | | % || &#37; | ||
|- | |- | ||
− | | & || &#38; | + | | & || &#38; &amp; |
|- | |- | ||
| ' || &#39; | | ' || &#39; | ||
Line 78: | Line 99: | ||
| @ || &#64; | | @ || &#64; | ||
|} | |} | ||
+ | |||
+ | | | ||
+ | |||
{| | {| | ||
− | |||
|- | |- | ||
− | | B || &#66; || b || &#98; | + | | A || &#65; |
+ | |- | ||
+ | | B || &#66; | ||
+ | |- | ||
+ | | C || &#67; | ||
+ | |- | ||
+ | | D || &#68; | ||
+ | |- | ||
+ | | E || &#69; | ||
+ | |- | ||
+ | | F || &#70; | ||
+ | |- | ||
+ | | G || &#71; | ||
+ | |- | ||
+ | | H || &#72; | ||
+ | |- | ||
+ | | I || &#73; | ||
+ | |- | ||
+ | | J || &#74; | ||
+ | |- | ||
+ | | K || &#75; | ||
+ | |- | ||
+ | | L || &#76; | ||
+ | |- | ||
+ | | M || &#77; | ||
+ | |- | ||
+ | | N || &#78; | ||
+ | |- | ||
+ | | O || &#79; | ||
+ | |- | ||
+ | | P || &#80; | ||
+ | |- | ||
+ | | Q || &#81; | ||
+ | |- | ||
+ | | R || &#82; | ||
+ | |- | ||
+ | | S || &#83; | ||
+ | |- | ||
+ | | T || &#84; | ||
+ | |- | ||
+ | | U || &#85; | ||
+ | |- | ||
+ | | V || &#86; | ||
+ | |- | ||
+ | | W || &#87; | ||
+ | |- | ||
+ | | X || &#88; | ||
+ | |- | ||
+ | | Y || &#89; | ||
+ | |- | ||
+ | | Z || &#90; | ||
+ | |- | ||
+ | | [ || &#91; | ||
+ | |- | ||
+ | | \ || &#92; | ||
+ | |- | ||
+ | | ] || &#93; | ||
+ | |- | ||
+ | | ^ || &#94; | ||
+ | |- | ||
+ | | _ || &#95; | ||
+ | |- | ||
+ | | ` || &#96; | ||
+ | |} | ||
+ | |||
+ | | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | a || &#97; | ||
+ | |- | ||
+ | | b || &#98; | ||
+ | |- | ||
+ | | c || &#99; | ||
+ | |- | ||
+ | | d || &#100; | ||
+ | |- | ||
+ | | e || &#101; | ||
+ | |- | ||
+ | | f || &#102; | ||
+ | |- | ||
+ | | g || &#103; | ||
+ | |- | ||
+ | | h || &#104; | ||
+ | |- | ||
+ | | i || &#105; | ||
+ | |- | ||
+ | | j || &#106; | ||
+ | |- | ||
+ | | k || &#107; | ||
+ | |- | ||
+ | | l || &#108; | ||
+ | |- | ||
+ | | m || &#109; | ||
+ | |- | ||
+ | | n || &#110; | ||
+ | |- | ||
+ | | o || &#111; | ||
+ | |- | ||
+ | | p || &#112; | ||
+ | |- | ||
+ | | q || &#113; | ||
+ | |- | ||
+ | | r || &#114; | ||
+ | |- | ||
+ | | s || &#115; | ||
+ | |- | ||
+ | | t || &#116; | ||
+ | |- | ||
+ | | u || &#117; | ||
+ | |- | ||
+ | | v || &#118; | ||
+ | |- | ||
+ | | w || &#119; | ||
+ | |- | ||
+ | | x || &#120; | ||
+ | |- | ||
+ | | y || &#121; | ||
+ | |- | ||
+ | | z || &#122; | ||
|- | |- | ||
− | | &# | + | | { || &#123; |
|- | |- | ||
− | | &# | + | | | || &#124; |
|- | |- | ||
− | | &# | + | | } || &#125; |
|- | |- | ||
− | | &# | + | | ~ || &#126; |
|- | |- | ||
− | | &# | + | |  || &#127; |
+ | |- | ||
+ | | € || &#128; | ||
+ | |} | ||
+ | |||
+ | | | ||
+ | |||
+ | {| | ||
|- | |- | ||
− | | &# | + | |  || &#129; |
|- | |- | ||
− | | &# | + | | ‚ || &#130; |
|- | |- | ||
− | | &# | + | | ƒ || &#131; |
|- | |- | ||
− | | &# | + | | „ || &#132; |
|- | |- | ||
− | | &# | + | | … || &#133; |
|- | |- | ||
− | | &# | + | | † || &#134; |
|- | |- | ||
− | | &# | + | | ‡ || &#135; |
|- | |- | ||
− | | &# | + | | ˆ || &#136; |
|- | |- | ||
− | | &# | + | | ‰ || &#137; |
|- | |- | ||
− | | &# | + | | Š || &#138; |
|- | |- | ||
− | | &# | + | | ‹ || &#139; |
|- | |- | ||
− | | &# | + | | Œ || &#140; |
|- | |- | ||
− | | &# | + | |  || &#141; |
|- | |- | ||
− | | &# | + | | Ž || &#142; |
|- | |- | ||
− | | &# | + | |  || &#143; |
|- | |- | ||
− | | &# | + | |  || &#144; |
|- | |- | ||
− | | &# | + | | ‘ || &#145; |
|- | |- | ||
− | | &# | + | | ’ || &#146; |
|- | |- | ||
− | | &# | + | | “ || &#147; |
|- | |- | ||
− | | &# | + | | ” || &#148; |
|- | |- | ||
− | | &# | + | | • || &#149; |
|- | |- | ||
− | | &# | + | | – || &#150; |
|- | |- | ||
− | | &# | + | | — || &#151; |
|- | |- | ||
− | | &# | + | | ˜ || &#152; |
|- | |- | ||
− | | &# | + | | ™ || &#153; &trade; |
|- | |- | ||
− | | | + | | š || &#154; |
|- | |- | ||
− | | | + | | › || &#155; |
+ | |- | ||
+ | | œ || &#156; | ||
+ | |- | ||
+ | |  || &#157; | ||
+ | |- | ||
+ | | ž || &#158; | ||
+ | |- | ||
+ | | Ÿ || &#159; | ||
+ | |- | ||
+ | |   || &#160; | ||
|} | |} | ||
+ | |||
+ | | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | ¡ || &#161; | ||
+ | |- | ||
+ | | ¢ || &#162; | ||
+ | |- | ||
+ | | £ || &#163; &pound; | ||
+ | |- | ||
+ | | ¤ || &#164; | ||
+ | |- | ||
+ | | ¥ || &#165; | ||
+ | |- | ||
+ | | ¦ || &#166; | ||
+ | |- | ||
+ | | § || &#167; &sect; | ||
+ | |- | ||
+ | | ¨ || &#168; | ||
+ | |- | ||
+ | | © || &#169; &copy; | ||
+ | |- | ||
+ | | ª || &#170; | ||
+ | |- | ||
+ | | « || &#171; | ||
+ | |- | ||
+ | | ¬ || &#172; | ||
+ | |- | ||
+ | | ­ || &#173; | ||
+ | |- | ||
+ | | ® || &#174; | ||
+ | |- | ||
+ | | ¯ || &#175; | ||
+ | |- | ||
+ | | ° || &#176; | ||
+ | |- | ||
+ | | ± || &#177; | ||
+ | |- | ||
+ | | ² || &#178; | ||
+ | |- | ||
+ | | ³ || &#179; | ||
+ | |- | ||
+ | | ´ || &#180; | ||
+ | |- | ||
+ | | µ || &#181; | ||
+ | |- | ||
+ | | ¶ || &#182; | ||
+ | |- | ||
+ | | · || &#183; | ||
+ | |- | ||
+ | | ¸ || &#184; | ||
+ | |- | ||
+ | | ¹ || &#185; | ||
+ | |- | ||
+ | | º || &#186; | ||
+ | |- | ||
+ | | » || &#187; | ||
+ | |- | ||
+ | | ¼ || &#188; | ||
+ | |- | ||
+ | | ½ || &#189; | ||
+ | |- | ||
+ | | ¾ || &#190; | ||
+ | |- | ||
+ | | ¿ || &#191; | ||
+ | |- | ||
+ | | À || &#192; &Agrave; | ||
+ | |} | ||
+ | |||
+ | | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | Á || &#193; &Aacute; | ||
+ | |- | ||
+ | | Â || &#194; | ||
+ | |- | ||
+ | | Ã || &#195; &Atilde; | ||
+ | |- | ||
+ | | Ä || &#196; &Auml; | ||
+ | |- | ||
+ | | Å || &#197; | ||
+ | |- | ||
+ | | Æ || &#198; &AElig; | ||
+ | |- | ||
+ | | Ç || &#199; &Ccedil; | ||
+ | |- | ||
+ | | È || &#200; | ||
+ | |- | ||
+ | | É || &#201; | ||
+ | |- | ||
+ | | Ê || &#202; | ||
+ | |- | ||
+ | | Ë || &#203; | ||
+ | |- | ||
+ | | Ì || &#204; | ||
+ | |- | ||
+ | | Í || &#205; | ||
+ | |- | ||
+ | | Î || &#206; | ||
+ | |- | ||
+ | | Ï || &#207; | ||
+ | |- | ||
+ | | Ð || &#208; | ||
+ | |- | ||
+ | | Ñ || &#209; | ||
+ | |- | ||
+ | | Ò || &#210; | ||
+ | |- | ||
+ | | Ó || &#211; | ||
+ | |- | ||
+ | | Ô || &#212; | ||
+ | |- | ||
+ | | Õ || &#213; | ||
+ | |- | ||
+ | | Ö || &#214; | ||
+ | |- | ||
+ | | × || &#215; | ||
+ | |- | ||
+ | | Ø || &#216; | ||
+ | |- | ||
+ | | Ù || &#217; | ||
+ | |- | ||
+ | | Ú || &#218; | ||
+ | |- | ||
+ | | Û || &#219; | ||
+ | |- | ||
+ | | Ü || &#220; | ||
+ | |- | ||
+ | | Ý || &#221; | ||
+ | |- | ||
+ | | Þ || &#222; | ||
+ | |- | ||
+ | | ß || &#223; | ||
+ | |- | ||
+ | | à || &#224; | ||
+ | |} | ||
+ | |||
+ | | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | á || &#225; | ||
+ | |- | ||
+ | | â || &#226; | ||
+ | |- | ||
+ | | ã || &#227; | ||
+ | |- | ||
+ | | ä || &#228; | ||
+ | |- | ||
+ | | å || &#229; | ||
+ | |- | ||
+ | | æ || &#230; | ||
+ | |- | ||
+ | | ç || &#231; &ccedil; | ||
+ | |- | ||
+ | | è || &#232; | ||
+ | |- | ||
+ | | é || &#233; | ||
+ | |- | ||
+ | | ê || &#234; | ||
+ | |- | ||
+ | | ë || &#235; | ||
+ | |- | ||
+ | | ì || &#236; | ||
+ | |- | ||
+ | | í || &#237; | ||
+ | |- | ||
+ | | î || &#238; | ||
+ | |- | ||
+ | | ï || &#239; | ||
+ | |- | ||
+ | | ð || &#240; | ||
+ | |- | ||
+ | | ñ || &#241; | ||
+ | |- | ||
+ | | ò || &#242; | ||
+ | |- | ||
+ | | ó || &#243; | ||
+ | |- | ||
+ | | ô || &#244; | ||
+ | |- | ||
+ | | õ || &#245; | ||
+ | |- | ||
+ | | ö || &#246; | ||
+ | |- | ||
+ | | ÷ || &#247; | ||
+ | |- | ||
+ | | ø || &#248; &oslash; | ||
+ | |- | ||
+ | | ù || &#249; | ||
+ | |- | ||
+ | | ú || &#250; | ||
+ | |- | ||
+ | | û || &#251; | ||
+ | |- | ||
+ | | ü || &#252; | ||
+ | |- | ||
+ | | ý || &#253; | ||
+ | |- | ||
+ | | þ || &#254; &thorn; | ||
+ | |- | ||
+ | | ÿ || &#255; | ||
+ | |- | ||
+ | | Ā || &#256; | ||
+ | |} | ||
+ | |||
+ | |} | ||
+ | And yes, it keeps going after this: āĂă... | ||
+ | |||
+ | Others: | ||
+ | * &rsquo; - ’ - right single-quote | ||
+ | * &lsquo; - ‘ - left single-quote | ||
+ | ==Links== | ||
+ | * {{wikipedia|List of XML and HTML character entity references}} - a more complete list |
Revision as of 10:11, 11 November 2017
About
An HTML entity is a combination of characters which will be displayed within an HTML browser as a single character. All HTML entities begin with "&" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&", which displays an ampersand); any possible character can be displayed as an entity by using the format "&#number;" where number is the output character's ASCII value in decimal.
see also: wikipedia:Percent-encoding
Reference
- Wikipedia
- HTML 4.0 entities
- Named character references, in alphabetical order, including newer ones not supported by most browsers
- HTML Entity List: "Complete list of HTML entities with their numbers and names. Also included is a full list of ASCII characters that can be represented in HTML (i.e. printable characters)."
Questions
- Does Google resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page using google by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: FRELGKLOTZ)
- 2007-07-15 answer: Yes, it does find it (though it took several months at least before this page got indexed).
- 2016-11-28 answer: Searching for the word in question no longer produces any results. Neither does searching for "schmerglefrotz".
Note
2017-11-11 MediaWiki 1.28.0 apparently no longer supports the "&#d;" style of entity; you now have to use named entities.
Entities by Number
- 0-8 are not translated
- 9 is probably TAB
- 10 is probably LF
- 13 is probably CR
- 14-31 are not translated
- 32 is a standard space
lower ASCII | upper ASCII | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
And yes, it keeps going after this: āĂă...
Others:
- ’ - ’ - right single-quote
- ‘ - ‘ - left single-quote
Links
- Wikipedia - a more complete list