Difference between revisions of "HTML entity"
(→Reference: standard ASCII in entities; Google query) |
(→Entities by Number: formatting improvements; :"by appearance" section) |
||
(18 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
− | + | <hide> | |
− | [[category: | + | [[page type::article]] |
+ | [[page type::reference]] | ||
+ | [[thing type::character format]] | ||
+ | [[category:computer terminology]] | ||
+ | </hide> | ||
+ | ==About== | ||
+ | An [[HTML entity]] is a sequence of characters that an [[HTML]] browser displays as a single character. All HTML entities begin with "&" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&amp;", which displays an ampersand); any character can be written as an entity using the format "&#<u>number</u>;" where <u>number</u> is the character's Unicode code point in decimal (identical to its [[ASCII]] value for the first 128 characters). | ||
+ | ''see also: [[wikipedia:Percent-encoding]]'' | ||
==Reference== | ==Reference== | ||
* {{wikipedia|Character entity reference}} | * {{wikipedia|Character entity reference}} | ||
* [http://htmlhelp.com/reference/html40/entities/ HTML 4.0 entities] | * [http://htmlhelp.com/reference/html40/entities/ HTML 4.0 entities] | ||
+ | * [https://developers.whatwg.org/named-character-references.html#named-character-references Named character references], in alphabetical order, including newer ones not supported by most browsers | ||
+ | * [https://www.freeformatter.com/html-entities.html HTML Entity List]: "Complete list of HTML entities with their numbers and names. Also included is a full list of ASCII characters that can be represented in HTML (i.e. printable characters)." | ||
+ | |||
==Questions== | ==Questions== | ||
− | * Does Google resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page using google by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: '''FRELGKLOTZ''') | + | * Does [[Google]] resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page using google by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: '''FRELGKLOTZ''') |
− | == | + | ** '''2007-07-15 answer''': Yes, it does find it (though it took several months at least before this page got indexed). |
+ | ** '''2016-11-28 answer''': Searching for the word in question no longer produces any results. Neither does searching for "schmerglefrotz". | ||
+ | |||
+ | ==Note== | ||
+ | * '''2017-11-11''' MediaWiki 1.28.0 apparently no longer supports the "&#d;" style of entity; you now have to use named entities. | ||
+ | * '''2020-05-23''' This seems to have been fixed? | ||
+ | |||
+ | ==Entities by Number== | ||
+ | * 0-8 are not translated | ||
+ | * 9 is probably TAB | ||
+ | * 10 is probably [[linefeed|LF]] | ||
+ | * 13 is probably [[carriage return|CR]] | ||
+ | * 14-31 are not translated | ||
+ | * 32 is a standard space | ||
+ | {| width=100% | ||
+ | ! colspan=3 style="border-bottom: 1px solid blue;" | lower ASCII || colspan=4 style="border-bottom: 1px solid green;" | upper ASCII | ||
+ | |- | ||
+ | | style="border:#0d0 1px solid; padding: 2px;" | | ||
+ | {| | ||
+ | {{show/entity/row|#33}} | ||
+ | {{show/entity/row|#34}} | ||
+ | {{show/entity/row|#35}} | ||
+ | {{show/entity/row|#36}} | ||
+ | {{show/entity/row|#37}} | ||
+ | {{show/entity/row|#38}} {{show/entity|amp}} | ||
+ | {{show/entity/row|#39}} | ||
+ | {{show/entity/row|#40}} | ||
+ | {{show/entity/row|#41}} | ||
+ | {{show/entity/row|#42}} | ||
+ | {{show/entity/row|#43}} | ||
+ | {{show/entity/row|#44}} | ||
+ | {{show/entity/row|#45}} | ||
+ | {{show/entity/row|#46}} | ||
+ | {{show/entity/row|#47}} | ||
+ | {{show/entity/row|#48}} | ||
+ | {{show/entity/row|#49}} | ||
+ | {{show/entity/row|#50}} | ||
+ | {{show/entity/row|#51}} | ||
+ | {{show/entity/row|#52}} | ||
+ | {{show/entity/row|#53}} | ||
+ | {{show/entity/row|#54}} | ||
+ | {{show/entity/row|#55}} | ||
+ | {{show/entity/row|#56}} | ||
+ | {{show/entity/row|#57}} | ||
+ | {{show/entity/row|#58}} | ||
+ | {{show/entity/row|#59}} | ||
+ | {{show/entity/row|#60}} | ||
+ | {{show/entity/row|#61}} | ||
+ | {{show/entity/row|#62}} | ||
+ | {{show/entity/row|#63}} | ||
+ | {{show/entity/row|#64}} | ||
+ | |} | ||
+ | |||
+ | | style="border:#0d0 1px solid; padding: 2px;" | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | A || &#65; | ||
+ | |- | ||
+ | | B || &#66; | ||
+ | |- | ||
+ | | C || &#67; | ||
+ | |- | ||
+ | | D || &#68; | ||
+ | |- | ||
+ | | E || &#69; | ||
+ | |- | ||
+ | | F || &#70; | ||
+ | |- | ||
+ | | G || &#71; | ||
+ | |- | ||
+ | | H || &#72; | ||
+ | |- | ||
+ | | I || &#73; | ||
+ | |- | ||
+ | | J || &#74; | ||
+ | |- | ||
+ | | K || &#75; | ||
+ | |- | ||
+ | | L || &#76; | ||
+ | |- | ||
+ | | M || &#77; | ||
+ | |- | ||
+ | | N || &#78; | ||
+ | |- | ||
+ | | O || &#79; | ||
+ | |- | ||
+ | | P || &#80; | ||
+ | |- | ||
+ | | Q || &#81; | ||
+ | |- | ||
+ | | R || &#82; | ||
+ | |- | ||
+ | | S || &#83; | ||
+ | |- | ||
+ | | T || &#84; | ||
+ | |- | ||
+ | | U || &#85; | ||
+ | |- | ||
+ | | V || &#86; | ||
+ | |- | ||
+ | | W || &#87; | ||
+ | |- | ||
+ | | X || &#88; | ||
+ | |- | ||
+ | | Y || &#89; | ||
+ | |- | ||
+ | | Z || &#90; | ||
+ | |- | ||
+ | | [ || &#91; | ||
+ | |- | ||
+ | | \ || &#92; | ||
+ | |- | ||
+ | | ] || &#93; | ||
+ | |- | ||
+ | | ^ || &#94; | ||
+ | |- | ||
+ | | _ || &#95; | ||
+ | |- | ||
+ | | ` || &#96; | ||
+ | |} | ||
+ | |||
+ | | style="border:#0d0 1px solid; padding: 2px;" | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | a || &#97; | ||
+ | |- | ||
+ | | b || &#98; | ||
+ | |- | ||
+ | | c || &#99; | ||
+ | |- | ||
+ | | d || &#100; | ||
+ | |- | ||
+ | | e || &#101; | ||
+ | |- | ||
+ | | f || &#102; | ||
+ | |- | ||
+ | | g || &#103; | ||
+ | |- | ||
+ | | h || &#104; | ||
+ | |- | ||
+ | | i || &#105; | ||
+ | |- | ||
+ | | j || &#106; | ||
+ | |- | ||
+ | | k || &#107; | ||
+ | |- | ||
+ | | l || &#108; | ||
+ | |- | ||
+ | | m || &#109; | ||
+ | |- | ||
+ | | n || &#110; | ||
+ | |- | ||
+ | | o || &#111; | ||
+ | |- | ||
+ | | p || &#112; | ||
+ | |- | ||
+ | | q || &#113; | ||
+ | |- | ||
+ | | r || &#114; | ||
+ | |- | ||
+ | | s || &#115; | ||
+ | |- | ||
+ | | t || &#116; | ||
+ | |- | ||
+ | | u || &#117; | ||
+ | |- | ||
+ | | v || &#118; | ||
+ | |- | ||
+ | | w || &#119; | ||
+ | |- | ||
+ | | x || &#120; | ||
+ | |- | ||
+ | | y || &#121; | ||
+ | |- | ||
+ | | z || &#122; | ||
+ | |- | ||
+ | | { || &#123; | ||
+ | |- | ||
+ | | | || &#124; | ||
+ | |- | ||
+ | | } || &#125; | ||
+ | |- | ||
+ | | ~ || &#126; | ||
+ | |- | ||
+ | |  || &#127; | ||
+ | |- | ||
+ | | € || &#128; | ||
+ | |} | ||
+ | |||
+ | | style="border:#0d0 1px solid; padding: 2px;" | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | |  || &#129; | ||
+ | |- | ||
+ | | ‚ || &#130; | ||
+ | |- | ||
+ | | ƒ || &#131; | ||
+ | |- | ||
+ | | „ || &#132; | ||
+ | |- | ||
+ | | … || &#133; | ||
+ | |- | ||
+ | | † || &#134; | ||
+ | |- | ||
+ | | ‡ || &#135; | ||
+ | |- | ||
+ | | ˆ || &#136; | ||
+ | |- | ||
+ | | ‰ || &#137; | ||
+ | |- | ||
+ | | Š || &#138; | ||
+ | |- | ||
+ | | ‹ || &#139; | ||
+ | |- | ||
+ | | Œ || &#140; | ||
+ | |- | ||
+ | |  || &#141; | ||
+ | |- | ||
+ | | Ž || &#142; | ||
+ | |- | ||
+ | |  || &#143; | ||
+ | |- | ||
+ | |  || &#144; | ||
+ | |- | ||
+ | | ‘ || &#145; | ||
+ | |- | ||
+ | | ’ || &#146; | ||
+ | |- | ||
+ | | “ || &#147; | ||
+ | |- | ||
+ | | ” || &#148; | ||
+ | |- | ||
+ | | • || &#149; | ||
+ | |- | ||
+ | | – || &#150; | ||
+ | |- | ||
+ | | — || &#151; | ||
+ | |- | ||
+ | | ˜ || &#152; | ||
+ | |- | ||
+ | | ™ || &#153; &trade; | ||
+ | |- | ||
+ | | š || &#154; | ||
+ | |- | ||
+ | | › || &#155; | ||
+ | |- | ||
+ | | œ || &#156; | ||
+ | |- | ||
+ | |  || &#157; | ||
+ | |- | ||
+ | | ž || &#158; | ||
+ | |- | ||
+ | | Ÿ || &#159; | ||
+ | |- | ||
+ | |   || &#160; | ||
+ | |} | ||
+ | |||
+ | | style="border:#0d0 1px solid; padding: 2px;" | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | ¡ || &#161; | ||
+ | |- | ||
+ | | ¢ || &#162; | ||
+ | |- | ||
+ | | £ || &#163; &pound; | ||
+ | |- | ||
+ | | ¤ || &#164; | ||
+ | |- | ||
+ | | ¥ || &#165; | ||
+ | |- | ||
+ | | ¦ || &#166; | ||
+ | |- | ||
+ | | § || &#167; &sect; | ||
+ | |- | ||
+ | | ¨ || &#168; | ||
+ | |- | ||
+ | | © || &#169; &copy; | ||
+ | |- | ||
+ | | ª || &#170; | ||
+ | |- | ||
+ | | « || &#171; | ||
+ | |- | ||
+ | | ¬ || &#172; | ||
+ | |- | ||
+ | | ­ || &#173; | ||
+ | |- | ||
+ | | ® || &#174; | ||
+ | |- | ||
+ | | ¯ || &#175; | ||
+ | |- | ||
+ | | ° || &#176; | ||
+ | |- | ||
+ | | ± || &#177; | ||
+ | |- | ||
+ | | ² || &#178; | ||
+ | |- | ||
+ | | ³ || &#179; | ||
+ | |- | ||
+ | | ´ || &#180; | ||
+ | |- | ||
+ | | µ || &#181; | ||
+ | |- | ||
+ | | ¶ || &#182; | ||
+ | |- | ||
+ | | · || &#183; | ||
+ | |- | ||
+ | | ¸ || &#184; | ||
+ | |- | ||
+ | | ¹ || &#185; | ||
+ | |- | ||
+ | | º || &#186; | ||
+ | |- | ||
+ | | » || &#187; | ||
+ | |- | ||
+ | | ¼ || &#188; | ||
+ | |- | ||
+ | | ½ || &#189; | ||
+ | |- | ||
+ | | ¾ || &#190; | ||
+ | |- | ||
+ | | ¿ || &#191; | ||
+ | |- | ||
+ | | À || &#192; &Agrave; | ||
+ | |} | ||
+ | |||
+ | | style="border:#0d0 1px solid; padding: 2px;" | | ||
+ | |||
{| | {| | ||
− | |||
|- | |- | ||
− | | &# | + | | Á || &#193; &Aacute; |
|- | |- | ||
− | | &# | + | | Â || &#194; |
|- | |- | ||
− | | &# | + | | Ã || &#195; &Atilde; |
|- | |- | ||
− | | &# | + | | Ä || &#196; &Auml; |
|- | |- | ||
− | | &# | + | | Å || &#197; |
|- | |- | ||
− | | &# | + | | Æ || &#198; &AElig; |
|- | |- | ||
− | | &# | + | | Ç || &#199; &Ccedil; |
|- | |- | ||
− | | &# | + | | È || &#200; |
|- | |- | ||
− | | &# | + | | É || &#201; |
|- | |- | ||
− | | &# | + | | Ê || &#202; |
|- | |- | ||
− | | &# | + | | Ë || &#203; |
|- | |- | ||
− | | &# | + | | Ì || &#204; |
|- | |- | ||
− | | &# | + | | Í || &#205; |
|- | |- | ||
− | | &# | + | | Î || &#206; |
|- | |- | ||
− | | &# | + | | Ï || &#207; |
|- | |- | ||
− | | &# | + | | Ð || &#208; |
|- | |- | ||
− | | &# | + | | Ñ || &#209; |
|- | |- | ||
− | | &# | + | | Ò || &#210; |
|- | |- | ||
− | | &# | + | | Ó || &#211; |
|- | |- | ||
− | | &# | + | | Ô || &#212; |
|- | |- | ||
− | | &# | + | | Õ || &#213; |
|- | |- | ||
− | | &# | + | | Ö || &#214; |
|- | |- | ||
− | | &# | + | | × || &#215; |
|- | |- | ||
− | | &# | + | | Ø || &#216; |
|- | |- | ||
− | | &# | + | | Ù || &#217; |
|- | |- | ||
− | | &# | + | | Ú || &#218; |
|- | |- | ||
− | | &# | + | | Û || &#219; |
|- | |- | ||
− | | &# | + | | Ü || &#220; |
|- | |- | ||
− | | &# | + | | Ý || &#221; |
|- | |- | ||
− | | &# | + | | Þ || &#222; |
|- | |- | ||
− | | &# | + | | ß || &#223; |
|- | |- | ||
− | | | + | | à || &#224; |
+ | |} | ||
+ | |||
+ | | style="border:#0d0 1px solid; padding: 2px;" | | ||
+ | |||
+ | {| | ||
+ | |- | ||
+ | | á || &#225; | ||
+ | |- | ||
+ | | â || &#226; | ||
|- | |- | ||
− | | | + | | ã || &#227; |
+ | |- | ||
+ | | ä || &#228; | ||
+ | |- | ||
+ | | å || &#229; | ||
+ | |- | ||
+ | | æ || &#230; | ||
+ | |- | ||
+ | | ç || &#231; &ccedil; | ||
+ | |- | ||
+ | | è || &#232; | ||
+ | |- | ||
+ | | é || &#233; | ||
+ | |- | ||
+ | | ê || &#234; | ||
+ | |- | ||
+ | | ë || &#235; | ||
+ | |- | ||
+ | | ì || &#236; | ||
+ | |- | ||
+ | | í || &#237; | ||
+ | |- | ||
+ | | î || &#238; | ||
+ | |- | ||
+ | | ï || &#239; | ||
+ | |- | ||
+ | | ð || &#240; | ||
+ | |- | ||
+ | | ñ || &#241; | ||
+ | |- | ||
+ | | ò || &#242; | ||
+ | |- | ||
+ | | ó || &#243; | ||
+ | |- | ||
+ | | ô || &#244; | ||
+ | |- | ||
+ | | õ || &#245; | ||
+ | |- | ||
+ | | ö || &#246; | ||
+ | |- | ||
+ | | ÷ || &#247; | ||
+ | |- | ||
+ | | ø || &#248; &oslash; | ||
+ | |- | ||
+ | | ù || &#249; | ||
+ | |- | ||
+ | | ú || &#250; | ||
+ | |- | ||
+ | | û || &#251; | ||
+ | |- | ||
+ | | ü || &#252; | ||
+ | |- | ||
+ | | ý || &#253; | ||
+ | |- | ||
+ | | þ || &#254; &thorn; | ||
+ | |- | ||
+ | | ÿ || &#255; | ||
+ | |- | ||
+ | | Ā || &#256; | ||
+ | |} | ||
+ | |||
+ | |} | ||
+ | ==Entities by Appearance== | ||
+ | * {{show/entity|rsquo}} - right single-quote | ||
+ | * {{show/entity|lsquo}} - left single-quote | ||
+ | * {{show/entity|Alpha}} - Greek letter alpha (upper) | ||
+ | * {{show/entity|Beta}} - Greek letter beta (upper) | ||
+ | * {{show/entity|Gamma}} - Greek letter gamma (upper) | ||
+ | * {{show/entity|Delta}} - Greek letter delta (upper) | ||
+ | * {{show/entity|alpha}} - Greek letter alpha (lower) | ||
+ | * {{show/entity|beta}} - Greek letter beta (lower) | ||
+ | * {{show/entity|gamma}} - Greek letter gamma (lower) | ||
+ | * {{show/entity|delta}} - Greek letter delta (lower) | ||
+ | ===Roman alphabet=== | ||
+ | {| border=1 | ||
+ | |- | ||
+ | ! upper || lower || mnemonics | ||
+ | |- | ||
+ | | {{show/entity|#193}} - '''A''' with acute accent | ||
+ | | {{show/entity|#225}} - '''a''' with acute accent | ||
+ | | {{show/entity|Aacute}} {{show/entity|aacute}} | ||
+ | |- | ||
+ | | | ||
+ | | {{show/entity|#257}} - '''a''' with macron | ||
+ | |- | ||
+ | | | ||
+ | | {{show/entity|#259}} - '''a''' with breve | ||
+ | |- | ||
+ | | {{show/entity|#268}} - '''C''' with {{l/mw|caron}} | ||
+ | | {{show/entity|#269}} - '''c''' with {{l/mw|caron}} | ||
+ | | {{show/entity|Ccaron}} {{show/entity|ccaron}} | ||
|} | |} | ||
+ | |||
+ | ==Links== | ||
+ | * {{wikipedia|List of XML and HTML character entity references}} - a more complete list |
Latest revision as of 21:15, 26 April 2021
About
An HTML entity is a sequence of characters that an HTML browser displays as a single character. All HTML entities begin with "&" (ampersand) and end with ";" (semicolon). Many entities have mnemonic names (such as "&amp;", which displays an ampersand); any character can be written as an entity using the format "&#number;" where number is the character's Unicode code point in decimal (identical to its ASCII value for the first 128 characters).
see also: wikipedia:Percent-encoding
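The two entity forms described above can be sketched with Python's standard `html` module. This is only an illustration; the helper name `to_numeric_entity` is made up for this example and is not part of any library.

```python
import html


def to_numeric_entity(ch: str) -> str:
    """Return the decimal numeric entity for a single character (hypothetical helper)."""
    return f"&#{ord(ch)};"


# Named and numeric forms of the same character decode to the same result.
named = html.unescape("&amp;")
numeric = html.unescape(to_numeric_entity("&"))

# html.escape produces named entities for the characters HTML treats specially.
escaped = html.escape("a < b & c")

print(to_numeric_entity("&"))  # &#38;
print(named, numeric)          # & &
print(escaped)                 # a &lt; b &amp; c
```

The numeric form works for any character because it is built from the code point returned by `ord`, while named entities exist only for a fixed list of characters.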
Reference
- Wikipedia: Character entity reference
- HTML 4.0 entities
- Named character references, in alphabetical order, including newer ones not supported by most browsers
- HTML Entity List: "Complete list of HTML entities with their numbers and names. Also included is a full list of ASCII characters that can be represented in HTML (i.e. printable characters)."
Questions
- Does Google resolve html entities when indexing web pages? That is, if I spelled a word (e.g. "schmerglefrotz") entirely using html entities, would someone be able to find that page using google by typing "schmerglefrotz" (after the site had been spidered, of course)? (As a test, I will spell a completely different word using HTML entities, and try Googling it later: FRELGKLOTZ)
- 2007-07-15 answer: Yes, it does find it (though it took several months at least before this page got indexed).
- 2016-11-28 answer: Searching for the word in question no longer produces any results. Neither does searching for "schmerglefrotz".
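For anyone repeating the test, a word can be spelled entirely in numeric entities with a few lines of Python. This is a minimal sketch; `spell_with_entities` is a hypothetical helper name, and the round trip through `html.unescape` only shows what an entity-aware parser would recover, not what any search engine actually indexes.

```python
import html


def spell_with_entities(word: str) -> str:
    """Encode every character of `word` as a decimal numeric entity."""
    return "".join(f"&#{ord(ch)};" for ch in word)


encoded = spell_with_entities("FRELGKLOTZ")
decoded = html.unescape(encoded)

print(encoded)  # &#70;&#82;&#69;&#76;&#71;&#75;&#76;&#79;&#84;&#90;
print(decoded)  # FRELGKLOTZ
```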
Note
- 2017-11-11 MediaWiki 1.28.0 apparently no longer supports the "&#d;" style of entity; you now have to use named entities.
- 2020-05-23 This seems to have been fixed?
Entities by Number
- 0-8 are not translated
- 9 is probably TAB
- 10 is probably LF
- 13 is probably CR
- 14-31 are not translated
- 32 is a standard space
numbers | group | named entities noted |
---|---|---|
33-64 | lower ASCII | &amp; (38) |
65-96 | lower ASCII | |
97-128 | lower ASCII | |
129-160 | upper ASCII | &trade; (153) |
161-192 | upper ASCII | &pound; (163), &sect; (167), &copy; (169), &Agrave; (192) |
193-224 | upper ASCII | &Aacute; (193), &Atilde; (195), &Auml; (196), &AElig; (198), &Ccedil; (199) |
225-256 | upper ASCII | &ccedil; (231), &oslash; (248), &thorn; (254) |
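How a given number decodes can be checked with Python's `html.unescape`, which follows the HTML5 parsing rules. The sketch below is an assumption-laden illustration: `decode_numeric` is a hypothetical helper, and the results describe Python's standard library, not necessarily any particular browser or MediaWiki version.

```python
import html


def decode_numeric(number: int) -> str:
    """Decode the entity "&#number;" the way Python's HTML5-based unescape does."""
    return html.unescape(f"&#{number};")


# Whitespace controls and ordinary characters decode directly.
tab, lf, cr = decode_numeric(9), decode_numeric(10), decode_numeric(13)
letter_a = decode_numeric(65)

# Numbers 128-159 are remapped to Windows-1252 characters rather than
# to the control characters at those code points.
euro = decode_numeric(128)
trade = decode_numeric(153)

print(repr(tab), repr(lf), repr(cr))  # '\t' '\n' '\r'
print(letter_a, euro, trade)          # A € ™
```

The remapping explains why entries such as 128 and 153 show printable characters even though those numbers are not printable code points in Unicode itself.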
Entities by Appearance
- ’ : &rsquo; - right single-quote
- ‘ : &lsquo; - left single-quote
- Α : &Alpha; - Greek letter alpha (upper)
- Β : &Beta; - Greek letter beta (upper)
- Γ : &Gamma; - Greek letter gamma (upper)
- Δ : &Delta; - Greek letter delta (upper)
- α : &alpha; - Greek letter alpha (lower)
- β : &beta; - Greek letter beta (lower)
- γ : &gamma; - Greek letter gamma (lower)
- δ : &delta; - Greek letter delta (lower)
Roman alphabet
upper | lower | mnemonics |
---|---|---|
Á : &#193; - A with acute accent | á : &#225; - a with acute accent | Á : &Aacute; á : &aacute; |
 | ā : &#257; - a with macron | |
 | ă : &#259; - a with breve | |
Č : &#268; - C with caron | č : &#269; - c with caron | Č : &Ccaron; č : &ccaron; |
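The names and accents in these lists can be cross-checked against Python's standard library. This is a rough sketch: `html.entities.html5` maps entity names (with trailing semicolon) to characters, and `unicodedata.name` gives the official Unicode name for a code point. The variable names are only for illustration.

```python
import unicodedata
from html.entities import html5

# Look up a few of the mnemonic names listed above.
rsquo = html5["rsquo;"]
alpha_upper = html5["Alpha;"]
c_caron = html5["Ccaron;"]

# Check which accent a numbered entity actually carries.
name_257 = unicodedata.name(chr(257))
name_259 = unicodedata.name(chr(259))
name_268 = unicodedata.name(chr(268))

print(rsquo, alpha_upper, c_caron)
print(name_257)  # LATIN SMALL LETTER A WITH MACRON
print(name_259)  # LATIN SMALL LETTER A WITH BREVE
print(name_268)  # LATIN CAPITAL LETTER C WITH CARON
```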
Links
- Wikipedia: List of XML and HTML character entity references - a more complete list