Unicode的空白字符有哪些
最新推荐文章于 2025-09-11 02:17:48 发布
转载
最新推荐文章于 2025-09-11 02:17:48 发布
·
8.1k 阅读
·
1
·
6
·
CC 4.0 BY-SA版权
原文链接:https://en.wikipedia.org/wiki/Whitespace_character
文章标签:
#linux
笔记
专栏收录该内容
5 篇文章
订阅专栏
本文详细介绍了Unicode标准中具有White_Space属性的各种字符,包括常见的ASCII空格、制表符、换行符等,以及用于数学公式和特殊排版的空格字符。文章还列举了一些不具有White_Space属性但与空白处理相关的Unicode字符。
Unicode characters with White_Space property
nameHexDecScriptBlockGeneral categoryNotescharacter tabulationU+00099CommonBasic LatinOther, controlHT, Horizontal Tab. HTML/XML named entity: 	, LaTeX: ‘\tab’line feedU+000A10CommonBasic LatinOther, controlLF, Line feed. HTML/XML named entity: 
line tabulationU+000B11CommonBasic LatinOther, controlVT, Vertical Tabform feedU+000C12CommonBasic LatinOther, controlFF, Form feedcarriage returnU+000D13CommonBasic LatinOther, controlCR, Carriage returnspaceU+002032CommonBasic LatinSeparator, spaceMost common (normal ASCII space)next lineU+0085133CommonLatin-1 SupplementOther, controlNEL, Next lineno-break spaceU+00A0160CommonLatin-1 SupplementSeparator, spaceNon-breaking space: identical to U+0020, but not a point at which a line may be broken. HTML/XML named entity: , LaTeX: '\ ’ogham space markU+16805760OghamOghamSeparator, spaceUsed for interword separation in Ogham text. Normally a vertical line in vertical text or a horizontal line in horizontal text, but may also be a blank space in “stemless” fonts. Requires an Ogham font.en quadU+20008192CommonGeneral PunctuationSeparator, spaceWidth of one en. U+2002 is canonically equivalent to this character; U+2002 is preferred.em quadU+20018193CommonGeneral PunctuationSeparator, spaceAlso known as “mutton quad”. Width of one em. U+2003 is canonically equivalent to this character; U+2003 is preferred.en spaceU+20028194CommonGeneral PunctuationSeparator, spaceAlso known as “nut”. Width of one en. U+2000 En Quad is canonically equivalent to this character; U+2002 is preferred. HTML/XML named entity: , LaTeX: ‘\enspace’em spaceU+20038195CommonGeneral PunctuationSeparator, spaceAlso known as “mutton”. Width of one em. U+2001 Em Quad is canonically equivalent to this character; U+2003 is preferred. HTML/XML named entity: , LaTeX: ‘\quad’three-per-em spaceU+20048196CommonGeneral PunctuationSeparator, spaceAlso known as “thick space”. One third of an em wide. HTML/XML named entity:  four-per-em spaceU+20058197CommonGeneral PunctuationSeparator, spaceAlso known as “mid space”. One fourth of an em wide. HTML/XML named entity:  six-per-em spaceU+20068198CommonGeneral PunctuationSeparator, spaceOne sixth of an em wide. In computer typography, sometimes equated to U+2009.figure spaceU+20078199CommonGeneral PunctuationSeparator, spaceFigure space. In fonts with monospaced digits, equal to the width of one digit. HTML/XML named entity:  punctuation spaceU+20088200CommonGeneral PunctuationSeparator, spaceAs wide as the narrow punctuation in a font, i.e. the advance width of the period or comma.[2] HTML/XML named entity:  thin spaceU+20098201CommonGeneral PunctuationSeparator, spaceOne-fifth (sometimes one-sixth) of an em wide. Recommended for use as a thousands separator for measures made with SI units. Unlike U+2002 to U+2008, its width may get adjusted in typesetting.[3] HTML/XML named entity: ; LaTeX: ‘,’hair spaceU+200A8202CommonGeneral PunctuationSeparator, spaceThinner than a thin space. HTML/XML named entity:   (does not work in all browsers)line separatorU+20288232CommonGeneral PunctuationSeparator, lineparagraph separatorU+20298233CommonGeneral PunctuationSeparator, paragraphnarrow no-break spaceU+202F8239CommonGeneral PunctuationSeparator, spaceNarrow no-break space. Similar in function to U+00A0 No-Break Space. When used with Mongolian, its width is usually one third of the normal space; in other context, its width sometimes resembles that of the Thin Space (U+2009).medium mathematical spaceU+205F8287CommonGeneral PunctuationSeparator, spaceMMSP. Used in mathematical formulae. Four-eighteenths of an em.[4] In mathematical typography, the widths of spaces are usually given in integral multiples of an eighteenth of an em, and 4/18 em may be used in several situations, for example between the a and the + and between the + and the b in the expression a + b.[5] HTML/XML named entity:  ideographic spaceU+300012288CommonCJK Symbols and PunctuationSeparator, spaceAs wide as a CJK character cell (fullwidth). Used, for example, in tai tou.Related Unicode characters without White_Space property
nameHexDecScriptBlockGeneral categoryNotesmongolian vowel separatorU+180E6158MongolianMongolianOther, FormatMVS. A narrow space character, used in Mongolian to cause the final two characters of a word to take on different shapes.[6] It is no longer classified as space character (i.e. in Zs category) in Unicode 6.3.0, even though it was in previous versions of the standard.zero width spaceU+200B8203?General PunctuationOther, FormatZWSP, zero-width space. Used to indicate word boundaries to text processing systems when using scripts that do not use explicit spacing. It is similar to the soft hyphen, with the difference that the latter is used to indicate syllable boundaries, and should display a visible hyphen when the line breaks at it. HTML/XML named entity: ​zero width non-joinerU+200C8204?General PunctuationOther, FormatZWNJ, zero-width non-joiner. When placed between two characters that would otherwise be connected, a ZWNJ causes them to be printed in their final and initial forms, respectively. HTML/XML named entity: zero width joinerU+200D8205?General PunctuationOther, FormatZWJ, zero-width joiner. When placed between two characters that would otherwise not be connected, a ZWJ causes them to be printed in their connected forms. HTML/XML named entity: word joinerU+20608288?General PunctuationOther, FormatWJ, word joiner. Similar to U+200B, but not a point at which a line may be broken. HTML/XML named entity: ⁠zero width non-breaking spaceU+FEFF65279?Arabic Presentation Forms-BOther, FormatZero-width non-breaking space. Used primarily as a Byte Order Mark. Use as an indication of non-breaking is deprecated as of Unicode 3.2; see U+2060 instead.
魔力百科【转载】怀旧服中后期练级点经验表
方舟生存进化手游🔥幼崽快速成长秘籍💡