Skip to content
Surf Wiki
Save to docs
general/unicode-blocks

From Surf Wiki (app.surf) — the open knowledge base

General Punctuation

General Punctuation

FieldValue
blocknameGeneral Punctuation
rangestart2000
rangeend206F
script1Common (109 char.)
script2Inherited (2 char.)
symbolsPunctuation
Spaces
Format controls
deprecated6
1_0_067
1_19
3_07
3_212
4_02
4_19
5_11
6_34
note

Spaces Format controls

General Punctuation is a Unicode block containing punctuation, spacing, and formatting characters for use with all scripts and writing systems. Included are the defined-width spaces, joining formats, directional formats, smart quotes, archaic and novel punctuation such as the interrobang, and invisible mathematical operators.

Additional punctuation characters are in the Supplemental Punctuation block and sprinkled in dozens of other Unicode blocks.

Block

Several characters in this block are usually not rendered with a directly visible glyph. Ten whitespace characters—U+2002 through U+200B (fixed en or 1⁄2 em, em, 1⁄3 em, 1⁄4 em, 1⁄6 em, figure and punctuation space, variable thin or 1⁄5 em and hair space, fixed zero-width space)—and U+205F (math medium or 2⁄9 em space) differ by horizontal width, while U+2000 and U+2001 (en and em quad) are effectively aliases of U+2002 and U+2003, respectively; another two, U+202F and U+2060 (ill-termed word joiner), are variants of U+2009 or U+2004 and U+200B that prohibit line breaks. Three zero-width characters, U+200B through U+200D (space, non-joiner and joiner), differ in how they affect ligation and shaping of adjacent letters such as contextual forms in Arabic. Eleven invisible characters—U+200E, U+200F (left-to-right and right-to-left mark), U+202A through U+202E (embeds, pops and overrides) and U+2066 through U+2069 (isolates)—control the directionality of text unless higher-level markup overrides them. There are explicit line and paragraph separators at U+2028 and U+2029.

Variation selectors

This block has variation sequences defined for East Asian punctuation positional variants of the curly quotation marks ‘...’ and “...”. (VS01) and (VS02) are used for East Asian punctuation positional variants. (VS03) is used for Sibe positional variants.

base + VS03‘︂’︂“︂”︂Sibe form

The non-fullwidth forms are expected to be separated with a space on one side, the fullwidth forms are not:

The red registration corners mark the glyph metrics and show how the glyph aligns within the space allotted to the character. For variable-width display (left), an adjacent space is expected; for full-width CJK display (right), a space is not necessary.

In vertical text, the fullwidth forms should display somewhat differently, and even as regular CJK quotation marks 「...」 and 『...』 if the vertical orientation property is set to "Hans":

CJK behaviour of generic quotation marks in horizontal and vertical text when variation selector VS02 is appended. The 'horizontal' column at left is the 'VS2' column of the preceding table.

Emoji

The General Punctuation block contains two emoji: U+203C and U+2049.

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.

base+VS16 (emoji)

History

The following Unicode-related documents record the purpose and process of defining specific characters in the General Punctuation block:

VersionCountUTC IDL2 IDWG2 IDDocument
1.0.0U+2000..202E, 2030..203E, 2040..204467
N4182
1.1U+203F, 2045..20463
U+206A..206F6(to be determined)
3.0U+202F, 2048..20493N1603
N1711
N1734
txt)doc)
html, txt)doc)
html)
N1862
N1865
N1918
html, doc)
N1973
html, doc)
N2020
N2003
N2126
N2103
N4182
N4752R2
N4753
N4763
N4769
U+204A1N1747
html)
html, Figure 1)
N1840
N1847
doc)
N1920
html, doc)
U+204B..204D3N1748
html)
html, Figure 1)
N1840
N1847
doc)
N1920
html, doc)
3.2U+2047, 20512
N2092
N2195
N2191R
txt)
N2257
N2278
N2253
U+204E..2050, 2057, 205F, 2061..20627
N2191R
txt)
U+2052, 20632N2336
N2356
doc)
U+20601
N2235
N2253
4.0U+2053..20542N2419
N2442
4.1U+20551
U+2056, 2058..20593N2610R
N2642
U+205A..205C3
U+205D1
N2642
U+205E1N2655
5.1U+20641N3198R
doc)
6.3U+2066..20694
N4310
N4447
{{reflistgroup=lower-alpharefs=

References

References

  1. "Unicode character database". The Unicode Standard.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard.
  3. Lunde, Ken. (2023-10-14). "L2/23-212R: Proposal to add standardized variation sequences for four quotation marks".
  4. CheonHyeong, Sim. (2025-01-08). "L2/25-028: Proposal to Add VS3 for Sibe Quotation Marks".
  5. (2023-09-05). "UTR #51: Unicode Emoji". Unicode Consortium.
  6. (2023-02-01). "UCD: Emoji Data for UTR #51". Unicode Consortium.
  7. "UTS #51 Emoji Variation Sequences". The Unicode Consortium.
  8. Proposed code points and characters names may differ from final code points and names
  9. See also [https://www.unicode.org/L2/L2010/10458-emoji-var.pdf L2/10-458], [https://www.unicode.org/L2/L2011/11414-emoji-var-seq.pdf L2/11-414], [https://www.unicode.org/L2/L2011/11415-unified-emoji-ref.pdf L2/11-415], and [https://www.unicode.org/L2/L2011/11429-emoji-var-seq-list.pdf L2/11-429]
  10. Refer to the [[Miscellaneous Symbols and Pictographs#History. history section]] of the Miscellaneous Symbols and Pictographs block for additional emoji-related documents
  11. Refer to the [[Miscellaneous Mathematical Symbols-B#History. history section]] of the Miscellaneous Mathematical Symbols-B block for additional math-related documents
Info: Wikipedia Source

This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.

Want to explore this topic further?

Ask Mako anything about General Punctuation — get instant answers, deeper analysis, and related topics.

Research with Mako

Free with your Surf account

Content sourced from Wikipedia, available under CC BY-SA 4.0.

This content may have been generated or modified by AI. CloudSurf Software LLC is not responsible for the accuracy, completeness, or reliability of AI-generated content. Always verify important information from primary sources.

Report