Difference between revisions of "Surrogate Pair To UTF32 (LogiX node)"

From Neos Wiki
Jump to navigation Jump to search
(Marked this version for translation)
m (typo)
 
Line 17: Line 17:
  
 
<!--T:6-->
 
<!--T:6-->
The <code>HighSurrogate</code> input must be a character within Unicode's High Surrogate set (<code>0xD800</code> to </code>0xDBFF</code>/) and the <code>LowSurrogate</code> input must be a character within Unicode's Low Surrogate set (<code>DC00</code> to <code>DFFF</code>). For a detailed explanation of how exactly surrogate pairs match to a UTF-16 codepoint, see [https://www.unicode.org/versions/Unicode3.0.0/ch03.pdf Unicode Version 3.0.0 Chapter 3], section 3.7, D28.
+
The <code>HighSurrogate</code> input must be a character within Unicode's High Surrogate set (<code>0xD800</code> to <code>0xDBFF</code>) and the <code>LowSurrogate</code> input must be a character within Unicode's Low Surrogate set (<code>DC00</code> to <code>DFFF</code>). For a detailed explanation of how exactly surrogate pairs match to a UTF-16 codepoint, see [https://www.unicode.org/versions/Unicode3.0.0/ch03.pdf Unicode Version 3.0.0 Chapter 3], section 3.7, D28.
  
 
== Examples == <!--T:4-->
 
== Examples == <!--T:4-->

Latest revision as of 22:54, 11 September 2022

Other languages:
English
Surrogate Pair To UTF32
'Surrogate Pair To UTF32' LogiX node
Inputs
  Char HighSurrogate
  Char LowSurrogate
Outputs
  Int *

The Surrogate Pair To UTF32 node takes two characters within Unicode's Surrogate blocks and converts them to a UTF-32 character.

Usage

This node takes two characters as input and outputs a one-character string.

The HighSurrogate input must be a character within Unicode's High Surrogate set (0xD800 to 0xDBFF) and the LowSurrogate input must be a character within Unicode's Low Surrogate set (DC00 to DFFF). For a detailed explanation of how exactly surrogate pairs match to a UTF-16 codepoint, see Unicode Version 3.0.0 Chapter 3, section 3.7, D28.

Examples

In the following image, the high surrogate 55356 (0xD83C) and low surrogate 56621 (0xDD2D) are used to output the desired character, 🄭 (U+1F12D)

Logix Example SurrogatePairToUTF32.png

Node Menu

Characters
Back + From UTF16 From UTF32 Get Character Is Control Is Digit
Is Letter Is Letter Or Digit Is Lower Is Number Is Punctuation Is Separator Is Surrogate
Is Symbol Is Upper Is White Space String To UTF32 Surrogate Pair To UTF32 To Lower To Upper
To UTF16