Difference between revisions of "unicode"

From Lojban
Jump to navigation Jump to search
m (Gleki moved page jbocre: Unicode to Unicode without leaving a redirect: Text replace - "jbocre: ([A-Z])" to "$1")
m (Conversion script moved page Unicode to unicode: Converting page titles to lowercase)
(No difference)

Latest revision as of 08:37, 30 June 2014

A mapping between each character used in writing (any kind of writing) and a number between 0 and 0x10FFFF = 1,114,111. Not all the numbers are used, and not all the writing systems are encoded yet.

For starters, all the numbers from 0xD800 to 0xDFFF (inclusive) are reserved for something called surrogates (which are used to encode the numbers >0xFFFF in pairs of 16-bit characters).

For lots more, see [1].