[Development] Why can't QString use UTF-8 internally?

Rutledge Shawn Shawn.Rutledge at theqtcompany.com
Thu Feb 12 08:39:25 CET 2015


On 11 Feb 2015, at 18:15, Konstantin Ritt <ritt.ks at gmail.com> wrote:

> FYI: a Unicode codepoint != a character's visual representation. Moreover, a single character can be represented by a sequence of glyphs or, vice versa, a sequence of characters can be represented by a single glyph.
> QString (and every other Unicode string class in the world) represents a sequence of Unicode codepoints (in this or that UTF), not characters or glyphs - always remember that!

Is it impossible to convert some of the possible multi-codepoint sequences into single codepoints, or is it just that we prefer to preserve them, so that converting back to UTF yields the same bytes from which the QString was created?
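
To make the question concrete, here is a minimal sketch of what such a conversion looks like, using Python's standard unicodedata module rather than Qt, purely for illustration. The helper name compose() is hypothetical. It applies Unicode normalization form C (NFC), which merges a base character and a combining mark into one precomposed codepoint where one exists. The sketch suggests both halves of the question apply: composition only works for some sequences, and it does change the bytes you get back.

```python
import unicodedata


def compose(text: str) -> str:
    """Hypothetical helper: return text in Unicode normalization form C (NFC)."""
    return unicodedata.normalize("NFC", text)


# 'e' followed by U+0301 COMBINING ACUTE ACCENT: two codepoints, one visible character.
decomposed = "e\u0301"
composed = compose(decomposed)

# NFC merges the pair into the single precomposed codepoint U+00E9.
assert composed == "\u00e9"
assert len(decomposed) == 2 and len(composed) == 1

# Not every sequence has a precomposed form: 'q' + U+0301 stays two codepoints.
assert len(compose("q\u0301")) == 2

# The round trip is no longer byte-identical once the string has been composed.
assert decomposed.encode("utf-8") == b"e\xcc\x81"
assert composed.encode("utf-8") == b"\xc3\xa9"
```

In Qt the corresponding call would be QString::normalized() with QString::NormalizationForm_C. It is an explicit operation on a copy, so a QString that is never normalized keeps the codepoints it was created from.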


