On Wed, 2015-02-18 at 15:22 +0200, Radu Coravu wrote:
> One remark about this:
> > Well, one reason might be that oXygen (like e.g. JEdit) surreptitiously converts all your input to NFC.
> I do not know about JEdit but Oxygen preserves the text exactly in the
> way you type it.
JEdit does the same thing. Here's the output of the Unix hd command on a
UTF-8 file in which the word "été" was entered as decomposed characters
and then as composed characters. The hex 74 is "t" and 0a is the newline
character. The sequence 65 cc 81 is the decomposed "é" (with 65 being
the "e" character and the others the accent) and the sequence c3 a9 is
the composed "é".
00000000 65 cc 81 74 65 cc 81 0a c3 a9 74 c3 a9 0a |