Commit Graph

2 Commits

Author SHA1 Message Date
Linus Torvalds 8899ed4e1f Fix the unicode character limit (0 .. 0x10ffff)
For some reason I had limited things to 0xffff, it really should be 0x10ffff.

We don't actually support a full 32-bit unicode model anyway, since we
use the high bits for the control/meta/^X/special bits, but there was no
reason to limit things to 16 bits when we had 28 bits available.  And
the real limit for real Unicode characters is 0x10ffff.

Add a silly example character past the 16-bit range to the UTF8 demo
file:
  'SMILING FACE WITH HALO' (U+1F607)
from the 'emoticons' block.

Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2012-09-24 19:44:21 -07:00
Linus Torvalds 3e0492ffd4 Add a true UTF-8 demonstration file
We obviously don't handle it, but it's a wonderful example if we ever do.

Demo by Markus Kuhn from

	http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-demo.txt

Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2005-10-10 10:07:31 -07:00