A few weeks ago I started a beginner level Chinese (Mandarin) evening class at the local community college. We're about four weeks in and I'm really enjoying it, will definitely do more.
I thought it would be an interesting project to load a Chinese-English dictionary onto my Kindle for reference. I've already played with kindlegen, which takes a
(
Read more... )
I agree it's blurry though, e.g. there are Unicode symbols for "arrow pointing right" or whatever which have some inherent definitional restrictions on how they could reasonably look.
But thinking about this Han unification thing (about which I know virtually nothing but what I've read here), if it's analogous to the different appearances of "a" in different typefaces, then again, I don't see what the problem was in Greg's Kindle. It would be as if "a" showed up in Times Roman but not in Arial - we'd say "WTF, that font is broken, it doesn't have all the characters it should have!" :) Sure, you can't expect every font to have every character, but when they're kind of related in that way. E.g. a Polish font still has "v" and "q" even though the Polish language doesn't use those letters. Ah well, I can see it's one of those annoying "the real world language situation is too chaotic and messy to be captured in an elegant mathematical model" type deals. :)
Reply
It sounds like Greg's Kindle, for whatever reason, was looking for a Japanese font first, and then when that font didn't have glyphs for some of the code points in the Chinese text, it was failing to substitute them from some other font but just showing missing characters. I don't know why; he proposes it's to avoid mixing styles. Another possibility might be that it (or that particular piece of software) simply doesn't have font substitution implemented at all, as a matter of corner-cutting rather than intelligent design.
The number of characters used by Chinese and not Japanese is in the thousands, so we can't really expect a font intended for use with Japanese (which will contain the Japanese styles of the shared characters, and thus look wrong for Chinese whether it has full character coverage or not) to also include all the Chinese characters just for completeness; it's a much taller order than hoping for a Polish font to contain "v' and "q."
Reply
Leave a comment