Should the Circled and Halfwidth characters be in the same
equivalence class as regular Katakana?
Should the "dependent" property of small letters be tracked?
Should the a "root" property be tracked? That is do we treat the
three series "h", "b" and "p" as being in one family, or in
three separate families and mark "he" as the "root" or "parent"?
General question: When more than one consonant or vowel is common
for a letter, should we store all of them as a list vs selecting
the best one?