However, one avenue that might be worth exploring some more is to check
each registry's character table (for those that have one) and see what
the Unicode category is for each character. The Japanese Katakana middle
dot U+30FB has the category "Pc" which means "punctuation, connector"
and LDH's hyphen U+002D has the category "Pd" which means "punctuation,
dash".
http://www.unicode.org/Public/UNIDATA/UnicodeData.txt
http://www.unicode.org/Public/UNIDATA/UCD.html#General_Category_Values
If it turns out that all or most of the registries that have tables are
using characters with only a small number of Unicode categories, then we
may wish to consider moving IDNA to that set of categories (disallowing
all others). This would keep the registries happy while keeping *some*
of the phishy characters out of DNS.