1. This issue was debated at length some time ago. I suggest that the people arguing for visual confusability as a criterion for matching look at that discussion in detail before proceeding.
(i) From observation, when a script has two cases, the upper-case forms are more likely than the lower-case ones to be highly stylized, and hence differentiated from characters in other scripts. Hence, if one is going to adopt stylization-based (glyph-distinction, if you prefer) canonicalization rules, one is better off treating upper case, rather than lower case, as the normal form.
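To make "upper case as the normal form" concrete, here is a minimal sketch in Python. The helper names canonical_upper and same_label are made up for illustration, and the choice of NFC normalization before case mapping is my assumption, not something settled in the earlier discussion. Note that this only does case mapping; it makes no attempt to judge visual confusability between scripts.

```python
import unicodedata


def canonical_upper(label: str) -> str:
    """Hypothetical helper: NFC-normalize a label, then map it to upper case.

    Assumption: upper case is the canonical (normal) form, as argued above.
    """
    return unicodedata.normalize("NFC", label).upper()


def same_label(a: str, b: str) -> bool:
    """Two labels match if their upper-case canonical forms are identical."""
    return canonical_upper(a) == canonical_upper(b)


if __name__ == "__main__":
    # Case variants of one string collapse to the same form.
    print(same_label("example", "EXAMPLE"))
    # Greek medial and final sigma share a single upper-case form.
    print(same_label("\u03c3", "\u03c2"))
    # Latin "a" and Cyrillic "a" stay distinct: case mapping does not cross scripts.
    print(same_label("a", "\u0430"))
```

Upper-casing also merges a few lower-case distinctions (the two Greek sigmas, or German sharp s becoming "SS"), which is one of the trade-offs anyone choosing upper case as the normal form would have to accept.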
-- http://sperling.com