Stephane’s latest post talks about an attempt to use a mathematical algorithm to help decide on which TLDs to allow in the future.
The algorithm, which you can read more about here, works on the basis of comparing “strings” to see if they are confusing. It draws in part on the Levenshtein distance
It will be interesting to see how ICANN make use of this tool in the months ahead, but as Black points out, it only deals with visual similarities:
Note that the algorithm is not meant to consider phonetic
similarity. For example, “fish”, “phish”, and “fiche” sound alike, but
are visually distinct and unlikely to be confused.