Hi, this is much to simple because even in plain latin/german we have many homoglyphs:
- O0o , Il
And another point are not homoglyph but “umlaute” - ae instead of ä
- oe instead of ö
- ue instead of ü
Since this is an important topic i think it should be discussed in public and also make the base list public.
Also i do not have an list of all active Domains currently have been logged via CT it would be interesting
to know
- how many domain collisions exists if the domain are compared based on the described method.
- How many without strip the suffix.
- Not pointing to the same IP