Closed ezaspy closed 1 year ago
This is called Punycode which is a method of encoding strings containing Unicode characters into ASCII favored by DNS. By design CLI output displays domains in Unicode, while other output formats like CSV or JSON have domains encoded with Punycode.
Great! Managed to find a solution for it 👍 'string'.encode().decode('idna')
I have run the following query which returns results but in the instances of accented characters in the domain, the characters are replaced with a strange combination which I do not know why they are formatted like this, examples below. I am using VSCode on macOS to run the command and I thought it could be something to do with the editor or the OS. I have changed my locale settings but it yeilds the same results.
xn--pypl-2nad.com should be páypàl.com I had a hunch it was not displaying correctly and was confirmed by querying VT: https://www.virustotal.com/gui/domain/xn--pypl-2nad.com