Will the following 2 canonical link tags be seen by spiders as pointing to the same URL?
<link rel="canonical" href="http://www.example.com/&#375;" /> - encoded
<link rel="canonical" href="http://www.example.com/ŷ" /> - unencoded
&#375; is an HTML entity (a numeric character reference) that represents the Unicode character with code point 375 in decimal notation. In hexadecimal this is 0x177, so we are talking about U+0177, which is ŷ.
This means that both URLs are exactly the same, provided the page's character set is handled correctly.
If ŷ is displayed correctly in the browser in both cases, the character set is probably right, but you should make sure that it is.
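A quick way to check this yourself is to run both tags through an HTML parser and compare the resulting href values. The snippet below is only a sketch using Python's standard html.parser; the class name CanonicalCollector is made up for illustration, and a real spider may well use a different parser.

```python
from html.parser import HTMLParser


class CanonicalCollector(HTMLParser):
    """Hypothetical helper: collects the href of each <link rel="canonical">."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # Attribute values arrive with character references already decoded.
        attr_map = dict(attrs)
        if tag == "link" and attr_map.get("rel") == "canonical":
            self.hrefs.append(attr_map.get("href"))


# First tag uses the numeric character reference, second the literal character.
encoded = '<link rel="canonical" href="http://www.example.com/&#375;" />'
unencoded = '<link rel="canonical" href="http://www.example.com/\u0177" />'  # \u0177 is ŷ

collector = CanonicalCollector()
collector.feed(encoded + unencoded)
collector.close()
hrefs = collector.hrefs

print(hrefs[0] == hrefs[1])       # are the two canonical URLs identical?
print(hex(ord(hrefs[0][-1])))     # code point of the last character
```

After parsing, both tags yield the same string, because the entity is resolved before the URL is ever looked at.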
Not 100% sure, but I think both of them will point to the same URL. Keep in mind, though, that the W3C standards often recommend encoding links.
If you are declaring your HTML as UTF-8, the URLs are treated as the same.
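To illustrate why the declared character set matters for the unencoded variant, here is a small Python sketch (the variable names are just for illustration). It assumes the file is saved as UTF-8 and compares what a reader gets when decoding the same bytes as UTF-8 versus Latin-1.

```python
# Bytes of the literal character ŷ (U+0177) as stored in a UTF-8 file.
raw = "\u0177".encode("utf-8")

# Decoded with the right charset, the character survives intact.
as_utf8 = raw.decode("utf-8")

# Decoded as Latin-1, the same two bytes become two different characters.
as_latin1 = raw.decode("latin-1")

print(raw)
print(as_utf8 == "\u0177")
print(as_latin1)
```

With a wrong charset the literal ŷ turns into something else, while the numeric entity is unaffected because it is written in plain ASCII.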
Even though you can expect it to work in modern browsers, http://www.example.com/ŷ is an invalid URL.
You should always encode Unicode characters.
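Assuming "encode" here means percent-encoding the UTF-8 bytes of the character (that is my reading, the answer does not spell it out), a minimal Python sketch with the standard urllib.parse module could look like this:

```python
from urllib.parse import quote, unquote

# Path containing the non-ASCII character ŷ (U+0177).
path = "/\u0177"

# quote() percent-encodes the UTF-8 bytes and leaves "/" untouched by default.
encoded_path = quote(path)
url = "http://www.example.com" + encoded_path

print(url)
print(unquote(encoded_path) == path)  # decoding gives the original path back
```

The result is an ASCII-only URL that can be used as the href of the canonical link.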