Regex.IsMatch( "foo", "[\U00010000-\U0010FFFF]" )
Throws: System.ArgumentException: parsing the "[-]" - [xy] range in reverse order.
Looking at the hexadecimal values for \ U00010000 and \ U0010FFF, I get: 0xd800 0xdc00 for the first character and 0xdbff 0xdfff for the second.
So, I really have one problem. Why are Unicode characters formed with \ U split into two lines per line?
c # regex unicode astral-plane
Ben mcniel
source share