You should be careful using this; as you probably know, it has reduced precision. That is likely the reason gcc does not use it systematically.
There is a trick that is even mentioned in Intel's SSE manuals (I hope I remember correctly): the result of rsqrtss is only one Heron (Newton-Raphson) iteration away from the target. It may be that some gcc versions are able to inline that short iteration and others are not.
You could use the builtin, as MSN says, but you should definitely look up the specification on Intel's website to know what you are trading off.
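To make the refinement step concrete, here is a minimal sketch of what I mean, assuming an x86 target with SSE and the standard `<xmmintrin.h>` intrinsics. The function names `rsqrt_estimate` and `rsqrt_refined` are mine, purely for illustration; this is not what gcc itself emits, just the same idea written by hand.

```c
#include <xmmintrin.h>  /* SSE intrinsics: _mm_set_ss, _mm_rsqrt_ss, _mm_cvtss_f32 */

/* Raw hardware estimate of 1/sqrt(x) via rsqrtss.
 * Only about 12 bits of the result are accurate.
 * (Hypothetical helper name, for illustration only.) */
static inline float rsqrt_estimate(float x)
{
    return _mm_cvtss_f32(_mm_rsqrt_ss(_mm_set_ss(x)));
}

/* The estimate refined by one Newton-Raphson step:
 *     y' = y * (1.5 - 0.5 * x * y * y)
 * Assumes x is finite and strictly positive; for x == 0 the
 * refinement computes 0 * inf and yields NaN.
 * (Hypothetical helper name, for illustration only.) */
static inline float rsqrt_refined(float x)
{
    float y = rsqrt_estimate(x);
    return y * (1.5f - 0.5f * x * y * y);
}
```

If you need sqrt(x) rather than 1/sqrt(x), multiplying the refined value by x gives it for positive x. The result is close to single precision but not guaranteed to be correctly rounded the way sqrtss is, which is exactly the trade you should check against Intel's documentation.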