Can someone tell me what this ascii character is?

I have this character appearing occasionally and I can not find it in the ascii table. I would like to run a filter by data before sending it to the database, but I should know what it is. Maybe someone can understand me. I use the wysiwyg editor, and that is where it comes from. The character appears very sporadically, but seems to appear more often than not when I do two \ r or backspace.

Here is a symbol

Γ‚ 

OK, it was suggested to change the content type to utf8 at the beginning of the document, but I still get these characters in the database. Here is the test after I added the content type

 adf af Γ‚ aafd aa aa a Γ‚ afa aΓ‚ adf 
+4
source share
5 answers

It is highly likely that this character is associated with UTF-8 encoding problems. Joel article Absolute minimum Every software developer Absolutely, should know positively about Unicode and character sets (no excuses!) , Of course, it is recommended to read in this case.

Filtering these characters before sending them to the database is, of course, the wrong thing.

In the case you mention, you are probably dealing with the character U + 00A0, which is the Unicode character for non-distribution. Bit chart for this symbol:

 1010 0000 

After UTF-8 encoding, where the encoded bytes look like

 110x xxxx 10xx xxxx 

where 'x' represents the bit of the Unicode character value, so U + 00A0 is encoded as:

 1100 0010 1010 0000 

which is equal to 0xC2 0xA0. Coincidentally, the second character is the same byte value as the original character you encoded (U + 00A0), while the first character is the view you see.

+16
source

This is "Latin Capitial A with Circumflex", Unicode U + 00C2 HTML

Wikipedia: http://en.wikipedia.org/wiki/%C3%82

+2
source

When I have this problem, the fix that works for me is based on @Greg's answer, given that:

0xC2 = 194, 0xA0 = 160,

In php:

 $output=str_replace(chr(194).chr(160), " ", $html); 

This returns Γ‚ with   they should have been.

+1
source

I am an OP. I was no longer logged in, but I returned to share this solution. The problem was actually a coding problem. I added:

 <html> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8"> 

After I did this, I noticed that I was still getting these funky characters in my database. Then I changed the encoding in the database table and did nothing. This left only the browser ... I checked the encoding in the browser and noticed that it uses ISO-8859-1. I changed the encoding in the browser to utf-8 and now it works fine. :)

Thanks to everyone who contributed.

0
source

I think you see the error I once experienced. ISO-8859-1 is actually a subset of Windows-1152 (I think it's 1152) for Western European languages. The problem is that browsers happily represent Windows-1152 characters when the web server accepts ISO-8859-1. This means that the browser sends invalid ISO-8859-1 data. This is what happened with my Windows installation, at least. I have seen this behavior in both IE and Firefox.

I had a problem with the wysiwyg editor where users pasted data from a Word document. This document will contain both hyphens and dashes. One of the characters will receive a filed fine. The other is garbage, because this symbol does not exist in ISO-8859-1 (I can’t remember what it is).

The .net structure we used did not help, as it did not complain about an invalid ISO character when converting to unicode.

0
source

All Articles