I want to print emojis from python (3) src
I am working on a project that analyzes Facebook Message Stories and in the downloaded htm raw file I find a lot of emoji like question mark boxes, how it happens when the value cannot be displayed. If I copy these characters to the terminal as strings, I get values ββlike \U000fe328 . This is also the output that I get when I run htm files through BeautifulSoup and output the data.
I searched for this line (and others), and sequentially one of the only sites that come with them is iemoji.com, in the case of the line above, this page listing the line as Python Src. I want to be able to print these lines as their respective emojis (after all, they were originally emojis when messaging), and after inspecting I found the src encodings mapping on this page that matched the above lines with the emoji line names. Then I found these emoji string names in the Unicode list , which for the most part seem to map emoji names to Unicode. If I try to print these values, I get a good result. As after
>>> print(u'\U0001F624') π€
Is there any way to match these "Python src" encodings with their unicode values? Linking both libraries will work if not so that the original src mapping is missing about 50% of the unicode values ββfound in the unicode library. And if I still have to do this, is there a good way to find the Python Src value of a given emoji? From my testing emoji how strings are equal to their Unicode, like 'π€' == u'\U0001F624' , but I can't get any relation to \U000fe328
source share