I do not think that there can be an analyzer that will work in all languages. The problem is that different languages โโhave different rules about word boundaries and their occurrence (for example, Thai does not use spaces to separate words). Or, if there is, of course, I do not want to be accompanying!
What you will need to do is โtagโ blocks of text as one or the other language and use the right parser for that particular language. You can try to detect the language โautomaticallyโ by performing a character analysis (ie, Text using primarily Japanese Katakana, probably Japanese).
Dean harding
source share