Regular expression with latin characters

Question

I have this regex:

if (cadena.matches("^[a-zA-Z ]+$")) return true;

It takes A to Z both lowercase and uppercase. Also accepts spaces.

But this only works for English. For example, in Catalan we have the symbol "ç". We also have characters with 'á', or 'à', etc.

Was there some kind of Google and I could not find a way to do this.

I found out that I can filter for UTF-8, but this will accept characters that are not actually letters.

How to implement this?

+11

Reinherd Jun 07 '13 at 9:36

source share

2 answers

Take a look at the documentation and use a class (for example, \p{InLatin1Supplemental} ).

-2

Uwe plonus Jun 07 '13 at 9:42

source share

mvp · Accepted Answer · 2013-06-07T09:42:43+0000

Use this regex:

 [\p{L}\s]+

\p{L} means any Unicode letter.