>>> import re
>>> words = 'Does not match Äh Oh Äi Üs üx Öjjj'
>>> re.findall(r"(\b[A-ZÜÖÄ][a-z.-]+\b)", words, re.UNICODE)
['Does', 'Äh', 'Oh', 'Äi', 'Üs', 'Öjjj']
Just add every Unicode capital letter outside the A-Z range to the first character class; I added only the German umlauts.
You can find all the non-ASCII letters (those outside A-Z) in the string as follows:
>>> [c for c in words if not c.isalpha() and not c.isdigit() and not c.isspace()]
['\xc3', '\x84', '\xc3', '\x84', '\xc3', '\x9c', '\xc3', '\xbc', '\xc3', '\x96']
Now you need to work out which of these are capital letters.
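A minimal sketch of that last step, assuming Python 3, where strings are Unicode and str.isupper() is Unicode-aware. The helper name non_ascii_capitals is hypothetical and only used for illustration; it collects the non-ASCII capitals from the text and builds the character class from them instead of typing them by hand:

```python
import re

words = 'Does not match Äh Oh Äi Üs üx Öjjj'


def non_ascii_capitals(text):
    """Return the sorted, de-duplicated non-ASCII uppercase letters in text.

    Hypothetical helper, not part of the original answer.
    """
    return sorted({c for c in text if ord(c) > 127 and c.isupper()})


capitals = non_ascii_capitals(words)

# Build the first character class from A-Z plus the capitals found above.
pattern = re.compile(r"\b[A-Z" + re.escape("".join(capitals)) + r"][a-z.-]+\b")
matches = pattern.findall(words)

print(capitals)
print(matches)
```

This only picks up capitals that actually occur in the sample text, so the sample has to be representative of the input you expect.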
Oz123