From what I see, it looks like TTS Builder accepts existing voices and allows you to change minor parameters to make a slightly different voice. But creating a voice with a different accent or pronunciation is, I think, more difficult.
From AT&T Research :
Creating high-quality voices requires a good voice, sound insulation, professional audio equipment, hours of written material with careful coverage of combinations of phonemes in the language, as well as time and experience, in order to turn these recordings into a worthy synthetic voice. Due to the costs involved, custom voice assemblies are usually performed for corporations that want to computerize an actorβs existing voice, for example, to continue branding.
...
Building a transformational model may require much less material than creating a TTS voice from scratch.
source share