I want to create some database test data, in particular table columns containing the names of people. To get a good idea of ββhow well indexing works with respect to name-based searches, I want to get as close as possible to real-world names and their true frequency distribution, for example. many different names with frequencies distributed over a power law distribution.
Ideally, I am looking for a freely available data file with names, followed by a single frequency value (or equivalent probability) for the name.
Names based on English-Saxon will be good, although names of other cultures will also be useful.
redcalx
source share