Large text file dictionary of random words for benchmarking?

Could anyone point me to a very large dictionary of random words for testing high-performance string data structures? I have found some in the range of ~2 MB, but I would like something larger if possible. I suppose there should be some kind of large standard raw data set that could be used. Thanks!

+4
2 answers

http://norvig.com/big.txt

The above link is mentioned in Norvig's spelling corrector article: http://norvig.com/spell-correct.html
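In case it helps, here is a minimal sketch of how a file like this could be turned into a word list and fed to a string container. It assumes big.txt has already been downloaded to the working directory; the helper names (`extract_words`, `load_words`, `time_inserts`) are made up for illustration, and the built-in `set` only stands in for whatever structure you actually want to benchmark.

```python
import re
import time


def extract_words(text):
    """Return lowercase ASCII-letter tokens in order of appearance."""
    return re.findall(r"[a-z]+", text.lower())


def load_words(path="big.txt"):
    """Read a local text file and tokenize it.

    Assumption: the file was downloaded beforehand; the default path is hypothetical.
    """
    with open(path, encoding="utf-8", errors="replace") as handle:
        return extract_words(handle.read())


def time_inserts(words, make_container=set):
    """Insert every word into a fresh container and return (container, seconds).

    make_container is any zero-argument callable whose result has an add() method.
    """
    container = make_container()
    start = time.perf_counter()
    for word in words:
        container.add(word)
    elapsed = time.perf_counter() - start
    return container, elapsed
```

Calling `time_inserts(load_words("big.txt"))` would then time the inserts for the whole file. Passing your own class as `make_container` lets you compare it against `set` on the same input. Note that a single timed run is noisy, so repeating the measurement a few times is probably worthwhile.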

+4

I would recommend looking at the materials available from TREC (Text REtrieval Conference). It offers some good datasets that may fit your requirements.

+1
