Libraries or tools for generating random but realistic text

I am looking for tools to generate random but realistic text. I myself implemented the Markov Chain text generator, and while the results were promising, my attempts to improve them did not give much success.

I would be pleased with tools that consume the corpus or work on the basis of context-sensitive or context-free grammar. I would like the tool to be suitable for inclusion in another project. Most of my recent work has been in Java, so a tool in this language is preferred, but I would be fine with C #, C, C ++, or even JavaScript.

This is similar to a question , but more in volume.

+5
source share
3 answers

Extending your own Markov chain generator is probably the best choice if you want “random” text. Creating something that has context is an open research problem.

Try (if not):

  • Indication of punctuation individually or inclusion of punctuation in the chain, if you have not already done so. This includes paragraph marks.
  • If you use a chain of Markovites with a 2- or 3-story, try to reset using a 1-story when you encounter complete stops or new lines.

Alternatively, you can use WordNet through two passes with your enclosure:

  • , .. , , . WordNet . (, ..) , . , " " "[] [] [] [ ()] [] []"
  • , [], [] [] .

: , , , . "" wordnet , , .


, , .

+6

- this Lorem ipsum generator? API.

0

All Articles