You answered your own question in the comments. You can design one, but you are essentially sacrificing potential performance to simplify the design. A small variation on what you propose is called a barrel processor. Each cycle, the processor issues one instruction from a different thread, which simplifies the pipeline. The HEP architecture is another variation on this idea.
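To make the round-robin idea concrete, here is a rough simulation sketch of barrel-style issue. Everything in it is made up for illustration: the `HardwareThread` class, the `run_barrel` function, and the two-instruction "ISA" (`add`, `mul`) are hypothetical and do not model any real machine.

```python
from dataclasses import dataclass


@dataclass
class HardwareThread:
    """One hardware thread context: its own program, program counter, and accumulator."""
    program: list  # list of (op, arg) tuples; hypothetical toy instruction set
    pc: int = 0
    acc: int = 0

    def done(self):
        return self.pc >= len(self.program)


def step(thread):
    """Execute exactly one instruction of the given thread."""
    op, arg = thread.program[thread.pc]
    if op == "add":
        thread.acc += arg
    elif op == "mul":
        thread.acc *= arg
    else:
        raise ValueError(f"unknown op: {op}")
    thread.pc += 1


def run_barrel(threads):
    """Issue one instruction per cycle, rotating strictly through the threads.

    Returns a trace of (cycle, thread_id) pairs for the slots that issued.
    A finished thread still owns its slot, so that cycle is simply idle.
    """
    trace = []
    cycle = 0
    while not all(t.done() for t in threads):
        tid = cycle % len(threads)  # strict round-robin selection
        thread = threads[tid]
        if not thread.done():
            step(thread)
            trace.append((cycle, tid))
        cycle += 1
    return trace


threads = [
    HardwareThread([("add", 1), ("mul", 3)]),
    HardwareThread([("add", 2), ("add", 2), ("add", 2)]),
]
trace = run_barrel(threads)
```

In a real barrel design with at least as many threads as pipeline stages, two instructions from the same thread are never in flight at once, so no forwarding or hazard interlocks are needed between them. The price is visible in the sketch too: each thread gets only every N-th cycle, and a slot goes unused once its thread has nothing to run.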