I would probably look at the things of Gnoth Sable on your shoes.
However, for general lexical analysis, there is a project similar to Boot Spirit called OpenToken . For more complex tasks you may find it helpful.
I did not work with modern incarnation, but back when I was a leader, the project was an agnostic compiler.
source share