How to parse a token from a string in C?

Is there one or two spaces between hello and the world?
can it be any number of spaces?
can include vertical spaces (\ n, \ f, \ v) or just horizontal ones (\ s, \ t, \ r)?
can include any space characters UNICODE?
if there were punctuation between the words ("hello world"), would punctuation be a separate marker, a part of "hello" or ignored?

As you can see, writing the right lexer is not easy, and strtok not the right lexer.

Other solutions may be single-character end machines that do exactly what you need, or a regular expression solution that makes word searching easier than spaces. There are many ways.

And, of course, it all depends on your actual requirements, and I don’t know them, so start with strtok . But it’s good to know the various limitations.

+3

Paul beckingham Feb 17 '09 at 22:09

source share

Keep in mind that strtok is very hard to understand because:

He changes the input
The delimiter is replaced by a null terminator
Combines adjacent delimiters and, of course,
Not stream protected.

You can read about this alternative .

+2

dirkgently Feb 17 '09 at 20:22

source share

Andrew Hare · Accepted Answer · 2009-02-17T19:27:27+0000

You want to use strtok - here is a good example.

How to parse a token from a string in C?

More articles: