I repeat the DNA sequences, pulling fragments from 5-15 bases at a time into C ++ std :: string objects. Sometimes my string will contain a base without ATCG, and I want to take action when this happens. For example, I could see:
CTACGGTACGRCTA
Since there is an “R,” I want to acknowledge this case. I am familiar with regex, but people seem to recommend several different libraries. I have seen Boost, TR1 and others. Can someone suggest another way to catch my case or tell me which library should I use and why?
thanks
c ++ c regex bioinformatics
nedblorf
source share