I am writing an application that takes data from a series of arbitrary RSS feeds. Channels are polled asynchronously in the background, and the method is called every time a new item is added to the feed.
My problem is identifying new items in the feed. What is the best way to do this? I came up with some ideas, but they are all spoiled.
Suggestion: every time you conduct a survey, continue all the newer than pubDate last item in the last survey Problem: pubDate is an optional field.
Suggestion: keep a hash of content for each item you return and not return content with the same hash Problem: Quickly gets out of control in terms of memory usage
language-agnostic c # rss
Martin
source share