I am doing a project in the classification of news. Basically, the system will classify news articles on the basis of a predetermined topic (for example, sports, political, international). To create a system, I need free datasets for training the system.
So far, after a few hours of googling and links from here , the only suitable datasets I could find are this . Although it will be, I hope, enough, I think I will try to find more.
Note that the datasets I want are:
- Contains full news articles, not just the title
- In English
- In .txt format, not in XML or db format
Can someone help me?
source
share