Dataset link to sum text?

Does anyone have a download link for generalizing text like DUC 2007 or TREC? Please help me.

+4
source share
3 answers

You can use http://archive.ics.uci.edu/ml/datasets/Legal+Case+Reports to approach extraction-based generalization of text. It contains catchPhrase, which can act as the selected sentence for training. But the phrase may not be so appropriate.

+2
source

You can access the DUC dataset after completing some organization and individual agreements. http://www-nlpir.nist.gov/projects/duc/data.html for more information

+1
source

You can write a sitemap finder in scrapy for

This can give you about 1.45 million abstracts and articles.

You can also check this harvardnlp sent a summary of dataset and CNN Dailymail , which can give the story of some articles.

A warning. Since these are all different sources, their recording method may vary.

0
source

All Articles