I saw this post to do scrapy crawl of any site without permission of the allowed domains.
Is there a better way to do this, for example, using a regular expression in a valid domain variable, for example -
allowed_domains = ["*"]
Hope there is more to breaking the scrapy framework for this.
source
share