I am browsing some directories using ASP.NET programming through Scrapy.
Pages for slippage are encoded as such:
javascript:__doPostBack('ctl00$MainContent$List','Page$X')
where X is an int between 1 and 180. The argument of MainContent is always the same. I have no idea how to get into them. I would like to add something to the SLE rules as simple as allow=('Page$') or attrs='__doPostBack' , but I assume that I need to be more complex to get the information out of the javascript link.
If itβs easier for you to βexposeβ each of the absolute links from javascript code and save them in csv, then use this csv to load requests into a new scraper, which is also good.
source share