How can I get crawler4j to load all links from a page faster?

Question

How can I get crawler4j to load all links from a page faster?

What I do:
- scan the page
- get all the links of the page, put them in the list
- launch a new crawler that visits all the links in the list
- download them

There must be a faster way when I can directly download links when visiting the page? thank!

+5

java crawler4j

seinecle Jan 10 '12 at 14:11

source share

2 answers

, , ( , ).

crawler4j . , , , , , , . , 1000 , 0,3 . , - 300 , . , .

- , , , , . , AWS ( ), , - , ( ISP, ).

, , , , , ( ) .

+2

jefflunt 10 . '12 14:34

Yasser · Accepted Answer · 2012-01-10T19:35:29+0000

crawler4j . . , . crawler4j shouldVisit. , true . , URL- true false.

URL-, true, , .

.

How can I get crawler4j to load all links from a page faster?

More articles: