Spider4j is an open-source web crawler for Java, extended from webmagic, that provides a simple interface for crawling the Web. With it, you can set up a multi-threaded web crawler in a few minutes.
littlesearch / spider4j-2
This project is forked from yida-lxw/spider4j.
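To make "multi-threaded web crawler" concrete, here is a minimal sketch of the general idea using only the JDK. It is not Spider4j's or webmagic's API: the class name MiniCrawler, the crawl method, and the pluggable link-extractor function are all hypothetical names chosen for illustration. The link extractor is injected so that the example runs against an in-memory link graph; a real crawler would fetch each page over HTTP and parse its links instead.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

// Hypothetical illustration only; NOT the Spider4j or webmagic API.
final class MiniCrawler {
    // Maps a URL to the links found on that page (assumed to be supplied by the caller).
    private final Function<String, List<String>> linkExtractor;
    private final int threads;

    MiniCrawler(Function<String, List<String>> linkExtractor, int threads) {
        this.linkExtractor = linkExtractor;
        this.threads = threads;
    }

    // Breadth-first crawl: every URL in the current frontier is processed in
    // parallel on the thread pool, then the newly discovered links form the
    // next frontier. Stops when no new links remain or maxPages is reached.
    Set<String> crawl(String seed, int maxPages)
            throws InterruptedException, ExecutionException {
        Set<String> visited = new LinkedHashSet<>();
        visited.add(seed);
        List<String> frontier = new ArrayList<>();
        frontier.add(seed);

        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            while (!frontier.isEmpty()) {
                List<Callable<List<String>>> tasks = new ArrayList<>();
                for (String url : frontier) {
                    tasks.add(() -> linkExtractor.apply(url));
                }
                List<String> next = new ArrayList<>();
                // invokeAll blocks until all tasks finish and returns
                // the futures in the same order as the tasks.
                for (Future<List<String>> result : pool.invokeAll(tasks)) {
                    for (String link : result.get()) {
                        // Deduplicate on the calling thread, so no locking is needed.
                        if (visited.size() < maxPages && visited.add(link)) {
                            next.add(link);
                        }
                    }
                }
                frontier = next;
            }
        } finally {
            pool.shutdown();
        }
        return visited;
    }

    public static void main(String[] args) throws Exception {
        // In-memory stand-in for the Web: page -> outgoing links.
        Map<String, List<String>> graph = Map.of(
                "a", List.of("b", "c"),
                "b", List.of("d"),
                "c", List.of("d", "a"),
                "d", List.of());
        MiniCrawler crawler =
                new MiniCrawler(url -> graph.getOrDefault(url, List.of()), 4);
        System.out.println(crawler.crawl("a", 10)); // prints [a, b, c, d]
    }
}
```

The sketch keeps the visited set on the calling thread and uses the pool only for the page-processing step, which avoids shared mutable state at the cost of waiting for each level to finish before starting the next.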