Skip to content

crawler-engine with HTTP, proxy, JS-Java Interoperability, MQ task consumption, dynamic crawler scripts execution. support deployment in distribution style.

Notifications You must be signed in to change notification settings

runjia1987/crawler-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

author
runjia.zhu
Dec 23, 2017
9db5b48 · Dec 23, 2017

History

50 Commits
Dec 23, 2017
Sep 11, 2017
Jan 22, 2016
Sep 11, 2017
Mar 4, 2017
Dec 23, 2017
Mar 18, 2017
Jan 27, 2016
Jan 15, 2016
Feb 17, 2017

Repository files navigation

crawler-engine

INTRODUCE

crawler-engine with HTTP, proxy, JS-Java Interoperability, MQ task consumption, dynamic crawler scripts execution. support deployment in distribution style.

REQUIREMENTS

  • JDK 8 (note: if require Nashorn feature ClassFilter, then 1.8u45+ is mandatory);
  • build tool: Maven 3.2.x+;
  • MQ service(Erlang OTP): Rabbitmq 3.6.0+;
  • Mozilla Rhino
  • Apache HttpComponents

RUN STEPS

  1. BUILD: mvn clean package -Dmaven.test.skip, should have generated target/engineClient.zip;
  2. PREPARE: follow the instructions of rabbitmq_quickstart.txt;
  3. TESTCASE: mvn test;
  4. RUN: java -jar engineClient.jar [clientName] in target directory.

About

crawler-engine with HTTP, proxy, JS-Java Interoperability, MQ task consumption, dynamic crawler scripts execution. support deployment in distribution style.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published