The HelloQA pipeline works out of the box, but you'll need to download the Gutenberg index input data separately (it's a big file and doesn't make sense to keep it in the repository, bad practice and all).
!!! Note that the following is obsolete. Dataset is included in the repo. See also https://github.com/oaqa/helloqa/wiki/DSO-Project !!!
-
Download the Gutenberg index here.
-
Untar it.
-
Copy the resulting files into the data directory inside your HelloQA project such that your helloqa/data/guten directory contains bin, conf, data, lib, etc.
You're now ready to run the pipeline.
-
Open project in Eclipse.
-
Expand the directory tree under the project and look for "launches".
-
Expand "launches" and right-click on test.launch > Run As > test. This launch should now be the default behavior whenever you click the "Run" button in the toolbar while you're in the helloqa project.
As an alternative, you may run the project outside of Eclipse. Simply issue the command:
mvn exec:exec -Dexec.executable=java -Dexec.args="-Djava.library.path=lib/ -classpath %classpath edu.cmu.lti.oaqa.ecd.driver.ECDDriver phases.err-analysis-IE-dsoqa"
You may vary the last argument, see the src/main/resources/phases/ directory.