jweese / thrax
Offline extractor of synchronous context-free grammars for machine translation.
Home Page: http://cs.jhu.edu/~jonny/thrax/
License: Other
It'd be nice to mavenify this project.
It looks like the checked-in jerboa jar's source code has some modifications from this project's API. Would you mind posting the source you used to build that jar somewhere? Or at least the Signature/SLSH .java files?
My Hiero extraction always aborts a few seconds after starting in pseudo-distributed Hadoop mode. In the logs I see the following:
Running in mode: translation
[SCHED] planned job for class edu.jhu.thrax.hadoop.jobs.VocabularyJob
[SCHED] planned job for class edu.jhu.thrax.hadoop.jobs.ExtractionJob
[SCHED] planned job for class edu.jhu.thrax.hadoop.features.mapred.SourcePhraseGivenTargetFeature
[SCHED] planned job for class edu.jhu.thrax.hadoop.features.mapred.TargetPhraseGivenSourceFeature
[SCHED] planned job for class edu.jhu.thrax.hadoop.jobs.SourceWordGivenTargetWordProbabilityJob
[SCHED] planned job for class edu.jhu.thrax.hadoop.jobs.TargetWordGivenSourceWordProbabilityJob
[SCHED] planned job for class edu.jhu.thrax.hadoop.features.annotation.AnnotationFeatureJob
[SCHED] planned job for class edu.jhu.thrax.hadoop.jobs.OutputJob
[SCHED] class edu.jhu.thrax.hadoop.jobs.OutputJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.features.mapred.TargetPhraseGivenSourceFeature in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.ExtractionJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state READY
[SCHED] class edu.jhu.thrax.hadoop.features.annotation.AnnotationFeatureJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.SourceWordGivenTargetWordProbabilityJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state READY
[SCHED] class edu.jhu.thrax.hadoop.jobs.TargetWordGivenSourceWordProbabilityJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state READY
[SCHED] class edu.jhu.thrax.hadoop.jobs.ExtractionJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state READY
[SCHED] class edu.jhu.thrax.hadoop.features.mapred.SourcePhraseGivenTargetFeature in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.ExtractionJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state WAITING
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state READY
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state RUNNING
15/05/08 16:25:09 INFO input.FileInputFormat: Total input paths to process : 1
[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state FAILED
[SCHED] class edu.jhu.thrax.hadoop.jobs.SourceWordGivenTargetWordProbabilityJob in state PREREQ_FAILED
[SCHED] class edu.jhu.thrax.hadoop.features.annotation.AnnotationFeatureJob in state PREREQ_FAILED
[SCHED] class edu.jhu.thrax.hadoop.jobs.OutputJob in state PREREQ_FAILED
[SCHED] class edu.jhu.thrax.hadoop.jobs.TargetWordGivenSourceWordProbabilityJob in state PREREQ_FAILED
[SCHED] class edu.jhu.thrax.hadoop.jobs.ExtractionJob in state PREREQ_FAILED
[SCHED] class edu.jhu.thrax.hadoop.features.mapred.TargetPhraseGivenSourceFeature in state PREREQ_FAILED
[SCHED] class edu.jhu.thrax.hadoop.features.mapred.SourcePhraseGivenTargetFeature in state PREREQ_FAILED
class edu.jhu.thrax.hadoop.features.mapred.TargetPhraseGivenSourceFeature PREREQ_FAILED
class edu.jhu.thrax.hadoop.jobs.OutputJob PREREQ_FAILED
class edu.jhu.thrax.hadoop.features.annotation.AnnotationFeatureJob PREREQ_FAILED
class edu.jhu.thrax.hadoop.jobs.SourceWordGivenTargetWordProbabilityJob PREREQ_FAILED
class edu.jhu.thrax.hadoop.jobs.TargetWordGivenSourceWordProbabilityJob PREREQ_FAILED
class edu.jhu.thrax.hadoop.features.mapred.SourcePhraseGivenTargetFeature PREREQ_FAILED
class edu.jhu.thrax.hadoop.jobs.VocabularyJob FAILED
class edu.jhu.thrax.hadoop.jobs.ExtractionJob PREREQ_FAILED
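In the log above, only VocabularyJob reaches the FAILED state; every other job ends in PREREQ_FAILED, which by its name suggests those jobs never ran because a prerequisite failed. The scheduler lines do not say why VocabularyJob failed, so the Hadoop logs for that job are the place to look. A minimal sketch for pulling the root failure out of a longer log follows. It assumes the `[SCHED] class ... in state ...` line format shown above, and the function names are hypothetical, not part of Thrax.

```python
import re

# Matches scheduler lines of the form seen in the log above, e.g.
#   [SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state FAILED
SCHED_LINE = re.compile(r"^\[SCHED\] class (\S+) in state (\w+)$")


def final_states(log_lines):
    """Map each job class to the last state the scheduler logged for it."""
    states = {}
    for line in log_lines:
        match = SCHED_LINE.match(line.strip())
        if match:
            job, state = match.groups()
            states[job] = state  # later lines overwrite earlier states
    return states


def root_failures(log_lines):
    """Return jobs whose final state is FAILED, ignoring PREREQ_FAILED jobs."""
    states = final_states(log_lines)
    return sorted(job for job, state in states.items() if state == "FAILED")


# A few lines copied from the log above.
sample = [
    "[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state RUNNING",
    "15/05/08 16:25:09 INFO input.FileInputFormat: Total input paths to process : 1",
    "[SCHED] class edu.jhu.thrax.hadoop.jobs.VocabularyJob in state FAILED",
    "[SCHED] class edu.jhu.thrax.hadoop.jobs.ExtractionJob in state PREREQ_FAILED",
]
print(root_failures(sample))  # ['edu.jhu.thrax.hadoop.jobs.VocabularyJob']
```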
Hello,
There is an error when we build Thrax:
/thrax/Thrax.java:9: error: package org.apache.hadoop.conf does not exist
This is the setup:
#TEST ANT
export HADOOP=/data/max/hadoop-0.20.2/hadoop-0.20.2-core.jar
export HADOOP_VERSION=0.20.2
#END TEST ANT
or:
export HADOOP=/data/max/hadoop-0.20.2
export HADOOP_VERSION=0.20.2
export AWS_SDK=/data/max/aws-java-sdk-1.1.3
export AWS_VERSION=1.1.3
export JAVA_HOME=/usr/lib/jvm/java-8-oracle
export CLASSPATH=$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar:/data/max/hadoop-0.20.2/hadoop-0.20.2-core.jar
export PATH=$JAVA_HOME/bin:$PATH
Neither setup works.
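The "package org.apache.hadoop.conf does not exist" error means the compiler could not find the Hadoop classes on its classpath. How the Thrax build uses `$HADOOP` is not shown here, so the following is only a sanity check that the jar named in the setup exists and really contains that package. It is a sketch under assumptions: `jar_has_package` and `candidate_jar` are hypothetical helpers, and the `hadoop-<version>-core.jar` file name is taken from the CLASSPATH line above.

```python
import os
import zipfile


def jar_has_package(jar_path, package):
    """Return True if jar_path is a readable jar with a .class file under package."""
    if not os.path.isfile(jar_path) or not zipfile.is_zipfile(jar_path):
        return False
    prefix = package.replace(".", "/") + "/"
    with zipfile.ZipFile(jar_path) as jar:
        return any(
            name.startswith(prefix) and name.endswith(".class")
            for name in jar.namelist()
        )


def candidate_jar(hadoop, version):
    """If hadoop is a directory, guess the core jar inside it; else return it as-is."""
    if os.path.isdir(hadoop):
        # Assumed layout: <dir>/hadoop-<version>-core.jar, as in the CLASSPATH above.
        return os.path.join(hadoop, "hadoop-%s-core.jar" % version)
    return hadoop


# Reads the same variables the setup above exports.
jar = candidate_jar(os.environ.get("HADOOP", ""), os.environ.get("HADOOP_VERSION", ""))
if jar_has_package(jar, "org.apache.hadoop.conf"):
    print("found org.apache.hadoop.conf in", jar)
else:
    print("org.apache.hadoop.conf not found in", repr(jar))
```

If the package is found, the jar itself is fine and the next thing to check is whether the build's compile classpath actually includes it.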