Giter Club home page Giter Club logo

sabag / hebmorph Goto Github PK

View Code? Open in Web Editor NEW

This project forked from synhershko/hebmorph

0.0 0.0 0.0 6.06 MB

This is an open-source effort for making Hebrew properly searchable by various IR software libraries, while maintaining decent recall, precision and relevancy in retrievals. Includes Hebrew Analyzer for Lucene, and already produces results for Hebrew texts which are much better than the default Lucene implementation.

Home Page: http://code972.com/hebmorph

License: Other

C 2.65% Java 97.35%

hebmorph's Introduction

HebMorph is an open-source effort for making Hebrew properly searchable by various IR software libraries, while maintaining decent recall, precision and relevancy in retrievals. All code and files are released under the GNU Affero General Public License version 3.

More details at http://code972.com/HebMorph

Maven Central Build Status

Lucene / Elasticsearch compatibility

Since March 2017 hebmorph-lucene is being released for every Lucene/Solr version, with a matching major version number (most of the time minor as well). Matching Elasticsearch plugin versions are also available, see https://github.com/synhershko/elasticsearch-analysis-hebrew.

hebmorph-lucene version Lucene version Elasticsearch version Release date
6.2.x 6.1.x -> 6.2.x 5.1.x -> 5.2.x 3/2017
6.0.0 6.0.x 5.0.x 1/2017
2.4.0 5.5.x 2.4.x
2.3.x 5.4.x 2.2.x -> 2.3.x 4/2/2016
2.2.x 5.3.x 2.0.x -> 2.1.x 4/2/2016
2.1.x 4.10.4 1.6 -> 1.7.x 4/2/2016
2.0.x 4.10.x 1.4.x, 1.5.x 24/3/2015
1.5.0 4.9.0 1.3.x 9/9/2014
1.4.x 4.8.x 1.x -> 1.2.x August 2014
1.3.x 4.6.x 0.90.8 -> 0.90.13 June 2014
1.2.0 4.5.x 0.90.6, 0.90.7 10/11/2013
1.1.0 4.4.0 0.90.3 -> 0.90.5
1.0.0 <= 4.3.0 <= 0.90.2

Tutorial for integrating HebMorph with Elasticsearch can be found here http://code972.com/blog/2013/08/129-hebrew-search-with-elasticsearch-and-hebmorph

Get it from Maven Central

For the analyzer support, get hebmorph-lucene:

        <dependency>
            <groupId>com.code972.hebmorph</groupId>
            <artifactId>hebmorph-lucene</artifactId>
            <version>6.6.0</version>
            <scope>compile</scope>
        </dependency>

Lucene.NET compatibility

The .NET version of the library is compatible with Lucene.NET version 3.0.3, but has some known bugs that were fixed in the Java version and haven't been ported back yet.

License

HebMorph is copyright (C) 2010-2015, Itamar Syn-Hershko. HebMorph currently relies on Hspell, copyright (C) 2000-2013, Nadav Har'El and Dan Kenigsberg (http://hspell.ivrix.org.il/).

It is released to the public licensed under the GNU Affero General Public License v3. See the LICENSE file included in this distribution. Note that not only the programs in the distribution, but also the dictionary files and the generated word lists, are licensed under the AGPL. There is no warranty of any kind for the contents of this distribution.

hebmorph's People

Contributors

synhershko avatar egozyn avatar itaifrenkel avatar itsadok avatar iprovalo avatar kirillkh avatar oferfort avatar roman4git avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.