NOTES

This documentation is slightly ahead of the code; the "language" and "analyzer" options are not yet available.

News

The indexing API in 0.3 has changed since 0.2 to allow multiple design documents and "views" into Lucene. It will moves the Lucene-specific stuff into an options object.

Issue Tracking

Issue tracking at github.

System Requirements

Sun JDK 5 or higher is necessary. Couchdb-lucene is known to be incompatible with OpenJDK as it includes an earlier, and incompatible, version of the Rhino Javascript library.

Build couchdb-lucene

Install Maven 2.
checkout repository
type 'mvn'
configure couchdb (see below)

Configure CouchDB

[couchdb]
os_process_timeout=60000 ; increase the timeout from 5 seconds.

[external]
fti=/usr/bin/java -jar /path/to/couchdb-lucene*-jar-with-dependencies.jar -search

[update_notification]
indexer=/usr/bin/java -jar /path/to/couchdb-lucene*-jar-with-dependencies.jar -index

[httpd_db_handlers]
_fti = {couch_httpd_external, handle_external_req, <<"fti">>}

Indexing Strategy

Document Indexing

You must supply a index function in order to enable couchdb-lucene as by default, nothing will be indexed.

You may add any number of index views in any number of design documents. All searches will be constrained to documents emitted by the index functions.

Declare your functions as follows;

{
  "fulltext": {
    "by_subject": {
      "defaults": { "store":"yes" },
      "index":"function(doc) { var ret=new Document(); ret.add(doc.subject); return ret }"
    },
    "french_documents": {
      "defaults": { "language":"fr" },
      "index":"function(doc) { if (doc.language != "fr") { return null;} var ret=new Document(); etc return ret;  }"
    }
  }
}

A fulltext object contains multiple index view declarations. An index view consists of;

defaults: The default for numerous indexing options can be overridden here. A full list of options follows.
index: The indexing function itself, documented below.

name	description	available options	default
field	the field name to index under	user-defined	default
store	whether the data is stored. The value will be returned in the search result.	yes, no	no
index	whether (and how) the data is indexed	analyzed, analyzed_no_norms, no, not_analyzed, not_analyzed_no_norms	analyzed
analyzer	how the data is analyzed	auto, simple, standard	auto
language	which language the data is in	auto, br, cjk, cn, cz, de, el, en, fr, nl, ru, th	en

mlmiller / couchdb-lucene Goto Github PK

couchdb-lucene's Introduction

NOTES

News

Issue Tracking

System Requirements

Build couchdb-lucene

Configure CouchDB

Indexing Strategy

Document Indexing

The Defaults Object

The Document class

Example Transforms

Index Everything

Index Nothing

Index Select Fields

Index Attachments

A More Complex Example

Attachment Indexing

Supported Formats

Searching with couchdb-lucene

Special Fields

Dublin Core

Examples

Search Results Format

The search results array

Fetching information about the index

Working With The Source

Configuration

Basic Authentication

IPv6

couchdb-lucene's People

Stargazers

Watchers

Recommend Projects

Recommend Topics

Recommend Org