skroutz / elasticsearch-analysis-greeklish Goto Github PK
View Code? Open in Web Editor NEWA greeklish token filter for elasticsearch
Home Page: http://www.skroutz.gr/blog/team
A greeklish token filter for elasticsearch
Home Page: http://www.skroutz.gr/blog/team
Hi,
Although the plug in installs fine and logs show it's been loaded:
[2017-09-13T13:05:29,350][INFO ][o.e.p.PluginsService ] [PCName] loaded plugin [elasticsearch-analysis-greeklish]
when i try to use it on a custom analyzer, i get an error like:
{
"error": {
"root_cause": [
{
"type": "illegal_argument_exception",
"reason": "Unknown filter type [greeklish] for [greeklish_analysis]"
}
],
"type": "illegal_argument_exception",
"reason": "Unknown filter type [greeklish] for [greeklish_analysis]"
},
"status": 400
}
Here 's an example for an index that tries to use the plugin
PUT greek_sample
{
"settings": {
"analysis": {
"analyzer": {
"my_custom_analyzer": {
"type": "custom",
"tokenizer": "standard",
"filter": [
"greek_lowercase",
"greeklish_analysis"
]}
},
"filter": {
"greek_lowercase": {
"type": "lowercase",
"language": "greek"
},
"greeklish_analysis":{
"type": "greeklish",
"max_expansions": 15
}
}
}
},
"mappings": {
"person": {
"properties": {
"name": {
"type": "multi_field",
"fields": {
"greek": { "type": "string", "analyzer": "greek" },
"greeklish": { "type": "string", "analyzer": "my_custom_analyzer" }
}
}
}
}
}
}
Elasticsearch: 5.4.0
Plugin version: 5.4.0.1
Maybe we should consider the usage of mp
as b
. For example tamplet
should be equal to ταμπλετ
or tablet
.
What do you think;
Hi, I am trying o install your plugin and I get the following error:
ERROR: Unknown plugin gr.skroutz/elasticsearch-analysis-greeklish/0.11
The command o install plugins in the current elastic version is:
elasticsearch-plugin install gr.skroutz/elasticsearch-analysis-greeklish/0.11
I have also tried the following command which follows the current version's documentation with no luck.
elasticsearch-plugin install elasticsearch-analysis-greeklish
Please advise.
Thank you.
Even I have already use this mapping with ES2... Now when I'm trying to create this mapping with ES5, I'm getting the following error
{ "settings":{ "analysis":{ "analyzer":{ "greeklish_analyzer":{ "type":"custom", "tokenizer":"standard", "filter": ["lower_greek","greeklish_analysis"] }, "stem_analyzer":{ "type":"custom", "tokenizer":"standard", "filter": ["lower_greek","stem_greek"] } }, "filter": { "lower_greek": { "type":"lowercase", "language":"greek" }, "stem_greek": { "type":"skroutz_stem_greek" }, "greeklish_analysis":{ "type": "greeklish", "max_expansions": 15 } } } }, "mappings": { "shooters": { "properties": { "activated": { "type": "boolean" }, "company": { "type": "string", "analyzer": "stem_analyzer", "fields": { "english": { "type": "string", "analyzer": "english" }, "greeklish": { "type": "string", "analyzer": "greeklish_analyzer" } } }, "description": { "type": "string", "analyzer": "stem_analyzer", "fields": { "english": { "type": "string", "analyzer": "english" }, "greeklish": { "type": "string", "analyzer": "greeklish_analyzer" } } }, "email": { "type": "string" }, "name": { "type": "string", "analyzer": "stem_analyzer", "fields": { "english": { "type": "string", "analyzer": "english" }, "greeklish": { "type": "string", "analyzer": "greeklish_analyzer" } } }, "username": { "type": "string" } } } } }
ERROR:
{ "error": { "root_cause": [ { "type": "illegal_argument_exception", "reason": "Unknown filter type [greeklish] for [greeklish_analysis]" } ], "type": "illegal_argument_exception", "reason": "Unknown filter type [greeklish] for [greeklish_analysis]" }, "status": 400 }
I am trying to install in elastic 7.13.3 and I am facing the following error
-> Installing file:/root/org.elasticsearch-elasticsearch-7.13.3.zip
-> Downloading file:/root/org.elasticsearch-elasticsearch-7.13.3.zip
[=================================================] 100%
-> Failed installing file:/root/org.elasticsearch-elasticsearch-7.13.3.zip
-> Rolling back file:/root/org.elasticsearch-elasticsearch-7.13.3.zip
-> Rolled back file:/root/org.elasticsearch-elasticsearch-7.13.3.zip
Exception in thread "main" java.nio.file.NoSuchFileException: /usr/share/elasticsearch/plugins/.installing-6769439075775869049/plugin-descriptor.properties
at java.base/sun.nio.fs.UnixException.translateToIOException(UnixException.java:92)
at java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:106)
at java.base/sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:111)
at java.base/sun.nio.fs.UnixFileSystemProvider.newByteChannel(UnixFileSystemProvider.java:219)
at java.base/java.nio.file.Files.newByteChannel(Files.java:375)
at java.base/java.nio.file.Files.newByteChannel(Files.java:426)
at java.base/java.nio.file.spi.FileSystemProvider.newInputStream(FileSystemProvider.java:420)
at java.base/java.nio.file.Files.newInputStream(Files.java:160)
at org.elasticsearch.plugins.PluginInfo.readFromProperties(PluginInfo.java:173)
at org.elasticsearch.plugins.InstallPluginCommand.loadPluginInfo(InstallPluginCommand.java:786)
at org.elasticsearch.plugins.InstallPluginCommand.installPlugin(InstallPluginCommand.java:845)
at org.elasticsearch.plugins.InstallPluginCommand.execute(InstallPluginCommand.java:245)
at org.elasticsearch.plugins.InstallPluginCommand.execute(InstallPluginCommand.java:215)
at org.elasticsearch.cli.EnvironmentAwareCommand.execute(EnvironmentAwareCommand.java:75)
at org.elasticsearch.cli.Command.mainWithoutErrorHandling(Command.java:116)
at org.elasticsearch.cli.MultiCommand.execute(MultiCommand.java:80)
at org.elasticsearch.cli.Command.mainWithoutErrorHandling(Command.java:116)
at org.elasticsearch.cli.Command.main(Command.java:79)
at org.elasticsearch.plugins.PluginCli.main(PluginCli.java:36)
It seems that latest version of elastic requires the file plugin-descriptor.properties which is missing.
The same issue is facing while trying to install to elastic 7.15.2
lower_greek and stem_greek works fine... But "greeklish".. doesn't work...
Am I doing something wrong??
I'm trying to find "Dimitris" but never get name "Δημήτρης" to my results
{
"settings":{
"analysis":{
"analyzer":{
"stem_analyzer":{
"type":"custom",
"tokenizer":"standard",
"filter": ["lower_greek", "stem_greek","greeklish_analysis"]
}
},
"filter": {
"lower_greek": {
"type":"lowercase",
"language":"greek"
},
"stem_greek": {
"type":"skroutz_stem_greek"
},
"greeklish_analysis":{
"type": "greeklish",
"max_expansions": 15
}
}
}
},
"mappings": {
"shooters": {
"properties": {
"name": {
"type": "string",
"analyzer": "stem_analyzer"
}
}
}
}
}
Right now, «κινιτη» expands to «κινητος», «κινητους», «κινητεων», «κινητης» and «κινητων».
We would like to also get the forms «κινητο» and «κινητα», which are valid for this specific word but not for the general case of «-η» words (e.g. «ψυξη»).
Do you oppose the idea of adding exceptions to the general rules (as we already do in the various stemmers) to handle that kind of cases? My only worry is that I'm not sure whether this kind of behaviour is valid globally or tied to a specific use case (such as «κινητά τηλέφωνα»/«κινητή τηλεφωνία»). If so, maybe the exceptions could be supplied by the user in a text file (but this might be overengineering things a bit).
Would like to hear your thoughts @astathopoulos @chief
Hello. do you have a usage example or can you suggest something? i am new in ES ant try to use the plugin but i have stuck. i have installed successful but what next?
should i make a default settings for lowercase greeklish?
should the greek must put in the ES as lowercase?
is it able to make a short example?
Hi everyone!
I am trying to use the greeklish analyzer in ElasticSearch 6.7.1 but the analyzer keeps appending the analysis of former tokens.
For example i use the token "αυγο" and i get a first analysis.
Then is use the token "παιχνιδι" and i get the analysis of "παιχνιδι" but then appended in the results i see the analysis of "αυγό".
Can you help me?
Thank you in advance
Hello there, I tried installing this plugin on Raspbian Stretch with the relevant command
sudo /usr/share/elasticsearch/bin/plugin install gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1 -verbose
and I got the following error:
-> Installing gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1... Trying https://github.com/null/gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1/archive/master.zip... Failed: IOException[Can't get https://github.com/null/gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1/archive/master.zip to /usr/share/elasticsearch/plugins/gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1.zip]; nested: FileNotFoundException[https://github.com/null/gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1/archive/master.zip]; nested: FileNotFoundException[https://github.com/null/gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1/archive/master.zip]; Failed to install gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1, reason: failed to download out of all possible locations..., use -verbose to get detailed information
It seems this URL for the .zip is not correct. Any ideas on how to resolve this?
https://github.com/null/gr.skroutz:elasticsearch-analysis-greeklish:5.4.2.1/archive/master.zip
Hi,
It seems that the version 7.7.0 is valnurable to log4j attack.
Are there any plans to update to a newer and safer version (minimum version 7.16.1)?
Thanks
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.