yogurt-cultures / kefir Goto Github PK
View Code? Open in Web Editor NEW🥛turkic morphology project
License: Other
🥛turkic morphology project
License: Other
Hi, thank you for this amazing project.
I want to use simple present tense for the verbs which are returning from morphological analysis without tense tag.
For example: 'gelmek' -> 'gelir', 'gitmek' -> 'gider'
Is there any way to do that with Kefir?
Summary of the project is confusing. Currently it seems to provide only NLG features.
When I try to install kefir I get this error:
Collecting kefir
Using cached https://files.pythonhosted.org/packages/ac/bc/2b33a0657cda729faedefb896b4e1315251dc38387763cf3973ca0b758c7/kefir-0.1.0.tar.gz
Complete output from command python setup.py egg_info:
Traceback (most recent call last):
File "", line 1, in
File "C:\Users\Username\AppData\Local\Temp\pip-install-xf5i6hha\kefir\setup.py", line 37, in
readme = f.read()
File "c:\users\username\appdata\local\programs\python\python36\lib\encodings\cp1254.py", line 23, in decode
return codecs.charmap_decode(input,self.errors,decoding_table)[0]
UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 2554: character maps to
----------------------------------------
Command "python setup.py egg_info" failed with error code 1 in C:\Users\username\AppData\Local\Temp\pip-install-xf5i6hha\kefir\
By using a dictionary you can identify exceptional cases which is not possible by just inspection of the letters.
One possiblity is zemberek: https://github.com/ahmetaa/zemberek-nlp/blob/master/morphology/src/main/resources/tr/master-dictionary.dict
These are the rules: https://github.com/ahmetaa/zemberek-nlp/wiki/Text-Dictionary-Rules
Zemberek has a binary version based on protocol buffers as well for pre processed attributes and fast loading. it should be possible to read it with python directly:
Proto:
https://github.com/ahmetaa/zemberek-nlp/blob/master/morphology/src/main/proto/lexicon.proto
Binary dictionary:
https://github.com/ahmetaa/zemberek-nlp/blob/master/morphology/src/main/resources/tr/lexicon.bin
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.