Giter Club home page Giter Club logo

python-myanmar's Introduction

Introduction

Build Status

Test Coverage

Pypi Version

This package contains a variety of python modules for Myanmar text proccesing, such as syllabification, romanization, encoding conversion, nrc validation etc. Only python3 is currently supported at the moment.

Installation

The package is distributed on PyPI and can be installed with pip:

pip install python-myanmar

For more information, please read the full documentation here.

python-myanmar's People

Contributors

knwin avatar laminko avatar naingyeminn avatar setkyar avatar trhura avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

python-myanmar's Issues

Convert Error

Cannot convert Unicode "ဪ" < Shift+] > to Zawgyi.

Cannot import modules

from myanmar.romanizer import romanize, MLC
from myanmar.romanizer import romanize, IPA

ImportError: cannot import name 'romanize'

from myanmar.romanizer import romanize, BGN_PCGN

work fine!

Can't convert...

First, thanks for the code...
It was amazing...

I trying to use your code for our project... and get follow error
I don't know what kind of error that is... can you point me out? Thanks

==============================================

converter.convert('vm;','wininnwa','zawgyi')
Traceback (most recent call last):

File "", line 1, in
converter.convert('vm;','wininnwa','zawgyi')

File "D:\fontchangetest\converter.py", line 35, in convert
from_encoding = encodingsfrom_encoding

File "D:\fontchangetest\encodings.py", line 103, in init
super ().init(*args, **kwargs)

File "D:\fontchangetest\encodings.py", line 14, in init
self.json_data = self.load_json (os.path.join(_ROOT, 'data', name))

File "D:\fontchangetest\encodings.py", line 69, in load_json
data = json.load (iFile)

File "C:\Users\phyokyi\AppData\Local\Continuum\Anaconda3\lib\json_init_.py", line 265, in load
return loads(fp.read(),

File "C:\Users\phyokyi\AppData\Local\Continuum\Anaconda3\lib\encodings\cp1252.py", line 23, in decode
return codecs.charmap_decode(input,self.errors,decoding_table)[0]

UnicodeDecodeError: 'charmap' codec can't decode byte 0x90 in position 94: character maps to

Font Detection

I've Some library about font detections.
As my code, before convert the font, it will look what font is it.
That mean

  • We can reduce cost of converting process
  • Prevent Double converting

Basic Font detect library

{
  unicode5: [
    "", "", "ည်", "န်", "င်", "ေး", "ော",
    "်း", "", "[ၐ-ၙ]", "^([က-အ]ြ|[က-အ]ေ)"
  ],
  zawgyi : [
    "ာ္", "်ာ", "\s(ျ|ေ|[ၾ-ႄ])[က-အ]",
    "^(ျ|ေ|[ၾ-ႄ])[က-အ]", "[က-အ]္[^က-အ]", "ဥ္",
    "္း" ,"[ါ-ူေ်း](ျ|[ၾ-ႄ])[က-အ]" ,"ံု",
    "[က-အ]္ေ" , "","\s", "ာေ",
    "[ါ-ူ်း]ေ[က-အ]", "ေေ", "ုိ", "္$"
  ]
}

My code written in JavaScript Look Here
Knayi-myscript

telenor prefix 0975x-xxx-xxx not supported

New telenor prefixes:

1,975 (0~2)xx -xxxx,WCDMA / GSM,10,Telenor Myanmar Limited
13,9753-xxx-xxx,WCDMA / GSM,10,Telenor Myanmar Limited
14,9754-xxx-xxx,WCDMA / GSM,10,Telenor Myanmar Limited
15,9755-xxx-xxx,WCDMA / GSM,10,Telenor Myanmar Limited
16,9756-xxx-xxx,WCDMA / GSM,10,Telenor Myanmar Limited
17,9757-xxx-xxx,WCDMA / GSM,10,Telenor Myanmar Limited
18,9758-xxx-xxx,WCDMA / GSM,10,Telenor Myanmar Limited
19,9759-xxx-xxx,WCDMA / GSM,10,Telenor Myanmar Limited

REF: ITU

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.