Giter Club home page Giter Club logo

rime-tool's People

Contributors

osfans avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

rime-tool's Issues

請求幫助寫一個分類匯總的程式

百萬行數據,Excel分類匯總實在太慢了,
@osfans 大大有時間幫忙寫一個python3分類匯總的程式吧!感謝!


文檔內碼:GBK+UTF-8
文檔格式:字詞tab編碼
編碼格式:小狼毫格式(空格分隔)
編碼字符:字母、數字、拼音


好 hao
好 hao
工作 gong1 zuo4
工作 gong1 zuo4
工作 gong1 zuo4
樸 piáo
樸 piáo
樸 piáo
樸 piáo
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng


分類匯總結果:字詞tab編碼tab數量

好 hao 2
工作 gong1 zuo4 3
樸 piáo 4
這好像 zhè hǎo xiàng 6

问个词汇的出处

@osfans 问个词汇的出处:
在Rime菜鸟群/群文件/输入法素材
你发的"汉典词汇_带拼音-2015年11月.xlsx"文件,
搜索github.com和百度都没有“汉典词汇”的词。
能提供源(源文件)出处地址吗?
谢。

I have newly made a new input schema (Unicode Nushu)

https://github.com/chromezh/unicode_nushu


女书是流传在湖南江永县潇水流域的一种妇女专用文字。

女书是一种女人创造、女人使用、专门写女性生活与感情的文字。它记录的是当地的一种土话,有大约一千个单字,形体呈斜长的菱形框架,风格飘逸、舒展。女书是一种单音节文字,每个音节表示一组同音不同意义的语词。她像一朵野花,几百年深藏在湖南省都庞岭的偏僻乡村中,自生自长、自开自谢。就在她濒临灭绝的时候,1982 年被学者所发现。20 多年来,学者们为破译女书,研究女书,抢救女书,付出了艰苦的努力,做出了重大的贡献。

女书被收录进 Unicode 字符集,因此我使用 Rime 制作了 Unicode 女书输入法。


I have implemented a new input schema -- Unicode Nvshu Input Method. You are welcomed to include it to this Rime Collection.

Nüshu (simplified Chinese: 女书; traditional Chinese: 女書; pinyin: Nǚshū [nỳʂú]; literally: "women's script"), is a syllabic script derived from Chinese characters that was used exclusively among women in Jiangyong County in Hunan province of southern China. Nüshu has been included in the Unicode Standard since June 2017.

Unlike the standard written Chinese, which is logographic (with each character representing a word or part of a word), Nüshu is phonetic, with each of its approximately 600-700 characters representing a syllable. This is about half the number required to represent all the syllables in Tuhua, as tonal distinctions are frequently ignored, making it "the most revolutionary and thorough simplification of Chinese characters ever attempted". Zhou Shuoyi, described as the only male to have mastered the script, compiled a dictionary listing 1,800 variant characters and allographs.

fts3;windows下essay.txt等

我也来帮衬啦。

    1. Python 3.4.0(v3.4.0:04f714765c13)环境下,提示无fts3模型。google办法,替换最新版C:\Python 34\DLLs\sqlite3.dll 就好了。
    1. Windows下八股文在C:\\Program Files (x86)\\Rime\\weasel-0.9.30\\data\\trime-tool.py得对这种情况处理一下。
    1. dict里,
use_preset_vocabulary: true  
max_phrase_length: 7  

trime.db有40多M,怕会很卡。如果引入的八股文最大词长改成3,则有20多M。可能原因:为了覆盖地域更广,平均每个字的读音比较多,八股文自动编码的结果成倍增加。
继续跑去trime项目反馈一下。

    1. 你的readme里,没提怎么处理dict.yaml,其实是不用人处理的。但没动手之前会一直有疑问:“schema.yaml输入了命令行,那dict.yaml要不要?”

請求幫助寫一個篩選過濾的程式

@osfans 幫忙寫一個python3篩選過濾的程式!

文檔內碼:GBK+UTF-8
文檔格式:字詞tab編碼
編碼格式:小狼毫格式(空格分隔)
編碼字符:字母、數字、拼音


好 hao
好 hao
工作 gong1 zuo4
工作 gong1 zuo4
工作 gong1 zuo4
樸 piáo
樸 piáo
樸 piáo
樸 piáo
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng
這好像 zhè hǎo xiàng


篩選條件:


工作


篩選結果:

工作 gong1 zuo4
工作 gong1 zuo4
工作 gong1 zuo4
樸 piáo
樸 piáo
樸 piáo
樸 piáo

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.