Comments (5)
Yeah, this sounds like a good idea. One problem though, in JMdict database hiragana readings are not separated by kanji so this wasn't possible to implement at the time I wrote romanization algorithm. More recently I have implemented a kanji module (kanji.lisp) that has a function match-readings
that can potentially be used to resolve this issue.
from ichiran.
Oh. I didn't know that. I guess that makes what I had in mind quite difficult.
Special readings could be a problem. And then, what if (this is entirely fictional. I don't know if a real-world example exists) you have a word made up of two kanji, the first one could be read あ or あお and the latter can be read お or おう? If you only know that the entire word reads あおう, then that could be split into あ-おう or あお-う...
from ichiran.
I noticed that the traditional basic option on the site doesn't create the ō
. I'm wondering @tshatrov , is there a way to change the romanization settings using ichiran-cli (I'm specifically interested in doing that using the -f
option)?
from ichiran.
@tslater I think if you do (setf ichiran:*default-romanization-method* ichiran:*hepburn-basic*)
before building the executable, then -f
will use basic romanization.
from ichiran.
Looks like it is working. Thanks!
from ichiran.
Related Issues (20)
- Additional data getting inserted into json results HOT 1
- Paper/Explanation of algorithm used HOT 9
- Support for がい/かい suffix HOT 1
- JSON returned by ichiran/cli HOT 4
- 一箇年 and 堪へる are missing kana_text, causing internal server error HOT 2
- Used postgres version HOT 4
- Minor note about database_name HOT 2
- Include root word information for conjugated words in JSON
- ichiran-cli doesn't work HOT 8
- Support for んだ and んです suffix HOT 1
- ichiran gets 「1週間後 」wrong HOT 1
- Logging for Postgres queries HOT 4
- Docker entrypoint missing on Windows HOT 1
- `Database error 42P01: relation "kana_text" does not exist` from CLI due to `switch-conn-vars` HOT 2
- Improving expression detection HOT 1
- てもいい / でもいい dropping も out of data HOT 2
- Spliting words functionality HOT 6
- Docker compose issues with pg_restore and running tests/cli HOT 4
- Newest Ichiran with newest data seems to be failing 31 tests HOT 11
- Curious treatment of kanji-break list HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ichiran.