Giter Club home page Giter Club logo

carmel-repository's Introduction

Carmel-Repository

This is a repository for all finite-state machines that are compatible with the Carmel toolkit (http://www.isi.edu/licensed-sw/carmel).

FSA Description
none yet none yet
WFSA Description
wfsa001 English letter 2-gram model, P(e)
wfsa002 English phoneme 3-gram model, P(epron); plus generic n-gram WFSA builder
wfsa003 Letter 3-gram models for dozens of languages, built from UNDHR data
wfsa004 Letter 3-gram model for English, built from Penn Treebank plaintext
wfsa005 Pronounceable grapheme segment 3-gram model for English, built from the aligned CMU Pronunciation Dictionary
wfsa006 Accepts any word of alphabet [A-Z] except for the word "FILTER." Includes code for building any such filter.
FST Description
fst001 English-to-Cipher letter substitutor, fully connected
fst002 English-to-Phonemes, built from CMU Pron Dict
fst002a English-to-Phonemes, with fixes
fst003 Pinyin-if-to-pinyin, bi-directional
WFST Description
wfst001 English-to-French word-to-word translator, P(f|e)
wfst002 French-to-English word-to-word translator, P(e|f)
wfst003 Pinyin-to-Chinese bi-directional P(p|c) and P(c|p)
wfst004 Pinyin-if-to-Epron, Pinyin initial/final to IPA (English Pronounciation)
wfst005 English-to-Syllable-Stress, includes original CMU pronunciation lexicon
wfst006 English phonemes to grapheme (spell out), useful for OOV
wfst007 English graphemes to pronounceable grapheme segment sequences
wfst008 Adds start and end tags to English pronounceable grapheme segment sequences
wfst009 English pronounceable grapheme segment sequences to phoneme (pronouncer), useful for OOV
Cascade Description
casc001 English-to-English word-to-word paraphraser, P(e|e)
casc002 English letter-substitution cipher solver
casc003 English-to-Chinglish
casc004 English word to pronounciation
casc005 Symbol clustering using EM (any # of clusters, any n-gram order)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.