Comments (4)
I recently tried (with openseq2seq) the new decoder from mozilla-deepspeech which is based on this code https://github.com/parlance/ctcdecode/ . It can run on multiple cpus by default, if you feed batches of audio files, for trancriptions and the results I get are constantly better compared to the old decoder (without doing any hyperparameter tuning for alpa and beta, I just used the default values mozilla uses)
from openseq2seq.
@bill-kalog : Hi, I'm interested in your approach, could you please explain more about it? How you can use this ctcdecode combine with OpenSeq2Seq because I think the ctc-decode in OpenSeq2Seq is already built for their models. Which flags or hyperparameters have you replaced to use this ctc-decode?
from openseq2seq.
@ngochuyenluu, we already integrated Baidu beam search decoder in OpenSeq2Seq. That is the same decoder as in ctcdecode
project. Please see https://nvidia.github.io/OpenSeq2Seq/html/speech-recognition.html#decoders for more details.
from openseq2seq.
@vsl9 hello, can i use the baidu ctc decoder in training? and i also found that the baidu WarpCTC suppports gpu too, can we use it in this openSeq2Seq project?
from openseq2seq.
Related Issues (20)
- Unable to initialize FrameASR object (Trying to infer from Pre-trained model) HOT 1
- Choppy generation using pre-trained tacotron-gst model checkpoint HOT 1
- download language model issue
- Unreadable Output HOT 1
- Model training stops after 1 step for Speech to Text Jasper model HOT 1
- AttributeError: module 'tensorflow._api.v2.train' has no attribute 'SessionRunHook' HOT 3
- Deep speech 2 training time
- INFO: Skipping trie generation, since no custom TF op based CTC decoder found.
- Windows operating system support. HOT 1
- Making use of Language Model with CTC Decoder HOT 2
- How can we run inference with a pb file HOT 1
- Early Stopping
- Data_layer typo
- Empty string can be added to ngram vector
- How can we emit word confidence while decoding?
- Jasper 10x3 and 10x5 same size of models
- Compatibility with tensorflow 2.3
- Nemo & OpenSeq2Seq difference ? HOT 2
- Streaming example output stuck words HOT 1
- GPU is not being used?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from openseq2seq.