Comments (4)

bill-kalog avatar bill-kalog commented on June 18, 2024

I recently tried (with OpenSeq2Seq) the new decoder from Mozilla DeepSpeech, which is based on this code: https://github.com/parlance/ctcdecode/. By default it can run on multiple CPUs if you feed it batches of audio files for transcription, and the results I get are consistently better than with the old decoder (without any hyperparameter tuning of alpha and beta; I just used the default values Mozilla uses).
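For readers unfamiliar with alpha and beta: in Baidu-style CTC beam search decoders, alpha is usually the language model weight and beta a word-count bonus. The sketch below shows how the two typically enter a hypothesis score. The function name `combined_score`, the candidate strings, and all numeric values are made up for illustration; they are not taken from ctcdecode, DeepSpeech, or OpenSeq2Seq.

```python
def combined_score(log_p_ctc, log_p_lm, word_count, alpha, beta):
    """Score a hypothesis as log P_ctc + alpha * log P_lm + beta * word_count.

    alpha weights the language model; beta rewards (or penalises) word count.
    """
    return log_p_ctc + alpha * log_p_lm + beta * word_count


# Hypothetical candidates: (text, acoustic log-prob, language-model log-prob).
candidates = [
    ("i scream", -4.0, -9.0),
    ("ice cream", -4.5, -5.0),
]


def best_candidate(alpha, beta):
    """Return the text of the highest-scoring candidate for given weights."""
    return max(
        candidates,
        key=lambda c: combined_score(c[1], c[2], len(c[0].split()), alpha, beta),
    )[0]


print(best_candidate(alpha=0.0, beta=0.0))  # acoustic score only
print(best_candidate(alpha=0.5, beta=1.0))  # language model included
```

With alpha set to 0 the acoustic score alone decides; raising alpha lets the language model overturn an acoustically likelier but less plausible transcript, which is why these two values are normally tuned per model.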

from openseq2seq.

ngochuyenluu avatar ngochuyenluu commented on June 18, 2024

@bill-kalog: Hi, I'm interested in your approach. Could you please explain more about it? How can you use ctcdecode in combination with OpenSeq2Seq? I thought the CTC decoder in OpenSeq2Seq was already built for its own models. Which flags or hyperparameters did you replace to use this decoder?

vsl9 avatar vsl9 commented on June 18, 2024

@ngochuyenluu, we have already integrated the Baidu beam search decoder into OpenSeq2Seq. It is the same decoder as in the ctcdecode project. Please see https://nvidia.github.io/OpenSeq2Seq/html/speech-recognition.html#decoders for more details.
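As a rough illustration of what a CTC beam search decoder does differently from greedy decoding, here is a minimal textbook prefix beam search without a language model. It is not the Baidu or ctcdecode implementation; the function names, the toy alphabet, and the probabilities are assumptions chosen for the example.

```python
from collections import defaultdict


def greedy_decode(probs, alphabet, blank=0):
    """Pick the argmax per frame, collapse repeats, then drop blanks."""
    best = [max(range(len(f)), key=f.__getitem__) for f in probs]
    out, prev = [], None
    for idx in best:
        if idx != prev and idx != blank:
            out.append(alphabet[idx])
        prev = idx
    return "".join(out)


def prefix_beam_search(probs, alphabet, beam_width=8, blank=0):
    """Sum over all alignments of each prefix, keeping the top beam_width."""
    # prefix -> [prob of paths ending in blank, prob of paths ending in non-blank]
    beams = {(): [1.0, 0.0]}
    for frame in probs:
        nxt = defaultdict(lambda: [0.0, 0.0])
        for prefix, (p_b, p_nb) in beams.items():
            for idx, p in enumerate(frame):
                if idx == blank:
                    nxt[prefix][0] += (p_b + p_nb) * p
                elif prefix and prefix[-1] == idx:
                    # A repeated symbol either merges into the same prefix...
                    nxt[prefix][1] += p_nb * p
                    # ...or starts a new one if a blank separated the two.
                    nxt[prefix + (idx,)][1] += p_b * p
                else:
                    nxt[prefix + (idx,)][1] += (p_b + p_nb) * p
        ranked = sorted(nxt.items(), key=lambda kv: sum(kv[1]), reverse=True)
        beams = dict(ranked[:beam_width])
    prefix, scores = max(beams.items(), key=lambda kv: sum(kv[1]))
    return "".join(alphabet[i] for i in prefix), sum(scores)


# Toy input: two frames over the alphabet [blank, "a"].
alphabet = ["_", "a"]
probs = [[0.6, 0.4], [0.6, 0.4]]
greedy_text = greedy_decode(probs, alphabet)
beam_text, beam_prob = prefix_beam_search(probs, alphabet)
```

On this toy input the blank is the most likely symbol in every frame, so greedy decoding yields an empty string, while the beam search sums the three alignments that collapse to "a" and prefers that transcript.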

ArtemisZGL avatar ArtemisZGL commented on June 18, 2024

@vsl9 Hello, can I use the Baidu CTC decoder during training? I also found that Baidu's WarpCTC supports GPU too. Can we use it in the OpenSeq2Seq project?
