Giter Club home page Giter Club logo

Bookbot Hive's Projects

auxiliaryasr icon auxiliaryasr

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

babygruut icon babygruut

A tokenizer, text cleaner, and phonemizer for many human languages.

cppflow icon cppflow

Run TensorFlow models in C++ without installation and without Bazel

datasets icon datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

doeloe icon doeloe

A library to convert Indonesian Republican Spelling System into EYD.

domba icon domba

Finetuning InstructLLaMA with Indonesian data

dreambooth-stable-diffusion icon dreambooth-stable-diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion (tweaks focused on training faces)

espnet icon espnet

End-to-End Speech Processing Toolkit

hifi-gan icon hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

k2-indonesian-asr icon k2-indonesian-asr

Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).

lexikos icon lexikos

Lexikos - λεξικός /lek.si.kós/ - A collection of pronunciation dictionaries and neural grapheme-to-phoneme models.

lhotse icon lhotse

Tools for handling speech data in machine learning projects.

matcha-tts icon matcha-tts

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

mnn icon mnn

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

nemo icon nemo

NeMo: a toolkit for conversational AI

nix-tts icon nix-tts

End-To-End SpeechSynthesis system with knowledge distillation

optimum icon optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

piper icon piper

A fast, local neural text to speech system

pl-bert icon pl-bert

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

sherpa-ncnn icon sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, etc.

sherpa-onnx icon sherpa-onnx

Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.