Topic: speech Goto Github
Some thing interesting about speech
Some thing interesting about speech
speech,OpenAI Whisper ASR Webservice API
User: ahmetoner
Home Page: https://ahmetoner.github.io/whisper-asr-webservice
speech,AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Organization: aigc-audio
Home Page: https://huggingface.co/spaces/AIGC-Audio/AudioGPT
speech,🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
User: avinashkranjan
Home Page: https://amazing-python-scripts.avinashranjan.com
speech,🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
User: babysor
speech,SALMONN: Speech Audio Language Music Open Neural Network
Organization: bytedance
Home Page: https://bytedance.github.io/SALMONN/
speech,🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Organization: coqui-ai
Home Page: http://coqui.ai
speech,Community list of startups working with AI in audio and music technology
User: csteinmetz1
Home Page: https://csteinmetz1.github.io/ai-audio-startups/
speech,DELTA is a deep learning based natural language and speech processing platform.
Organization: delta-ml
Home Page: https://delta-didi.readthedocs.io/
speech,自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
User: dengbocong
speech,💬 SpeechGPT is a web application that enables you to converse with ChatGPT.
User: hahahumble
Home Page: https://speechgpt.app
speech,General Speech Restoration
User: haoheliu
Home Page: https://haoheliu.github.io/demopage-voicefixer/
speech,🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Organization: huggingface
Home Page: https://huggingface.co/docs/datasets
speech,VITS-based Voice Conversion focused on simplicity, quality and performance
Organization: iahispano
Home Page: https://applio.org
speech,Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Organization: idea-research
Home Page: https://arxiv.org/abs/2401.14159
speech,Free, easy, portable audio engine for games
User: jarikomppa
Home Page: http://soloud-audio.com
speech,Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式
User: jianchang512
Home Page: https://v.wonyes.org
speech,Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
User: jtkim-kaist
speech,Open-Source Large Vocabulary Continuous Speech Recognition Engine
Organization: julius-speech
speech,kaldi-asr/kaldi is the official location of the Kaldi project.
Organization: kaldi-asr
Home Page: http://kaldi-asr.org
speech,A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
User: kyubyong
speech,Tools for handling speech data in machine learning projects.
Organization: lhotse-speech
Home Page: https://lhotse.readthedocs.io/en/latest/
speech,Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Organization: linto-ai
speech,WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
User: m-bain
speech,Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
User: mahmoudashraf97
speech,Foundational model for human-like, expressive TTS
Organization: metavoiceio
Home Page: https://themetavoice.xyz/
speech,The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
User: miteshputhran
speech,ModelScope: bring the notion of Model-as-a-Service to life.
Organization: modelscope
Home Page: https://www.modelscope.cn/
speech,:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Organization: mozilla
speech,pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
User: mravanelli
speech,A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Organization: natspeech
speech,EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
User: netease-youdao
speech,Fully customizable AI chatbot component for your website
User: ovidijusparsiunas
Home Page: https://deepchat.dev
speech,Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
Organization: paddlepaddle
speech,Python library and CLI tool to interface with Google Translate's text-to-speech API
User: pndurette
Home Page: http://gtts.readthedocs.org/
speech,Praat: Doing Phonetics By Computer
Organization: praat
Home Page: http://www.praat.org
speech,A Python wrapper for Kaldi
Organization: pykaldi
Home Page: https://pykaldi.github.io
speech,Data manipulation and transformation for audio signal processing, powered by PyTorch
Organization: pytorch
Home Page: https://pytorch.org/audio
speech,WaveNet vocoder
User: r9y9
Home Page: https://r9y9.github.io/wavenet_vocoder/
speech,aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
User: readbeyond
Home Page: http://www.readbeyond.it/aeneas/
speech,Noise supression using deep filtering
User: rikorose
Home Page: https://huggingface.co/spaces/hshr/DeepFilterNet2
speech,Videos, notes and experiments to understand deep learning
User: roatienza
speech,Speech Enhancement Generative Adversarial Network in TensorFlow
User: santi-pdp
speech,Code examples for new APIs of iOS 10.
User: shu223
speech,Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
User: snakers4
speech,[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
User: sooftware
speech,SoftVC VITS Singing Voice Conversion
Organization: svc-develop-team
speech,:speech_balloon: Speech recognition for your site
User: talater
Home Page: https://www.talater.com/annyang/
speech,Lingvo
Organization: tensorflow
speech,基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
User: yeyupiaoling
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.