Topic: speech-recognition Goto Github

Some thing interesting about speech-recognition

👇 Here are 4632 public repositories matching this topic...

ahmetoner / whisper-asr-webservice

speech-recognition,OpenAI Whisper ASR Webservice API

User: ahmetoner

Home Page: https://ahmetoner.github.io/whisper-asr-webservice

asr automatic-speech-recognition docker openai-whisper speech speech-recognition speech-to-text

alan-ai / alan-sdk-android

speech-recognition,Conversational AI SDK for Android to enable text and voice conversations with actions (Java, Kotlin)

Organization: alan-ai

Home Page: https://alan.app/

alan-sdk android voice voice-assistant alan-voice alan-studio sdk alan-ai voice-commands voice-control

alan-ai / alan-sdk-flutter

speech-recognition,Conversational AI SDK for Flutter to enable text and voice conversations with actions (iOS and Android)

Organization: alan-ai

Home Page: https://alan.app

alan-sdk alan-studio chatbot voice voice-assistant voice-ai alan-voice flutter sdk voice-commands

alan-ai / alan-sdk-ios

speech-recognition,Conversational AI SDK for iOS to enable text and voice conversations with actions (Swift, Objective-C)

Organization: alan-ai

Home Page: https://alan.app

alan-ios-sdk alan-studio chatbot voice voice-assistant voice-ai alan-voice ios sdk voice-commands

alphacep / vosk-api

speech-recognition,Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Organization: alphacep

speech-recognition asr voice-recognition speech-to-text android ios raspberry-pi deep-learning deep-neural-networks speech-to-text-android

argmaxinc / whisperkit

speech-recognition,On-device Speech Recognition for Apple Silicon

Organization: argmaxinc

Home Page: https://takeargmax.com/blog/whisperkit

inference ios pretrained-models speech-recognition swift whisper transformers macos visionos watchos

astorfi / lip-reading-deeplearning

speech-recognition,:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

User: astorfi

deep-learning computer-vision speech-recognition 3d-convolutional-network tensorflow

cmusphinx / pocketsphinx

speech-recognition,A small speech recognizer

Organization: cmusphinx

c python speech-recognition

coqui-ai / stt

speech-recognition,🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Organization: coqui-ai

Home Page: https://coqui.ai

stt speech-to-text tensorflow deep-learning automatic-speech-recognition asr voice-recognition speech-recognition speech-recognizer speech-recognition-api

espnet / espnet

speech-recognition,End-to-End Speech Processing Toolkit

Organization: espnet

Home Page: https://espnet.github.io/espnet/

deep-learning end-to-end chainer pytorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization spoken-language-understanding text-to-speech

flashlight / wav2letter

speech-recognition,Facebook AI Research's Automatic Speech Recognition Toolkit

Organization: flashlight

Home Page: https://github.com/facebookresearch/wav2letter/wiki

cpp deep-learning end-to-end speech-recognition wav2letter

funaudiollm / sensevoice

speech-recognition,Multilingual Voice Understanding Model

Organization: funaudiollm

Home Page: https://funaudiollm.github.io/

ai asr gpt-4o speech-recognition speech-to-text aigc audio-event-classification cross-lingual llm python

ggerganov / whisper.cpp

speech-recognition,Port of OpenAI's Whisper model in C/C++

User: ggerganov

openai speech-to-text transformer whisper inference speech-recognition

huggingface / distil-whisper

speech-recognition,Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Organization: huggingface

audio speech-recognition whisper

huggingface / transformers

speech-recognition,🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Organization: huggingface

Home Page: https://huggingface.co/transformers

nlp natural-language-processing pytorch language-model tensorflow bert language-models pytorch-transformers nlp-library transformer

jianchang512 / stt

speech-recognition,Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务，输出json、srt字幕带时间戳、纯文字格式

User: jianchang512

Home Page: https://pyvideotrans.com

speech speech-recognition speech-to-text stt

julius-speech / julius

speech-recognition,Open-Source Large Vocabulary Continuous Speech Recognition Engine

Organization: julius-speech

speech recognition audio-processing speech-recognition

kaldi-asr / kaldi

speech-recognition,kaldi-asr/kaldi is the official location of the Kaldi project.

Organization: kaldi-asr

Home Page: http://kaldi-asr.org

kaldi c-plus-plus cuda shell speech-recognition speech-to-text speaker-verification speaker-id speech

kalliope-project / kalliope

speech-recognition,Kalliope is a framework that will help you to create your own personal assistant.

Organization: kalliope-project

Home Page: https://kalliope-project.github.io/

raspberry bot-creation jarvis personal-assistant linux speech-to-text speech-recognition speech-synthesis bot home-automation

kmario23 / deep-learning-drizzle

speech-recognition,Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

User: kmario23

Home Page: https://deep-learning-drizzle.github.io

machine-learning deep-learning deep-neural-networks pattern-recognition computer-vision optimization visual-recognition reinforcement-learning deep-reinforcement-learning natural-language-processing artificial-neural-networks artificial-intelligence-algorithms probabilistic-graphical-models bayesian-statistics speech-recognition graph-neural-networks medical-imaging geometric-deep-learning explainable-ai probability

leon-ai / leon

speech-recognition,🧠 Leon is your open-source personal assistant.

Organization: leon-ai

Home Page: https://getleon.ai

leon personal-assistant nodejs python ai artificial-intelligence speech-to-text text-to-speech speech-recognition speech-synthesis

linto-ai / whisper-timestamped

speech-recognition,Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Organization: linto-ai

deep-learning speech speech-recognition speech-to-text asr machine-learning python python3 pytorch attention-is-all-you-need

m-bain / whisperx

speech-recognition,WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

User: m-bain

asr speech speech-recognition speech-to-text whisper

mahmoudashraf97 / whisper-diarization

speech-recognition,Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

User: mahmoudashraf97

asr speaker-diarization speech speech-recognition speech-to-text whisper

modelscope / funasr

speech-recognition,A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Organization: modelscope

Home Page: https://www.funasr.com

conformer pytorch speech-recognition paraformer punctuation speaker-diarization rnnt audio-visual-speech-recognition pretrained-model voice-activity-detection

modelscope / funclip

speech-recognition,Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Organization: modelscope

speech-recognition video-clip video-subtitles subtitles-generator speech-to-text gradio gradio-python-llm llm

mozilla / deepspeech

speech-recognition,DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Organization: mozilla

deep-learning machine-learning neural-networks tensorflow speech-recognition speech-to-text deepspeech embedded on-device offline

speech-recognition,pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

User: mravanelli

speech-recognition gru dnn kaldi rnn-model pytorch timit deep-learning deep-neural-networks recurrent-neural-networks

nl8590687 / asrt_speechrecognition

speech-recognition,A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

User: nl8590687

Home Page: https://asrt.ailemon.net

tensorflow cnn ctc python keras speech-recognition speech-to-text chinese-speech-recognition asrt python3

nobody132 / masr

speech-recognition,中文语音识别; Mandarin Automatic Speech Recognition;

User: nobody132

chinese-speech-recognition mandarin-chinese pytorch speech-recognition

nvidia / deeplearningexamples

speech-recognition,State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Organization: nvidia

computer-vision deep-learning drug-discovery forecasting large-language-models mxnet paddlepaddle pytorch recommender-systems speech-recognition

openvinotoolkit / openvino

speech-recognition,OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

Organization: openvinotoolkit

Home Page: https://docs.openvino.ai

inference deep-learning openvino ai computer-vision diffusion-models generative-ai llm-inference natural-language-processing nlp

paddlepaddle / paddlespeech

speech-recognition,Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Organization: paddlepaddle

Home Page: https://paddlespeech.readthedocs.io

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr

pannous / tensorflow-speech-recognition

speech-recognition,🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

User: pannous

tensorflow speech-recognition neural-network deep-learning stt speech-to-text

picovoice / porcupine

speech-recognition,On-device wake word detection powered by deep learning

Organization: picovoice

Home Page: https://picovoice.ai/

wake-word-detection hotword keyword-spotting keyword-spotter wake-word wake-word-engine handsfree hotword-detection hotword-detector on-device

react-native-voice / voice

speech-recognition,:microphone: React Native Voice Recognition library for iOS and Android (Online and Offline Support)

Organization: react-native-voice

react-native android ios speech-recognition voice-recognition

rhasspy / rhasspy

speech-recognition,Offline private voice assistant for many human languages

Organization: rhasspy

Home Page: https://community.rhasspy.org/

voice-assistants speech-recognition home-assistant node-red privacy voice-commands

sanchit-gandhi / whisper-jax

speech-recognition,JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

User: sanchit-gandhi

deep-learning jax speech-recognition speech-to-text whisper

snakers4 / silero-models

speech-recognition,Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

User: snakers4

speech-recognition speech-to-text stt asr pretrained-models english german spanish stt-benchmark pytorch

speechbrain / speechbrain

speech-recognition,A PyTorch-based Speech Toolkit

Organization: speechbrain

Home Page: http://speechbrain.github.io

speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition

syhw / wer_are_we

speech-recognition,Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

User: syhw

deep-neural-network wer speech-recognition

systran / faster-whisper

speech-recognition,Faster Whisper transcription with CTranslate2

Organization: systran

deep-learning inference quantization speech-recognition speech-to-text transformer whisper openai

talater / annyang

speech-recognition,💬 Speech recognition for your site

User: talater

Home Page: https://www.talater.com/annyang/

speech-recognition speech speech-to-text voice

tensorflow / lingvo

speech-recognition,Lingvo

Organization: tensorflow

speech-recognition translation speech-to-text machine-translation mnist seq2seq language-model tts asr lm nlp tensorflow speech research distributed gpu-computing speech-synthesis

toverainc / willow

speech-recognition,Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

Organization: toverainc

Home Page: https://heywillow.io/

alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper

uberi / speech_recognition

speech-recognition,Speech recognition module for Python, supporting several engines and APIs, online and offline.

User: uberi

Home Page: https://pypi.python.org/pypi/SpeechRecognition/

python audio speech-recognition speech-to-text

wenet-e2e / wenet

speech-recognition,Production First and Production Ready End-to-End Speech Recognition Toolkit

Organization: wenet-e2e

Home Page: https://wenet-e2e.github.io/wenet/

asr automatic-speech-recognition conformer e2e-models production-ready pytorch speech-recognition transformer whisper

yanshengjia / ml-road

speech-recognition,Machine Learning Resources, Practice and Research

User: yanshengjia

machine-learning deep-learning nlp computer-vision speech-recognition tensorflow pytorch

zzw922cn / automatic_speech_recognition

speech-recognition,End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

User: zzw922cn

automatic-speech-recognition tensorflow timit-dataset feature-vector phonemes data-preprocessing rnn audio deep-learning lstm

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

speech-recognition,Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

User: zzw922cn

automatic-speech-recognition papers roadmap rnn cnn dnn attention-mechanism seq2seq acoustic-model timit-dataset

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Topic: speech-recognition Goto Github

👇 Here are 4632 public repositories matching this topic...

ahmetoner / whisper-asr-webservice

alan-ai / alan-sdk-android

alan-ai / alan-sdk-flutter

alan-ai / alan-sdk-ios

alphacep / vosk-api

argmaxinc / whisperkit

astorfi / lip-reading-deeplearning

cmusphinx / pocketsphinx

coqui-ai / stt

espnet / espnet

flashlight / wav2letter

funaudiollm / sensevoice

ggerganov / whisper.cpp

huggingface / distil-whisper

huggingface / transformers

jianchang512 / stt

julius-speech / julius

kaldi-asr / kaldi

kalliope-project / kalliope

kmario23 / deep-learning-drizzle

leon-ai / leon

linto-ai / whisper-timestamped

m-bain / whisperx

mahmoudashraf97 / whisper-diarization

modelscope / funasr

modelscope / funclip

mozilla / deepspeech

mravanelli / pytorch-kaldi

nl8590687 / asrt_speechrecognition

nobody132 / masr

nvidia / deeplearningexamples

openvinotoolkit / openvino

paddlepaddle / paddlespeech

pannous / tensorflow-speech-recognition

picovoice / porcupine

react-native-voice / voice

rhasspy / rhasspy

sanchit-gandhi / whisper-jax

snakers4 / silero-models

speechbrain / speechbrain

syhw / wer_are_we

systran / faster-whisper

talater / annyang

tensorflow / lingvo

toverainc / willow

uberi / speech_recognition

wenet-e2e / wenet

yanshengjia / ml-road

zzw922cn / automatic_speech_recognition

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Recommend Projects

Recommend Topics

Recommend Org