Topic: speech-recognition Goto Github
Some thing interesting about speech-recognition
Some thing interesting about speech-recognition
speech-recognition,OpenAI Whisper ASR Webservice API
User: ahmetoner
Home Page: https://ahmetoner.github.io/whisper-asr-webservice
speech-recognition,Conversational AI SDK for Android to enable text and voice conversations with actions (Java, Kotlin)
Organization: alan-ai
Home Page: https://alan.app/
speech-recognition,Conversational AI SDK for Flutter to enable text and voice conversations with actions (iOS and Android)
Organization: alan-ai
Home Page: https://alan.app
speech-recognition,Conversational AI SDK for iOS to enable text and voice conversations with actions (Swift, Objective-C)
Organization: alan-ai
Home Page: https://alan.app
speech-recognition,Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Organization: alphacep
speech-recognition,On-device Speech Recognition for Apple Silicon
Organization: argmaxinc
Home Page: https://takeargmax.com/blog/whisperkit
speech-recognition,:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
User: astorfi
speech-recognition,A small speech recognizer
Organization: cmusphinx
speech-recognition,🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Organization: coqui-ai
Home Page: https://coqui.ai
speech-recognition,End-to-End Speech Processing Toolkit
Organization: espnet
Home Page: https://espnet.github.io/espnet/
speech-recognition,Facebook AI Research's Automatic Speech Recognition Toolkit
Organization: flashlight
Home Page: https://github.com/facebookresearch/wav2letter/wiki
speech-recognition,Multilingual Voice Understanding Model
Organization: funaudiollm
Home Page: https://funaudiollm.github.io/
speech-recognition,Port of OpenAI's Whisper model in C/C++
User: ggerganov
speech-recognition,Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Organization: huggingface
speech-recognition,🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Organization: huggingface
Home Page: https://huggingface.co/transformers
speech-recognition,Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式
User: jianchang512
Home Page: https://pyvideotrans.com
speech-recognition,Open-Source Large Vocabulary Continuous Speech Recognition Engine
Organization: julius-speech
speech-recognition,kaldi-asr/kaldi is the official location of the Kaldi project.
Organization: kaldi-asr
Home Page: http://kaldi-asr.org
speech-recognition,Kalliope is a framework that will help you to create your own personal assistant.
Organization: kalliope-project
Home Page: https://kalliope-project.github.io/
speech-recognition,Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
User: kmario23
Home Page: https://deep-learning-drizzle.github.io
speech-recognition,🧠 Leon is your open-source personal assistant.
Organization: leon-ai
Home Page: https://getleon.ai
speech-recognition,Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Organization: linto-ai
speech-recognition,WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
User: m-bain
speech-recognition,Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
User: mahmoudashraf97
speech-recognition,A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Organization: modelscope
Home Page: https://www.funasr.com
speech-recognition,Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Organization: modelscope
speech-recognition,DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Organization: mozilla
speech-recognition,pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
User: mravanelli
speech-recognition,A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
User: nl8590687
Home Page: https://asrt.ailemon.net
speech-recognition,中文语音识别; Mandarin Automatic Speech Recognition;
User: nobody132
speech-recognition,State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Organization: nvidia
speech-recognition,OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Organization: openvinotoolkit
Home Page: https://docs.openvino.ai
speech-recognition,Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Organization: paddlepaddle
Home Page: https://paddlespeech.readthedocs.io
speech-recognition,🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
User: pannous
speech-recognition,On-device wake word detection powered by deep learning
Organization: picovoice
Home Page: https://picovoice.ai/
speech-recognition,:microphone: React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Organization: react-native-voice
speech-recognition,Offline private voice assistant for many human languages
Organization: rhasspy
Home Page: https://community.rhasspy.org/
speech-recognition,JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
User: sanchit-gandhi
speech-recognition,Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
User: snakers4
speech-recognition,A PyTorch-based Speech Toolkit
Organization: speechbrain
Home Page: http://speechbrain.github.io
speech-recognition,Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
User: syhw
speech-recognition,Faster Whisper transcription with CTranslate2
Organization: systran
speech-recognition,💬 Speech recognition for your site
User: talater
Home Page: https://www.talater.com/annyang/
speech-recognition,Lingvo
Organization: tensorflow
speech-recognition,Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Organization: toverainc
Home Page: https://heywillow.io/
speech-recognition,Speech recognition module for Python, supporting several engines and APIs, online and offline.
User: uberi
Home Page: https://pypi.python.org/pypi/SpeechRecognition/
speech-recognition,Production First and Production Ready End-to-End Speech Recognition Toolkit
Organization: wenet-e2e
Home Page: https://wenet-e2e.github.io/wenet/
speech-recognition,Machine Learning Resources, Practice and Research
User: yanshengjia
speech-recognition,End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
User: zzw922cn
speech-recognition,Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
User: zzw922cn
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.