zhu-gu-an
Type: User
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
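A lightweight speech recognition decoding and inference framework trimmed down from Kaldi; it currently implements MFCC+GMM+Viterbi and does not depend on libraries such as OpenFST or OpenBLAS
Deployable end-to-end ASR decoders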
Preprocess Audio for training
Scripts for automated construction of speech datasets
AVA-Speech dataset downloader
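A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.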
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Batch Support for OpenAI Whisper
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
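Predicts the level of noise and reverberation in your audio files
A bare-bones but handy offline version of CapsWriter, a voice input tool for PC
A simple local web interface that uses ChatTTS directly to synthesize text into speech, and also exposes an API for external use.
Chinese speech pretrained models
Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.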
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
A modification from Kaldi's ComputePitchFeats.
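Latest compilation (2021) of C++ learning resources, including new features in C++ 11 / 14 / 17 / 20 / 23, beginner tutorials, recommended books, quality articles, study notes, and teaching videos
A CTC decoder for both online and offline ASR models
🎉 CUDA notes / hand-written CUDA for large models / C++ notes, updated as time permits: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
C++ inference service for DAMO FSMN VAD
Noise suppression using deep filtering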
DeepSeek LLM: Let there be answers
Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
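A C++ project for ASR inference. It has few dependencies, is simple to install, runs inference quickly, and runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model, and the training data is the open-source wenetspeech dataset (10,000+ hours) or Alibaba's private dataset (60,000+ hours), so recognition quality is also good, comparable to many commercial ASR products.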
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
NLP text data