wendongj Goto Github PK
Name: wendong
Type: User
Bio: work with attention
Name: wendong
Type: User
Bio: work with attention
A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
AEC Challenge
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
第一个开源的基于QLoRA的33B中文大语言模型First QLoRA based open source 33B Chinese LLM
A implement of adaptive score normalization (AS-Norm) in speaker verification/recognition with pytorch
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
🔊 Text-Prompted Generative Audio Model
Easy to use Beamformers for multi-channel speech separation/enhancement
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
Bert-Grad-Vocos-TTS
vits2 backbone with bert
Pytorch implementation of BigVSAN
从0到1构建一个MiniLLM
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。
ChatTTS is a generative speech model for daily dialogue.
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
Official PyTorch implementation of Contrastive Learning of Musical Representations
Conformer-based Metric GAN for speech enhancement
Fast Independent Vector Extraction: Code and data to reproduce the results from the paper.
Tensorflow implementation of Conformer - Transformer-based model for Speech Recognition
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.