road2018 Goto Github PK
Type: User
Type: User
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
This is the GitHub page for publicly available emotional speech data.
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Build your own Real-time Speech Emotion Recognizer
an Audio-Visual Voice Activity Detection using Deep Learning
ESC-50: Dataset for Environmental Sound Classification
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
End-to-End Speech Processing Toolkit
C++ library for audio and music analysis, description and synthesis, including Python bindings
The phoneme classification code for EUSIPCO 2017 paper: Timbre Analysis of Music Audio Signals with Convolutional Neural Networks
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
:fire: 2D and 3D Face alignment library build using pytorch
腾讯优图高精度双分支人脸检测器
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Using temporal convolution to detect Audio Deepfakes
This repository, called fast sentence transformers, contains code to run 5X faster sentence transformers using tools like quantization and ONNX.
A torch implementation of a recursion which turns out to be useful for RNN-T.
这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
FastDVDnet: A Very Fast Deep Video Denoising algorithm
The Implementation of FastSpeech based on pytorch.
Library for fast text representation and classification.
80x faster and 95% accurate language identification with Fasttext
PytorchLightning porting of Facebook denoiser/demucs algorithm
Fully Convolutional DenseNets for semantic segmentation.
A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。
A speech dereverberation algorithm, also called wpe
Few-Shot Keyword Spotting
Mirror of https://git.ffmpeg.org/ffmpeg.git
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.