road2018 Goto Github PK

followers: 15.0 following: 35.0 repos: 1.1K gists: 0.0

Type: User

road2018's Projects

emotion-recognition-using-speech

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

emotional-speech-data

This is the GitHub page for publicly available emotional speech data.

emotionalconversionstargan

This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".

emovoice

Build your own Real-time Speech Emotion Recognizer

end-to-end-vad

an Audio-Visual Voice Activity Detection using Deep Learning

esc-50

ESC-50: Dataset for Environmental Sound Classification

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

essentia

C++ library for audio and music analysis, description and synthesis, including Python bindings

eusipco2017

The phoneme classification code for EUSIPCO 2017 paper: Timbre Analysis of Music Audio Signals with Convolutional Neural Networks

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

face-alignment

:fire: 2D and 3D Face alignment library build using pytorch

facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

fake-voice-detection

Using temporal convolution to detect Audio Deepfakes

fast-sentence-transformers

This repository, called fast sentence transformers, contains code to run 5X faster sentence transformers using tools like quantization and ONNX.

fast_rnnt

A torch implementation of a recursion which turns out to be useful for RNN-T.

fastasr

这是一个用C++实现ASR推理的项目，它依赖很少，安装也很简单，推理速度很快，在树莓派4B等ARM平台也可以流畅的运行。支持的模型是由Google的Transformer模型中优化而来，数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时)，所以识别效果也很好，可以媲美许多商用的ASR软件。

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

road2018 Goto Github PK

road2018's Projects

Recommend Projects

Recommend Topics

Recommend Org