entn-at Goto Github PK
Name: Ewald Enzinger
Type: User
Bio: Ph.D. EE (UNSW Sydney). ML, speaker recognition, speech recognition, speech synthesis, forensic voice comparison
Twitter: entn_at
Location: Portland, Oregon
Blog: https://entn.at/
Name: Ewald Enzinger
Type: User
Bio: Ph.D. EE (UNSW Sydney). ML, speaker recognition, speech recognition, speech synthesis, forensic voice comparison
Twitter: entn_at
Location: Portland, Oregon
Blog: https://entn.at/
Alignment files of LibriTTS.
End-to-end spoken language identification with TensorFlow 2
Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
A Light, Fast and Robust Speech Synthesis.
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Lightweight speaker anonymization [IEEE SLT2021]
Small compression utility
Improving generalization by controlling label-noise information in neural network weights.
Lingvo
Linphone.org mirror for linphone-android (git://git.linphone.org/linphone-android.git)
Linphone is a free VoIP and video softphone based on the SIP protocol. Mirror of linphone-iphone (git://git.linphone.org/linphone-iphone.git)
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
Audio-Visual Speech Recognition using Deep Learning
"LipNet: End-to-End Sentence-level Lipreading" in PyTorch
Lip Reading in the Wild using ResNet and LSTMs in PyTorch
Listen Attend and Spell (LAS) implement in pytorch
Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with Google's Cloud Speech API that are used in Live Transcribe.
MultiSpeaker Tacotron2 using LifeLong Learning.
ICASSP 2022
Tacotron2 Combine with Language Model (BERT).
A method to generate speech across multiple speakers
ctc releated
Efficient neural speech synthesis
Experimental Neural Net speech coding for FreeDV
Simulation of parallel synthesis with LPCNet vocoder
Tacotron2 + LPCNET for complete End-to-End TTS System
C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.
Local Pairwise Linear Discriminant Analysis
LRE i-vector classifier using Variational Information Bottleneck
Code for the paper "Latent Relation Language Models" at AAAI-20.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.