dariadiatlova Goto Github PK

followers: 43.0 following: 40.0 repos: 58.0 gists: 1.0

Name: Daria Diatlova

Type: User

Company: @deepvk

Bio: voice dl researcher

Location: Saint-Petersburg

Blog: https://www.linkedin.com/in/daria-diatlova-09b589184/

Daria Diatlova's Projects

bayesian-methods-hse-fall-2021

Домашние задания к курсу ШАД БММО21

cv

homework assignments: computer vision course

data-efficient-gans

[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

dla

Deep learning for audio processing

dns-challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

drl.hse

homework assignments: deep reinforcement learning, spring 2021

dsp

Digital Signal Processing course

dtln

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

dul_2021

emo-tts-data

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

fre-gan

Test-task for VK-research internship 2022

frn

gas_data_analysis

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

hse-advanced-python

hse.numerical_methods

Homework assignments for numerical methods course

huawei.test_task

Binary classifier: male&female voice

human-spoof-voice-classifier

Test task for ID R&D

istftnet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

jbr_code_quality

Test task for the project: Code quality for online learning platforms Stepik and Hyperskill (JBR)

kazakh_tts

An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.

dariadiatlova Goto Github PK

Daria Diatlova's Projects

Recommend Projects

Recommend Topics

Recommend Org