Comments (6)
I manage to get some progress. Now I training on data from LibriSpeech train-clean-100 and train-clean-360 and testing on train-dev-clean. After 40k steps the SDR reached only to ~5. Is it possible that it is related to the batch size that I am using (6)?
Another question - what is the the learning rate policy? Did you fixed it on 1e-3 throughout the whole training or updated it?
Thanks.
from voicefilter.
from voicefilter.
Hi Morank88 - did you get any improvement on your SDR? SDR I'm getting is much worse than even you're getting:
I've tried a number of runs (firstly, I had smaller batch size to run on lesser GFX card, but the run above was at as-downloaded batch size on amazon EC2 instance with NV100). Only differences are that I made the test sample 1000 (it's 100 in the code, but comment in the Readme mentioned 1000? maybe I'll change it back to the 100 as downloaded, and run again...) - and I have some likely more up-to-date python libraries (couldn't seem to find compatible torch 1.0.1 for example) - any suggestions?
Thanks in advance,
Nat
from voicefilter.
Hi,
Thank you for publishing your code!
I am encountering a training problem. As an initial phase I have tried to train only on 1000 samples from LibriSpeech train-clean-100 dataset. I am using the default configuration as was published in your VoiceFilter repo. The only difference is that I used batch size of 6 due to memory limitations. Is it possible that the problem is related to the small batch size that I use?Another question is related to the generation of the training and testing sets. I have noticed that there is an option to use a VAD for generating the training set but by default it is not used. What is the best practice? to use the VAD or not?
I appreciate your help!
hi,Can you share your settings, I run the same situation , thanks
from voicefilter.
I have also the same question.
The best result is 8 of SDR.
from voicefilter.
i meet same question with you,how did you solve this?
from voicefilter.
Related Issues (20)
- how to create file embedder HOT 5
- Question about start point of SDR HOT 4
- Question about normalize-resample.sh HOT 4
- Question when preprocessing wav files HOT 1
- Question when training VoiceFilter HOT 1
- Question about utils/evaluation.py HOT 2
- Need to try power-law compression loss HOT 2
- embedder.pt with new dataset HOT 4
- Can you get the initial mean SDR on LibriSpeech using Google's test list? HOT 8
- hop_length and win_length
- the model implementation comprehension HOT 1
- inference
- Can I get the pretrained model please! I so dearly need it for my project, here's my email just in case, [email protected] HOT 1
- Is the VoiceFilter model checkpoint available to be used directly? HOT 1
- question about ffmpeg-normalize
- Question about wav2spec function in utils/audo.py
- Cannot reproduce reported SDR & retrain the speaker embedding
- how to work for multi noise
- What is the term spk refers to in the below code ? line 127
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from voicefilter.