Question about start point of SDR (voicefilter, closed)

maum-ai commented on September 6, 2024

Question about start point of SDR

from voicefilter.

Comments (4)

seungwonpark commented on September 6, 2024

Hi, @lycox1
Thanks for your interest in the VoiceFilter open-source repo.

As discussed in #5, the SDR may differ significantly from the results in the README because it is measured on a random sample. Please refer to Jungwon Seo's comment here: #5 (comment)
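
For readers unfamiliar with the metric, here is a minimal sketch of a simplified signal-to-distortion ratio, defined as the energy of the clean reference divided by the energy of the residual error, in dB. This is an assumption-laden illustration: the function name `simple_sdr` is hypothetical, and the repository's own evaluation may use a different (for example BSS-eval style) definition, so the numbers will not necessarily match.

```python
import math


def simple_sdr(reference, estimate):
    """Simplified SDR in dB: 10 * log10(||ref||^2 / ||ref - est||^2).

    Hypothetical helper for illustration only; not the repository's metric.
    """
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    signal_energy = sum(r * r for r in reference)
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    if error_energy == 0:
        return math.inf  # perfect reconstruction
    if signal_energy == 0:
        return -math.inf  # silent reference, non-zero error
    return 10.0 * math.log10(signal_energy / error_energy)


if __name__ == "__main__":
    clean = [1.0, 0.0, -1.0, 0.0]
    separated = [0.9, 0.0, -0.9, 0.0]
    print(simple_sdr(clean, separated))
```

Because the value depends entirely on which utterances are scored, two runs that evaluate different random samples can report noticeably different averages.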


lycox1 commented on September 6, 2024

Thanks @seungwonpark
I have already read #5.

I think the key points of #5 are the following:

train-other-500 should not be used for training; use only train-clean-100 and train-clean-360.
--> I use train-clean-100, train-clean-360 and dev-clean.
Compare against the published samples (the original paper's samples: https://google.github.io/speaker-id/publications/VoiceFilter/).
--> I checked dev_tuples.csv and train_tuples.csv (https://github.com/google/speaker-id/tree/master/publications/VoiceFilter/dataset/LibriSpeech). Files in dev-clean do exist in dev_tuples.csv, but files in train-clean-100 and train-clean-360 do not exist in either dev_tuples.csv or train_tuples.csv.

Could you please let me know if you have any other clues?

Thanks.
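
The membership check described above (which local audio files are listed in a tuples CSV) can be sketched roughly as follows. This assumes each CSV cell holds an utterance path or ID whose last path component matches the audio file name without its extension; the actual layout of dev_tuples.csv and train_tuples.csv may differ, and the helper names are hypothetical.

```python
import csv
from pathlib import Path


def utterance_ids_in_csv(csv_path):
    """Collect the last path component (without extension) of every CSV cell.

    Assumption: cells look like 'dev-clean/1272/128104/1272-128104-0000'.
    """
    ids = set()
    with open(csv_path, newline="") as handle:
        for row in csv.reader(handle):
            for cell in row:
                cell = cell.strip()
                if cell:
                    ids.add(Path(cell).stem)
    return ids


def split_by_membership(audio_dir, csv_path, pattern="*.flac"):
    """Return (present, missing): file stems found / not found in the CSV."""
    listed = utterance_ids_in_csv(csv_path)
    present, missing = [], []
    for path in sorted(Path(audio_dir).rglob(pattern)):
        (present if path.stem in listed else missing).append(path.stem)
    return present, missing


# Example call (paths are placeholders):
# present, missing = split_by_membership("LibriSpeech/dev-clean", "dev_tuples.csv")
```

Running it once per subset (dev-clean, train-clean-100, train-clean-360) against each CSV would show at a glance which subsets the published tuples actually draw from.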


lawlict commented on September 6, 2024

Hello @seungwonpark, I have a similar problem to @lycox1. Could you please give me a hand?
I followed almost all the README steps, except that the LibriSpeech audio files have the .flac suffix, so I changed line 24 of normalize-resample.sh from
"for f in $(find . -name "*.wav"); do"
to
"for f in $(find . -name "*.flac"); do".

Since I cloned the newest code, train-other-500 had already been removed. By the way, I noticed that the README says the number of test cases is 1000, while the code uses only 100 test cases.

Here are the images of the training loss, test loss and test SDR in my experiment. Although the test data may be different, I believe that a correct training loss curve should be similar, right?


seungwonpark commented on September 6, 2024

Hi, @lawlict

The test loss curve may fluctuate because we didn't run the evaluation on a sufficient amount of data, so I think the curve may look a bit different.
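
To give a feel for how much the evaluation set size matters, here is a small sketch comparing the standard error of a mean score over 100 versus 1000 test cases. The per-utterance scores are synthetic (drawn from a Gaussian with made-up mean and spread), not measurements from this repository, and `standard_error` is a hypothetical helper.

```python
import math
import random
import statistics


def standard_error(values):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    return statistics.stdev(values) / math.sqrt(len(values))


# Synthetic per-utterance scores; mean 5.0 and spread 4.0 are made-up numbers.
rng = random.Random(0)
scores_1000 = [rng.gauss(5.0, 4.0) for _ in range(1000)]
scores_100 = scores_1000[:100]

if __name__ == "__main__":
    print("n=100 :", standard_error(scores_100))
    print("n=1000:", standard_error(scores_1000))
```

Since the standard error shrinks with the square root of the sample size, an average over 100 cases is roughly three times noisier than one over 1000, which is consistent with a visibly jumpier test curve.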

